None defined yet.
G²RPO: Granular GRPO for Precise Reward in Flow Models
A long-context, multimodal document understanding benchmark