Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions Paper • 2606.09076 • Published 5 days ago • 55
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 4 days ago • 40
ProEdit: Inversion-based Editing From Prompts Done Right Paper • 2512.22118 • Published Dec 26, 2025 • 19