Hugging

ChenDRAG

authored 5 papers 13 days ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 34

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23, 2025 • 4

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 130

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 183

Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

Paper • 2510.08431 • Published 13 days ago • 8

authored a paper 29 days ago

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Paper • 2509.16117 • Published Sep 19, 2025 • 20

authored a paper 9 months ago

Visual Generation Without Guidance

Paper • 2501.15420 • Published Jan 26, 2025 • 8

authored 6 papers about 1 year ago

Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning

Paper • 2304.12824 • Published Apr 25, 2023

Score Regularized Policy Optimization through Diffusion Behavior

Paper • 2310.07297 • Published Oct 11, 2023 • 1

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 1

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Paper • 2410.07864 • Published Oct 10, 2024 • 1

Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control

Paper • 2407.09024 • Published Jul 12, 2024

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Paper • 2410.09347 • Published Oct 12, 2024 • 5