1 7 2

ChengpengLi

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 2 days ago

Enabling Scalable Oversight via Self-Evolving Critic

upvoted a paper 2 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

upvoted a paper 24 days ago

Qwen2.5 Technical Report

View all activity

Organizations

None yet

ChengpengLi's activity

upvoted 2 papers 2 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 6 days ago • 62

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 3 days ago • 67

upvoted a paper 24 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 28 days ago • 340

liked a Space 3 months ago

Running

187

🧮

Qwen2.5 Math Demo

upvoted 2 collections 4 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated 2 days ago • 62

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 47

liked a model 5 months ago

Qwen/Qwen2-Math-72B

Text Generation • Updated Aug 8, 2024 • 104 • 28

authored 2 papers 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 18

upvoted 2 papers 6 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 18

authored a paper 7 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16