Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper 1 day ago

Soft Adaptive Policy Optimization

authored a paper 1 day ago

Soft Adaptive Policy Optimization

authored a paper 1 day ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted a paper 1 day ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 8 days ago • 33

authored 2 papers 1 day ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 8 days ago • 33

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 2 days ago • 62

upvoted a paper 1 day ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 2 days ago • 62

liked 16 models about 1 month ago

Qwen/Qwen3-VL-32B-Thinking-FP8

Image-Text-to-Text • 33B • Updated 7 days ago • 9.09k • 17

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20 • 36.5k • 86

Qwen/Qwen3-VL-2B-Instruct-FP8

Image-Text-to-Text • 2B • Updated Oct 20 • 9.4k • 27

Qwen/Qwen3-VL-2B-Thinking-FP8

Image-Text-to-Text • 2B • Updated 7 days ago • 6.91k • 19

Qwen/Qwen3-VL-32B-Thinking

Image-Text-to-Text • 33B • Updated Oct 21 • 538k • 68

Qwen/Qwen3-VL-32B-Instruct-FP8

Image-Text-to-Text • 33B • Updated Oct 22 • 113k • 27

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23 • 436k • 215

Qwen/Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Oct 21 • 1.53M • 132

Qwen/Qwen3-4B-Thinking-2507-FP8

Text Generation • 4B • Updated Aug 6 • 183k • 39

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 678k • • 473

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated 7 days ago • 1.38M • • 423

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 7 days ago • 30.1k • 24

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated 7 days ago • 117k • 45

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated 7 days ago • 206k • 31

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated 7 days ago • 210k • 89

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated 7 days ago • 49.3k • • 162