6 27 11

Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper about 20 hours ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper 8 days ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

liked a model 12 days ago

lllyx/Qwen3-4B-Base-GRPO

View all activity

Organizations

upvoted a paper about 20 hours ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 3 days ago • 126

upvoted a paper 8 days ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published 10 days ago • 22

liked 2 models 12 days ago

lllyx/Qwen3-4B-Base-GRPO

Text Generation • 4B • Updated 12 days ago • 174 • 2

lllyx/Qwen3-1.7B-SFT

Text Generation • 2B • Updated 4 days ago • 891 • 3

updated a collection 14 days ago

JustRL

Collection

3 items • Updated 14 days ago • 5

commented a paper 24 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 94 •

authored a paper 30 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 94

commented a paper about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 94 •

upvoted a paper about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 94

submitted a paper to Daily Papers about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 94

commented a paper 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59 •

upvoted a paper 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

submitted a paper to Daily Papers 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

liked a model 3 months ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated 9 days ago • 13k • 678

upvoted a paper 3 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

liked a model 3 months ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated 6 days ago • 130k • 1.37k

liked a model 4 months ago

openbmb/AgentCPM-Explore

Text Generation • 4B • Updated Jan 18 • 986 • • 414

updated 2 models 5 months ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 842 • • 3

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 1.31k • • 10

upvoted a collection 5 months ago

JustRL

Collection

3 items • Updated 14 days ago • 5

Bingxiang He

AI & ML interests

Recent Activity

Organizations

hbx's activity