Luo

ramiroluo

LuoXiaoHeics

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 19 days ago • 159

upvoted a paper 17 days ago

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published 25 days ago • 26

upvoted a paper 18 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published 21 days ago • 21

upvoted a paper about 1 month ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

upvoted a paper 2 months ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 85

upvoted a paper 4 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

submitted a paper to Daily Papers 4 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

New activity in PRIME-RL/P1-VL-30B-A3B 4 months ago

Add metadata and link to paper/code

#1 opened 4 months ago by

nielsr

New activity in PRIME-RL/P1-VL-235B-A22B 4 months ago

Add metadata and links to paper and code

#1 opened 4 months ago by

nielsr

authored 2 papers 4 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 32

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted a paper 4 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

updated a model 4 months ago

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 5 • 3

published 2 models 4 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 23 • 3

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 5 • 3

updated a model 4 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 23 • 3

upvoted a paper 8 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10, 2025 • 37

updated a Space over 2 years ago

HalluChecker

😻

Display leaderboard for LLM hallucination checks
