
Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity


upvoted a paper about 7 hours ago

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Paper • 2508.03501 • Published Aug 5 • 57

liked a dataset about 8 hours ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22 • 90.1k • 7.93k • 934

New activity in nebius/SWE-rebench about 9 hours ago

How can I find all instance_ids that come with a Docker image?

#10 opened 16 days ago by KYLN24

liked a dataset 1 day ago

meituan-longcat/AMO-Bench

Viewer • Updated 1 day ago • 50 • 438 • 10

liked 2 models 22 days ago

microsoft/UserLM-8b

Text Generation • 8B • Updated 28 days ago • 4.88k • 340

inclusionAI/Ling-1T

Text Generation • 1000B • Updated 2 days ago • 4.97k • 505

upvoted a paper 22 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published 23 days ago • 26

liked a dataset 22 days ago

inclusionAI/Ling-Coder-SFT

Viewer • Updated Mar 27 • 4.48M • 841 • 25

liked a model 24 days ago

jinaai/ReaderLM-v2

Text Generation • 2B • Updated Mar 4 • 9.81k • 722

liked 3 datasets 24 days ago

liked a model about 1 month ago

Kwaipilot/KAT-Dev

Text Generation • 33B • Updated 23 days ago • 2.92k • 177

upvoted 2 papers about 1 month ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 67

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 49

liked 2 datasets about 1 month ago

meta-agents-research-environments/gaia2

Viewer • Updated Sep 25 • 963 • 8.44k • 31

ScaleAI/SWE-bench_Pro

Viewer • Updated Sep 25 • 731 • 12.4k • 34

liked 2 models about 2 months ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated 27 days ago • 12.5k • 751

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19 • 3.55k • 754

upvoted a paper about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 186
