DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper • 2501.16764 • Published 1 day ago • 4
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published about 14 hours ago • 10
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 2 days ago • 256
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 7 days ago • 252
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 70
Article Train 400x faster Static Embedding Models with Sentence Transformers 14 days ago • 127
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 16 days ago • 89
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 18 days ago • 29
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 97