Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 5 days ago • 30
CoT-Lab: Human-AI Co-Thinking Laboratory 🤖 Generate human-like text responses to your prompts • Running • 27
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published 14 days ago • 86
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 14 days ago • 295
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 20 days ago • 67
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 22 days ago • 53
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 22 days ago • 56
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 22 days ago • 272
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 25 days ago • 29
Reasoning Datasets Collection Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 28 days ago • 84
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 28 days ago • 253