ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs Paper • 2510.04767 • Published 20 days ago • 26
Generative Universal Verifier as Multimodal Meta-Reasoner Paper • 2510.13804 • Published 11 days ago • 24
Demystifying Reinforcement Learning in Agentic Reasoning Paper • 2510.11701 • Published 13 days ago • 31
Open-AgentRL Collection Demystifying Reinforcement Learning in Agentic Reasoning • 6 items • Updated 13 days ago • 2
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs Paper • 2510.07499 • Published 18 days ago • 45
Revisiting Long-context Modeling from Context Denoising Perspective Paper • 2510.05862 • Published 19 days ago • 20
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation Paper • 2510.07313 • Published 18 days ago • 6
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention Paper • 2510.04212 • Published 21 days ago • 22
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 18 days ago • 27
Vibe Checker: Aligning Code Evaluation with Human Preference Paper • 2510.07315 • Published 18 days ago • 30
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Paper • 2510.06917 • Published 18 days ago • 34
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 23 days ago • 92
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models Paper • 2509.06949 • Published Sep 8 • 56
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5 • 50
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs Paper • 2506.18896 • Published Jun 23 • 29
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Paper • 2506.03136 • Published Jun 3 • 24
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning Paper • 2505.16270 • Published May 22 • 6
ReasonFLux-Coder Collection Coding LLMs excel at both writing code and generating unit tests. • 9 items • Updated May 26 • 11