Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published 10 days ago • 5
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published 3 days ago • 25
Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling Paper • 2602.02453 • Published 11 days ago • 35
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Paper • 2602.05711 • Published 8 days ago • 9
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 4 days ago • 58
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning Paper • 2602.07075 • Published 8 days ago • 18
SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks Paper • 2602.06854 • Published 7 days ago • 6
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published 11 days ago • 14
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 16 days ago • 99
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published Nov 11, 2025 • 108
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 8 days ago • 41