SupritiVijay/dr-tulu-sft-deep-research-agent-data-cleaned-rectified Viewer • Updated Nov 30, 2025 • 12k • 39 • 1
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 8 days ago • 81
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 9 days ago • 12
Sweep Next Edit Collection Locally running next edit autocomplete • 2 items • Updated 10 days ago • 4
Scaling Law Discovery Collection Dataset and results for SLD (https://arxiv.org/abs/2507.21184) • 2 items • Updated 30 days ago • 2
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 16 days ago • 71