Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published 9 days ago • 57 • 6
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published 9 days ago • 57
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation Paper • 2512.20908 • Published 30 days ago • 25
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper • 2601.09195 • Published 9 days ago • 15
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer • Updated 8 days ago • 435k • 3.31k • 40
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks Paper • 2601.03448 • Published 16 days ago • 12
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 17 days ago • 26
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published about 1 month ago • 81
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 53
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 28 days ago • 25