microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 3 days ago • 113k • 976
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 13 items • Updated 4 days ago • 89
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 15 days ago • 49
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published Jan 29 • 55
FreedomIntelligence/medical-o1-reasoning-SFT Viewer • Updated 14 days ago • 50.1k • 26.2k • 383