Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 7 items • Updated 4 days ago • 36
Breeze 2 Family Collection Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 5 items • Updated 3 days ago • 11
CritiqueFineTuning Collection The dataset and models for CritiqueFineTuning • 4 items • Updated 14 days ago • 2
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 18 days ago • 54
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper • 2410.07985 • Published Oct 10, 2024 • 31
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 21 days ago • 99
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated 6 days ago • 74
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 6 days ago • 69
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 4 days ago • 90
view article Article Gradio spaces are the perfect agent tools\! By burtenshaw • about 1 month ago • 14
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated 5 days ago • 65
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 20 days ago • 343