Running on CPU Upgrade Featured 3.16k The Smol Training Playbook 📚 3.16k The secrets to building world-class LLMs
Runtime error Agents Featured 83 MMaDA 🌍 83 Demo for MMaDA: Multimodal Large Diffusion Language Models
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5 Sentence Similarity • Updated Mar 13, 2025 • 1.09k • 65
Runtime error Agents Featured 5.07k MusicGen 🎵 5.07k Generate music from text descriptions and optional melodies
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889