view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +5 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra • 5 days ago • 34
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence Paper • 2605.26494 • Published 6 days ago • 35
MiniCPM5 Collection A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 6 days ago • 22
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 33 items • Updated 3 days ago • 144
view article Article Why Open Models Are the Only Sustainable Way to Teach AI penelopegittos • 10 days ago • 8
ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery Paper • 2605.16902 • Published 16 days ago • 1
view article Article PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend PaddlePaddle • 14 days ago • 33
Scaling Laws for Mixture Pretraining Under Data Constraints Paper • 2605.12715 • Published 20 days ago • 4
view article Article How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance jeffboudier • 17 days ago • 6
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 21 days ago • 23