Solar Pro Collection The most intelligent LLM on a single GPU • 4 items • Updated 7 days ago • 13
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 50
Yi 1.5 GGUFs Collection Collection of Yi 1.5 GGUFs made with gguf-my-repo • 15 items • Updated May 20 • 5
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 8 days ago • 497
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs Paper • 2402.15627 • Published Feb 23 • 34
C4AI Command R Collection C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 4 items • Updated Aug 30 • 19
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 603
Frankenmodels Collection They're not supposed to be that size! Neat, right? • 8 items • Updated Dec 12, 2023 • 3