view post Post 2056 Reply Releasing HQQ Llama-3.1-70b 4-bit quantized version! Check it out at mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq. Achieves 99% of the base model performance across various benchmarks! Details in the model card. 🔥 8 8 +
view post Post 1788 Reply Excited to announce the release of our high-quality Llama-3.1 8B 4-bit HQQ/calibrated quantized model! Achieving an impressive 99.3% relative performance to FP16, it also delivers the fastest inference speed for transformers. mobiuslabsgmbh/Llama-3.1-8b-instruct_4bitgs64_hqq_calib 1 reply · 🔥 9 9 +
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 166
view post Post 2066 Reply Diaries of Open Source. Part 11 🚀🚀Databricks release DBRX, potentially the best open access model! A 132B Mixture of Experts with 36B active params and trained on 12 trillion tokensBlog: https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llmBase and instruct models: databricks/dbrx-6601c0852a0cdd3c59f71962Demo: databricks/dbrx-instruct🤏1-bit and 2-bit quantization exploration using HQQ+Blog post: https://mobiusml.github.io/1bit_blog/Models: https://hf.co/collections/mobiuslabsgmbh/llama2-7b-hqq-6604257a96fc8b9c4e13e0feGitHub: https://github.com/mobiusml/hqq📚Cosmopedia: a large-scale synthetic dataset for pre-training - it includes 25 billion tokens and 30 million filesDataset: HuggingFaceTB/cosmopediaBlog: https://hf.co/blog/cosmopedia⭐Mini-Gemini: multi-modal VLMs, from 2B to 34BModels: https://hf.co/collections/YanweiLi/mini-gemini-6603c50b9b43d044171d0854Paper: Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (2403.18814)GitHub: https://github.com/dvlab-research/MiniGemini🔥VILA - On Pre-training for VLMsModels: Efficient-Large-Model/vila-on-pre-training-for-visual-language-models-65d8022a3a52cd9bcd62698ePaper: VILA: On Pre-training for Visual Language Models (2312.07533)Misc👀 FeatUp: a framework for image features at any resolution: mhamilton723/FeatUp FeatUp: A Model-Agnostic Framework for Features at Any Resolution (2403.10516)🍞ColBERTus Maxiums, a colbertialized embedding model mixedbread-ai/mxbai-colbert-large-v1🖌️Semantic Palette, a new drawing paradigm ironjr/SemanticPalette🧑⚕️HistoGPT, a vision model that generates accurate pathology reports marr-peng-lab/histogpt https://www.medrxiv.org/content/10.1101/2024.03.15.24304211v1 4 replies · 🔥 15 15 ❤️ 6 6 👍 3 3 +