Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face 2 days ago • 6
Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs Apr 29 • 40
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU Dec 5, 2023 • 4
Overview of natively supported quantization schemes in 🤗 Transformers Sep 12, 2023 • 12