Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 15 items • Updated Sep 27 • 4
CommonCrawl Collection Large web-mined general corpus based on CommonCrawl. • 7 items • Updated 9 days ago • 1
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated 21 days ago • 98
Optimized ONNX models for NVIDIA RTX GPUs Collection Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs • 7 items • Updated 30 days ago • 10
Spaces for Model / Space / useful Utilities in Hugging Face Collection 224 items • Updated 2 days ago • 7
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26 • 2
Faith and Fate: Limits of Transformers on Compositionality Paper • 2305.18654 • Published May 29, 2023 • 6
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 204
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 603
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 8 items • Updated Oct 1 • 20
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 20 days ago • 69
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset Paper • 2309.04662 • Published Sep 9, 2023 • 22