mdbr-leaf-embedding Collection A collection of compact, high performance text-embedding models trained using our proposed LEAF framework, see https://arxiv.org/abs/2509.12539 • 4 items • Updated Sep 29, 2025 • 6
NuExtract-2.0 Collection Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc. • 15 items • Updated 14 days ago • 28
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 112
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 184
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 9 items • Updated Dec 23, 2025 • 23
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model Mar 10, 2025 • 146
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31, 2025 • 29
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated May 18, 2025 • 19
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published Feb 18, 2025 • 41
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google +1 Feb 19, 2025 • 75
view article Article Extending the Massive Text Embedding Benchmark to French: the datasets Jan 12, 2024 • 5
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 7 items • Updated Dec 24, 2025 • 55
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 255
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7, 2025 • 199