LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated about 6 hours ago • 13
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 7 days ago • 44
Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 49
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 12 days ago • 20
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 Oct 29, 2024 • 60
Quantized translategemma Collection Quickly tested with vLLM. Not fully compatible yet. • 7 items • Updated Jan 19 • 3
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 188 items • Updated 2 days ago • 32
SAM Audio Collection The SAM Audio model licenses allow for redistribution so long as the original license files are included • 9 items • Updated Dec 25, 2025 • 4
ViDoRe Benchmark V3 Collection ViDoRe V3 is our latest benchmark, engineered to set a new industry gold standard for multi-modal, enterprise document retrieval evaluation. • 8 items • Updated Jan 14 • 19
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 62
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 314
SWE-Playground Collection Official Collection for "Training Versatile Coding Agents in Synthetic Environments" • 11 items • Updated Nov 22, 2025 • 2