InternVL 2.5 Collection Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling • 18 items • Updated about 17 hours ago • 67
rusBeIR-datasets Collection Collection of datasets used in rusBeIR • 25 items • Updated about 16 hours ago • 3
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published 11 days ago • 110
PaliGemma 2 Release Collection Vision-Language Models available in 3B, 10B and 28B variants. • 23 items • Updated 5 days ago • 109
Unsloth 4-bit Dynamic Quants Collection Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using only <10% more VRAM than BnB 4-bit • 7 items • Updated 4 days ago • 18
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 27 days ago • 6
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 20 days ago • 27
OpenScholar_V1 Collection The set of models, index, and data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 26 days ago • 29
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published Nov 17 • 50
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 26 days ago • 55
Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 6 items • Updated 16 days ago • 6
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 15 items • Updated 6 days ago • 52
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published 30 days ago • 17
Rombos-Coder-V2.5 Collection Collection of coding models made by rombo, based on Qwen 2.5 • 6 items • Updated Nov 12 • 6
Granite 3.0 Language Models Collection A series of language models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 11 days ago • 95
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 11 days ago • 20