Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities Paper • 2308.12966 • Published Aug 24, 2023 • 7
LLaVA-Critic Collection as a general evaluator for assessing model performance • 6 items • Updated Oct 6 • 8
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3 • 82
Qwen2-Math Collection Math-specific model series based on Qwen2 • 8 items • Updated Sep 18 • 45
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5 • 161
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 • 177
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 347
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data Paper • 2405.14333 • Published May 23 • 35
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 166
Article A Dive into Pretraining Strategies for Vision-Language Models Feb 3, 2023 • 48