OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 25 days ago • 109
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 17 days ago • 107
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published 6 days ago • 76
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published 18 days ago • 68
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published 28 days ago • 64
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published 17 days ago • 61
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 20 days ago • 59
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 18 days ago • 56
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 11 days ago • 53