REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 14 days ago • 82
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 10 days ago • 83
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated 28 days ago • 8
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published 22 days ago • 79
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 11 days ago • 63
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 10 days ago • 230
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published about 1 month ago • 123
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 30 days ago • 124
The Open Source Advantage in Large Language Models (LLMs) Paper • 2412.12004 • Published Dec 16, 2024 • 9
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published Dec 12, 2024 • 35
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published Dec 13, 2024 • 139
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published Dec 11, 2024 • 53