ColSmolVLM Collection Pre-trained checkpoints for the ColVision models with a ColSmolVLM backbone. • 2 items • Updated 7 days ago • 1
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 7 days ago • 27
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 21 days ago • 83
Meta Motivo Collection A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks. • 6 items • Updated Dec 10, 2024 • 9
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published Dec 2, 2024 • 56
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 58
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 31
ReFT: Representation Finetuning for Language Models Paper • 2404.03592 • Published Apr 4, 2024 • 93
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 268
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 114
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper • 2410.02089 • Published Oct 2, 2024 • 12
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Paper • 2404.13026 • Published Apr 19, 2024 • 24
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published Oct 21, 2024 • 59
Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 67