Article Unlock the Power of AI in Your Browser with Transformers.js By luigi12345 • 2 days ago • 2
Article Understanding the Algorithm of Thoughts: A Heuristic Approach Beyond LLMs By TuringsSolutions • 2 days ago • 2
Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • 2 days ago • 59
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 6 days ago • 87
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published 7 days ago • 65
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 9 days ago • 58
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 17 days ago • 171
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published 30 days ago • 88
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published 29 days ago • 49
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Paper • 2410.16268 • Published about 1 month ago • 65
FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors Paper • 2410.16271 • Published about 1 month ago • 80
steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20 • 23
Granite 3.0 Language Models Collection A series of language models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 89
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI Paper • 2410.11623 • Published Oct 15 • 46
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks Paper • 2410.12381 • Published Oct 16 • 42
MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published Oct 17 • 31
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17 • 74