Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Paper ā¢ 2501.13928 ā¢ Published 6 days ago ā¢ 12
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths ā¢ 2 items ā¢ Updated 3 days ago ā¢ 83
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation Paper ā¢ 2501.06713 ā¢ Published 18 days ago ā¢ 1
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release ā¢ 12 items ā¢ Updated 6 days ago ā¢ 60
SmolLM2 - Smashed Collection Many variations of SmolLM2 with many variation techniques ā¢ 15 items ā¢ Updated 29 days ago ā¢ 1
Image Classification (ResNet, ViT, MobileNet, ...) Collection 524 items ā¢ Updated Mar 27, 2024 ā¢ 4
Text-to-text Generation Models (LLMs, Llama, GPT, ...) Collection 5165 items ā¢ Updated 14 minutes ago ā¢ 13
Text-to-image Generation Models (Diffusion, LCM...) Collection 57 items ā¢ Updated May 8, 2024 ā¢ 8
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper ā¢ 2501.12909 ā¢ Published 7 days ago ā¢ 62
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group ā¢ 21 items ā¢ Updated 9 days ago ā¢ 20
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper ā¢ 2501.08313 ā¢ Published 15 days ago ā¢ 270
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. ā¢ 27 items ā¢ Updated 3 days ago ā¢ 101
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI ā¢ 14 days ago ā¢ 40