InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 24 days ago • 98
view article Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 51
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 179
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26 • 88
AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference Paper • 2504.10326 • Published Apr 14 • 25
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 236
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • Mar 24 • 20
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 97
view article Article From Files to Chunks: Improving Hugging Face Storage Efficiency Nov 20, 2024 • 66
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 77