How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 15 days ago • 30
Granite Embedding R2: Setting New Standards for Enterprise Retrieval By hansolosan • about 20 hours ago • 12
High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) By treble-technologies and 4 others • 2 days ago • 10
Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 6 days ago • 9
Reactive Transformer (RxT): Fixing the Memory Problem in Conversational AI By AdamF92 • 7 days ago • 5
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 233
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 15 days ago • 30
Granite Embedding R2: Setting New Standards for Enterprise Retrieval By hansolosan • about 20 hours ago • 12
High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) By treble-technologies and 4 others • 2 days ago • 10
Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 6 days ago • 9
Reactive Transformer (RxT): Fixing the Memory Problem in Conversational AI By AdamF92 • 7 days ago • 5
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 233