Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason! By Writer and 1 other • 11 days ago • 55
AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models By imomayiz and 4 others • 6 days ago • 12
"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack By anemll • 6 days ago • 10
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL By driaforall and 1 other • 11 days ago • 18
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • 5 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 219
Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel By estellea and 2 others • 5 days ago • 5
Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason! By Writer and 1 other • 11 days ago • 55
AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models By imomayiz and 4 others • 6 days ago • 12
"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack By anemll • 6 days ago • 10
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL By driaforall and 1 other • 11 days ago • 18
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • 5 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 219
Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel By estellea and 2 others • 5 days ago • 5