Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper β’ 2503.09573 β’ Published 2 days ago β’ 46
view article Article LeRobot goes to driving school: Worldβs largest open-source self-driving dataset 4 days ago β’ 45
Gemini Embedding: Generalizable Embeddings from Gemini Paper β’ 2503.07891 β’ Published 4 days ago β’ 25
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper β’ 2503.07920 β’ Published 4 days ago β’ 89
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper β’ 2503.03601 β’ Published 9 days ago β’ 208
EuroBERT: Scaling Multilingual Encoders for European Languages Paper β’ 2503.05500 β’ Published 7 days ago β’ 72
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper β’ 2503.00865 β’ Published 12 days ago β’ 58
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper β’ 2503.01743 β’ Published 11 days ago β’ 72
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference Paper β’ 2502.18137 β’ Published 17 days ago β’ 53
Slamming: Training a Speech Language Model on One GPU in a Day Paper β’ 2502.15814 β’ Published 23 days ago β’ 66
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper β’ 2502.15007 β’ Published 22 days ago β’ 162