BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
Paper • 2510.10159 • Published • 2
AI for Service: Proactive Assistance with AI Glasses
Rethinking LLM Evaluation: Can We Evaluate LLMs with 200x Less Data?