Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 95
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists Paper • 2406.13439 • Published Jun 19
MILU: A Multi-task Indic Language Understanding Benchmark Paper • 2411.02538 • Published 24 days ago • 1
MILU: A Multi-task Indic Language Understanding Benchmark Paper • 2411.02538 • Published 24 days ago • 1