CodeArena: A Collective Evaluation Platform for LLM Code Generation Paper • 2503.01295 • Published 11 days ago • 7
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22, 2024 • 46