Natural Language Processing
BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software