A continuously updated benchmark evaluating AI coding agents on real-world software engineering tasks from GitHub issues.
Unipat AI
UnipatAI
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning updated a collection 2 days ago
Monthly-SWEBench updated a dataset 2 days ago
UnipatAI/Monthly-SWEBench-2026-04Organizations
None yet