ScaleAI/audiomc
Viewer
•
Updated
•
452
•
1.04k
•
7
None defined yet.
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents