Easy2Hard-Bench Collection Easy2Hard-Bench offers six datasets with continuous difficulty ratings, enabling profiling of LLM performance and generalization across difficulties. • 7 items • Updated Jul 3
Easy-to-Hard GPT Rankings Collection Pairwise Difficulty Rankings for Easy-to-Hard Datasets • 7 items • Updated Apr 24