Open CoT Leaderboard


AI & ML interests

Chain of Thought, LLM Evaluation

👋 We run the evaluations and host the results that underpin the Open CoT Leaderboard.

For more information about the evaluation pipeline, have a look at our GitHub repo.

To start exploring the evaluation results yourself, check out this notebook.

If you want to run and contribute evaluations to the Open CoT Leaderboard, please apply for membership in this organization. We'll get back to you as soon as possible.

