Running 1.5k Big Code Models Leaderboard 📈 1.5k Explore and submit code model evaluations on a leaderboard
FreedomIntelligence/medical_o1_verifier_3B_Qwen2.5 Text Classification • 3B • Updated Feb 6, 2025 • 12 • 6
FreedomIntelligence/medical-o1-verifiable-problem Viewer • Updated Dec 30, 2024 • 40.6k • 1.05k • 122