Intelligent-Internet/swebench-pro-gpt-5-codex-ii-agent-trajectories • 728 • 12
Intelligent-Internet/swebench-pro-claude-sonnet-4.5-ii-agent-trajectories • 726 • 13 • 2
Intelligent-Internet/II-Search-Benchmark-Details • 26.3k • 19 • 2
Intelligent-Internet/II-Search-RL • 43.4k • 15 • 2
Intelligent-Internet/II-Search-CIR-SFT • 85.6k • 19 • 5
Intelligent-Internet/II-Search-SFT • 27.1k • 4 • 1
Intelligent-Internet/arxiv • 113 • 8
Intelligent-Internet/ChatDoctor-RL • 16.7k • 61 • 15
Intelligent-Internet/II-Medical-RL • 15.9k • 31 • 11
Intelligent-Internet/II-Medical-Reasoning-SFT • 2.2M • 282 • 50
Intelligent-Internet/GAIA-Subset-Benchmark • 44 • 1.24k • 3
Intelligent-Internet/OpenAI-HealthBench-II-Medical-8B-1706-GPT-4.1 • 5k • 10 • 2
Intelligent-Internet/pd12m • 11.7M • 1.7k • 6
Intelligent-Internet/wikipedia_en • 13.4M • 2.82k • 2
Intelligent-Internet/ii-agent_gaia-benchmark_validation • 165 • 1.23k • 8
Intelligent-Internet/OpenAI-HealthBench-II-Medical-8B-GPT-4.1 • 5k • 11 • 1
Intelligent-Internet/II-Thought-RL-v0 • 342k • 274 • 54
Intelligent-Internet/frames-benchmark • 822 • 8 • 3
Intelligent-Internet/Vietnamese-Entrance-Exam • 432 • 16 • 1
Intelligent-Internet/II-Thought-RL-v0-Math-50K • 53.3k • 8 • 3