VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL Paper β’ 2505.23977 β’ Published May 29 β’ 10
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Paper β’ 2505.14625 β’ Published May 20 β’ 13
KodCode-V1 Collection KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. β’ 6 items β’ Updated Apr 2 β’ 4
KodCode-V1 Collection KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. β’ 6 items β’ Updated Apr 2 β’ 4
KodCode-V1 Collection KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. β’ 6 items β’ Updated Apr 2 β’ 4
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities Paper β’ 2502.12025 β’ Published Feb 17 β’ 3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper β’ 2503.02951 β’ Published Mar 4 β’ 32
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper β’ 2503.02951 β’ Published Mar 4 β’ 32
KodCode-V1 Collection KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. β’ 6 items β’ Updated Apr 2 β’ 4