KodCode (KodCode)

zhangchenxu

authored a paper 8 months ago

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1, 2025 • 29

zhangchenxu

in KodCode/KodCode-V1-SFT-R1 11 months ago

The relationship between r1_correctness and r1_solution

1

#3 opened 11 months ago by

zimuwang

zhangchenxu

authored 2 papers about 1 year ago

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Paper • 2505.23977 • Published May 29, 2025 • 10

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Paper • 2505.14625 • Published May 20, 2025 • 13

zhangchenxu

updated a collection about 1 year ago

KodCode-V1

Collection

KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. • 5 items • Updated Mar 2 • 5

zhangchenxu

updated a dataset about 1 year ago

KodCode/KodCode-Light-RL-10K

Viewer • Updated Apr 2, 2025 • 10k • 2.21k • 9

zhangchenxu

published a dataset about 1 year ago

KodCode/KodCode-Light-RL-10K

Viewer • Updated Apr 2, 2025 • 10k • 2.21k • 9

zhangchenxu

updated 2 datasets about 1 year ago

KodCode/KodCode-V1-SFT-R1

Viewer • Updated Mar 17, 2025 • 483k • 2.68k • 39

KodCode/KodCode-V1

Viewer • Updated Mar 17, 2025 • 487k • 5.35k • 110

zhangchenxu

updated a collection about 1 year ago

KodCode-V1

Collection

KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. • 5 items • Updated Mar 2 • 5

zhangchenxu

updated a dataset about 1 year ago

KodCode/KodCode-V1-SFT-4o

Viewer • Updated Mar 16, 2025 • 410k • 449 • 10

zhangchenxu

published a dataset about 1 year ago

KodCode/KodCode-V1-SFT-4o

Viewer • Updated Mar 16, 2025 • 410k • 449 • 10

zhangchenxu

in KodCode/KodCode-V1-SFT-R1 over 1 year ago

Add coding task category to KodCode dataset card

1

#2 opened over 1 year ago by

nielsr

zhangchenxu

in KodCode/KodCode-V1 over 1 year ago

Update license to CC BY-NC 4.0

1

#2 opened over 1 year ago by

nielsr

zhangchenxu

updated a Space over 1 year ago

README

🐱

zhangchenxu

published a Space over 1 year ago

README

🐱

yyqoni

updated a collection over 1 year ago

KodCode-V1

Collection

KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. • 5 items • Updated Mar 2 • 5

zhangchenxu

authored 2 papers over 1 year ago

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published Feb 17, 2025 • 3

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4, 2025 • 34

yyqoni

authored a paper over 1 year ago

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4, 2025 • 34

Team members 4
