In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 10 hours ago

lewtun/ml-intern-fable5sft

published a Space about 10 hours ago

lewtun/ml-intern-fable5sft

updated a Space about 12 hours ago

lewtun/ml-intern-wordle

Buckets 47

lewtun/ml-intern-wordle-smoke-bucket-5

lewtun/ml-intern-wordle-smoke-bucket-4

lewtun/ml-intern-wordle-smoke-bucket-3

lewtun/ml-intern-wordle-smoke-bucket-2

lewtun/qwen3-4b-capybara-static-e6d3fd-bucket

lewtun/qwen3-4b-capybara-static-636f68-bucket

Posts 8

Post

5073

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊 CodeForces-CoTs: a new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

Articles 41

Article

88

The Open Source Community is backing OpenEnv for Agentic RL

Collections 6

Papers 11

arxiv:2504.11354

arxiv:2504.05299

arxiv:2503.07572

arxiv:2502.02737

Spaces 112

Trackio Dashboard

Display an interactive tracking dashboard

Trackio Dashboard

Display interactive tracking dashboard

Ml Intern Wordle Smoke

View your experiment tracking dashboard

Qwen3 4b Capybara Static E6d3fd

View and manage your tracked data with an interactive dashboard

Qwen3 4b Capybara Static 636f68

View and monitor your data in an interactive dashboard

Sft Qwen3 4b Capybara Static 746d14

Monitor and visualize project data in real time

Models 324

lewtun/qwen3-0.6b-wordle-grpo

Text Generation • 0.6B • Updated about 12 hours ago

lewtun/Qwen3-4B-Capybara-SFT

Text Generation • 4B • Updated about 13 hours ago

lewtun/qwen3-4b-capybara

Text Generation • 4B • Updated about 14 hours ago

lewtun/qwen3-0.6b-capybara-smoke

Text Generation • 0.6B • Updated 10 days ago • 61

lewtun/qwen3-0.6b-capybara

Text Generation • 0.6B • Updated 11 days ago • 49

lewtun/qwen3-0.6b-capybara-1step

Text Generation • 0.6B • Updated 12 days ago • 46

lewtun/qwen3-0.6b-angrygiraffe-sft

Text Generation • 0.6B • Updated 15 days ago • 62

lewtun/qwen3-4b-hermes-tooluse

Text Generation • 4B • Updated 15 days ago • 5

lewtun/qwen3-0.6b-sft-capybara

Text Generation • 0.6B • Updated May 12 • 125

lewtun/smollm2-1.7b-capybara-sft

Datasets 96

lewtun/ml-intern-sessions

Updated about 5 hours ago • 4.02k • 3

lewtun/capybara-25-20260507

Viewer • Updated May 7 • 25 • 20

lewtun/capybara-25-20260506

Viewer • Updated May 6 • 25 • 13

lewtun/capybara-25

Viewer • Updated May 6 • 25 • 21

lewtun/capybara-100-2026-05-05

Viewer • Updated May 5 • 100 • 14

lewtun/capybara-100-test-2026-05-05

Updated May 5 • 8

lewtun/openthoughts-100

Updated May 5 • 33

lewtun/Capybara-100

Viewer • Updated May 5 • 100 • 28

lewtun/running-dashboard-data

Viewer • Updated May 3 • 16 • 5

lewtun/dolci-think-sft-6400

Viewer • Updated Mar 11 • 6.4k • 20
