meigel

AI & ML interests

None yet

Recent Activity

upvoted a collection 21 minutes ago

DeepSeek-Prover

new activity 1 day ago

open-r1/README: [Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO

liked a Space 1 day ago

open-r1/README

Organizations

None yet

meigel's activity

upvoted a collection 21 minutes ago

DeepSeek-Prover

Collection

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16, 2024 • 25

upvoted an article 4 days ago

Article

Open-source DeepResearch – Freeing our search agents

6 days ago • 829

upvoted an article 7 days ago

Article

Open-R1: Update #1

and 7 others • 8 days ago • 259

upvoted a paper 12 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 87

upvoted 2 collections 13 days ago

📐 FineMath

Collection

FineMath datasets and ablation models • 14 items • Updated Jan 6 • 19

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 214

upvoted 2 collections 15 days ago

FuseO1-Preview

Collection

System-II Reasoning Fusion of LLMs • 10 items • Updated 9 days ago • 17

DeepSeek-R1

Collection

8 items • Updated 20 days ago • 449

upvoted a paper 15 days ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published 22 days ago • 14

upvoted a collection 18 days ago

DeepSeek R1 (All Versions)

Collection

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 162

upvoted 4 papers 18 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published Jan 8 • 89

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 92

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 254

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 21 days ago • 31