ME's picture

12 11

ME

meigel

·

AI & ML interests

None yet

Recent Activity

liked a model about 18 hours ago

Goedel-LM/Goedel-Prover-SFT

upvoted an article 2 days ago

Open-R1: Update #1

liked a model 6 days ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

None yet

meigel's activity

liked a model about 18 hours ago

Goedel-LM/Goedel-Prover-SFT

Updated 2 days ago • 52 • 5

upvoted an article 2 days ago

Article

Open-R1: Update #1

By

•

3 days ago

• 204

liked a model 6 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 3 days ago • 1.04M • • 6.46k

upvoted a paper 7 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 26 days ago • 87

upvoted 2 collections 8 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 29 days ago • 19

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213

liked a Space 8 days ago

Anthropic Citations With Gradio Metadata Key

anthropic's citation aPI with gradio chatbot and tool use

liked a model 8 days ago

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 5 days ago • 301k • 515

liked a Space 9 days ago

Running on CPU Upgrade

AI Comic Factory

Create your own AI comic with a single prompt

upvoted 2 collections 10 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 4 days ago • 16

DeepSeek-R1

8 items • Updated 15 days ago • 366

updated a collection 10 days ago

LLM

16 items • Updated 10 days ago

upvoted a paper 10 days ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published 17 days ago • 14

liked a Space 10 days ago

Running on Zero

Hunyuan3D-2.0

Text-to-3D and Image-to-3D Generation

upvoted a collection 12 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 6 hours ago • 139

upvoted a paper 13 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 27 days ago • 90

updated a collection 13 days ago

LLM

16 items • Updated 10 days ago