Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

posted an update about 1 hour ago

how do you sync a trillion parameter model every RL step without a shared cluster? we just wrote a blog about it, led by @aminediroHF

what I like the most is the way it proves you can use the Hub for basically everything 🧐 → trainer on one machine, vLLM in a HF Space, the wordle env in another HF Space and weights going through a Hub Bucket. no shared cluster, just HTTPS

it works because ~99% of bf16 weights don't change between RL steps so you only sync the diff. 1.2 GB to 25 MB of payload per step

https://huggingface.co/blog/delta-weight-sync
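The "only sync the diff" idea from the post can be sketched in a few lines. This is a minimal illustration, not the implementation from the linked blog: it assumes bf16 weights are handled as raw 16-bit patterns, and the function names (`compute_delta`, `apply_delta`, `payload_bytes`) and the index/value payload layout are hypothetical choices made for this example.

```python
from array import array


def compute_delta(old, new):
    """Return (indices, values) for every position where `new` differs from `old`.

    Both inputs are array('H') buffers holding bf16 bit patterns (an assumption
    of this sketch; the blog's actual wire format may differ).
    """
    if len(old) != len(new):
        raise ValueError("weight buffers must have the same length")
    indices = array("I")  # positions of changed weights
    values = array("H")   # new 16-bit patterns at those positions
    for i, (a, b) in enumerate(zip(old, new)):
        if a != b:
            indices.append(i)
            values.append(b)
    return indices, values


def apply_delta(weights, indices, values):
    """Patch `weights` in place with the changed entries."""
    for i, v in zip(indices, values):
        weights[i] = v


def payload_bytes(indices, values):
    """Size in bytes of the delta if sent as two raw buffers."""
    return len(indices) * indices.itemsize + len(values) * values.itemsize


if __name__ == "__main__":
    old = array("H", [0x3F80] * 1000)   # 1000 weights, all bf16 1.0
    new = array("H", old)
    for i in range(0, 1000, 100):        # change 1% of the weights
        new[i] = 0x3F81
    idx, vals = compute_delta(old, new)

    replica = array("H", old)            # the inference side's stale copy
    apply_delta(replica, idx, vals)
    print(replica == new, payload_bytes(idx, vals), len(new) * new.itemsize)
```

The receiver only needs the changed positions and their new values to reconstruct the full tensor, which is why the payload shrinks when most weights are untouched. For scale, the figures in the post (1.2 GB down to 25 MB) work out to a reduction of roughly 48x.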

liked a Space about 1 hour ago

huggingface-projects/rf-detr-realtime-webcam

updated a dataset about 10 hours ago

agents-course/final-certificates

Organizations

Buckets 12

sergiopaniego/browsergym-grpo-functiongemma-270m-it-bucket

sergiopaniego/huggingface-static-c3b61b-bucket

sergiopaniego/huggingface-static-a08598-bucket

sergiopaniego/repo2rlenv-training-bucket

sergiopaniego/huggingface-static-1a5eab-bucket

sergiopaniego/reasoning-gym-chain-sum-Qwen3-1.7B-bucket

Posts 93

Articles 19

Article

71

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Collections 9

Spaces 139

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

Qwen2-VL-7B

Ask questions about charts in images

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

SmolVLM-trl-sft-ChartQA

Ask questions about charts in images

Huggingface Static C3b61b

View and manage your tracking data in an interactive dashboard

Huggingface Static A08598

View project metrics on an interactive dashboard

Models 125

sergiopaniego/browsergym-grpo-functiongemma-270m-it

Text Generation • 0.3B • Updated 5 days ago • 32 • 2

sergiopaniego/qwen3-grpo-requests

Updated 15 days ago

sergiopaniego/reasoning-gym-chain-sum-Qwen3-1.7B-sft

Text Generation • 2B • Updated 30 days ago • 10

sergiopaniego/reasoning-gym-chain-sum-Qwen3-1.7B

Text Generation • 2B • Updated Apr 28 • 221

sergiopaniego/carla-vlm-gemma-test

sergiopaniego/carla-vlm-qwen35-test

sergiopaniego/carla-vlm-gemma

sergiopaniego/carla-vlm-qwen35

sergiopaniego/nemotron-3-sft

sergiopaniego/Qwen3-0.6B-carla-trolley-escape

0.8B • Updated Feb 26 • 7

Datasets 9

sergiopaniego/requests-pr-diff

Viewer • Updated 15 days ago • 1 • 350

sergiopaniego/trl-r2e-test

Viewer • Updated 16 days ago • 1 • 68

sergiopaniego/chain-sum-rollouts

Viewer • Updated 30 days ago • 50 • 31

sergiopaniego/ttt-scripted-smoke

Viewer • Updated Apr 17 • 20 • 24

sergiopaniego/sample_videos

Viewer • Updated Jun 30, 2025 • 2 • 22

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20, 2025 • 38 • 27

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 36 • 1

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 24

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 21