Sergio Paniego PRO
AI & ML interests
None yet
Recent Activity
posted an update about 1 hour ago
how do you sync a trillion parameter model every RL step without a shared cluster? we just wrote a blog about it, led by @aminediroHF
what I like the most is the way it proves you can use the Hub for basically everything 🧐 → trainer on one machine, vLLM in a HF Space, the wordle env in another HF Space and weights going through a Hub Bucket. no shared cluster, just HTTPS
it works because ~99% of bf16 weights don't change between RL steps so you only sync the diff. 1.2 GB to 25 MB of payload per step
https://huggingface.co/blog/delta-weight-sync liked a Space about 1 hour ago
huggingface-projects/rf-detr-realtime-webcam updated a dataset about 10 hours ago
agents-course/final-certificates