Ivelin Ivanov PRO

ivelin

AI & ML interests

computer vision, vision-language models, multi modal transformers

Recent Activity

Organizations

GuardianUI's profile picture zk0.bot's profile picture

ivelin's activity

upvoted an article 3 days ago
view article
Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

76
upvoted an article 10 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

660
liked a Space about 2 months ago
reacted to merve's post with 😎 2 months ago
view post
Post
2673
small but mighty 🔥
you can fine-tune SmolVLM on an L4 with batch size of 4 and it will only take 16.4 GB VRAM 🫰🏻 also with gradient accumulation simulated batch size is 16 ✨
I made a notebook that includes all the goodies: QLoRA, gradient accumulation, gradient checkpointing with explanations on how they work 💝 https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
upvoted an article 5 months ago
view article
Article

XetHub is joining Hugging Face!

81
updated a Space almost 2 years ago