isrouush
·
AI & ML interests
Data Science, ML
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies
view article
Open-R1: a fully open reproduction of DeepSeek-R1
upvoted
a
paper
12 months ago