Post
1363
Super nice intro to fine-tuning with TRL, just dropped by
@google
(runs free on Colab)!
They use SFT + QLoRA to fine-tune the tiny Gemma 3 270M model for emoji generation
Hereβs what the fine-tuned model generates for the prompt: βI'm learning to tweetβ β π¦π£π»
Colab: https://colab.research.google.com/github/google-gemini/gemma-cookbook/blob/main/Demos/Emoji-Gemma-on-Web/resources/Fine_tune_Gemma_3_270M_for_emoji_generation.ipynb
Try it out: google/emoji-gemma
Learn more: https://developers.googleblog.com/en/own-your-ai-fine-tune-gemma-3-270m-for-on-device/
They use SFT + QLoRA to fine-tune the tiny Gemma 3 270M model for emoji generation
Hereβs what the fine-tuned model generates for the prompt: βI'm learning to tweetβ β π¦π£π»
Colab: https://colab.research.google.com/github/google-gemini/gemma-cookbook/blob/main/Demos/Emoji-Gemma-on-Web/resources/Fine_tune_Gemma_3_270M_for_emoji_generation.ipynb
Try it out: google/emoji-gemma
Learn more: https://developers.googleblog.com/en/own-your-ai-fine-tune-gemma-3-270m-for-on-device/