Post
1388
Super nice intro to fine-tuning with TRL, just dropped by
@google
(runs free on Colab)!
They use SFT + QLoRA to fine-tune the tiny Gemma 3 270M model for emoji generation
Here’s what the fine-tuned model generates for the prompt: “I'm learning to tweet” → 🐦🗣💻
Colab: https://colab.research.google.com/github/google-gemini/gemma-cookbook/blob/main/Demos/Emoji-Gemma-on-Web/resources/Fine_tune_Gemma_3_270M_for_emoji_generation.ipynb
Try it out: google/emoji-gemma
Learn more: https://developers.googleblog.com/en/own-your-ai-fine-tune-gemma-3-270m-for-on-device/
They use SFT + QLoRA to fine-tune the tiny Gemma 3 270M model for emoji generation
Here’s what the fine-tuned model generates for the prompt: “I'm learning to tweet” → 🐦🗣💻
Colab: https://colab.research.google.com/github/google-gemini/gemma-cookbook/blob/main/Demos/Emoji-Gemma-on-Web/resources/Fine_tune_Gemma_3_270M_for_emoji_generation.ipynb
Try it out: google/emoji-gemma
Learn more: https://developers.googleblog.com/en/own-your-ai-fine-tune-gemma-3-270m-for-on-device/