The training script is missing
The training script for v0.2 seems to be missing; v0.1 does have train_unsloth_7b.py.
I'm trying to figure out what special tokenization and instruction formats were used for the different data sources/tasks, but I can't seem to find any info about it.
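For reference, this is how I have been poking at the released tokenizer (a minimal sketch using the Hugging Face transformers API; the Finnish-NLP/Ahma-3B repo id is taken from the link in the reply below, and whether a chat template is bundled there is something I could not confirm):

```python
from transformers import AutoTokenizer

# Load the published tokenizer to see which special tokens it defines.
tokenizer = AutoTokenizer.from_pretrained("Finnish-NLP/Ahma-3B")

print(tokenizer.special_tokens_map)               # bos/eos/unk/pad tokens
print(tokenizer.additional_special_tokens)        # any extra task/instruction tokens
print(getattr(tokenizer, "chat_template", None))  # instruction format, if bundled
```

That shows the tokens themselves, but not how the different data sources were formatted during training.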
Thank you so much for giving us Finnish-language models too! 🔥🇫🇮
We are releasing new 3B and 7B models soon. I will add the finetuning scripts later, once those models are published and once I have prepared the finetuning tutorial notebooks.
Unsloth, which I have mainly used in my own finetuning trials, unfortunately does not work on Google Colab for our upcoming 3B model, so I will need to do it with PEFT.
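As a placeholder until the official scripts are out, a PEFT LoRA setup along these lines illustrates the approach (a minimal sketch, not the exact recipe; the model id, target modules, and hyperparameters are illustrative assumptions to adjust):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_id = "Finnish-NLP/Ahma-3B"  # assumed model id, taken from the link below

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # adjust dtype for your hardware
)

# LoRA adapter config; r/alpha/target_modules are illustrative defaults,
# not the values used for the official finetunes.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # sanity check: only adapter weights train
```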
But I will probably also share a finetuning example with Unsloth, which works on newer GPUs.
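The Unsloth variant of the same setup would look roughly like this (again only a sketch; the model id and LoRA parameters mirror the illustrative values above):

```python
from unsloth import FastLanguageModel

# Load the base model with Unsloth's optimized kernels (4-bit to save VRAM).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="Finnish-NLP/Ahma-3B",  # assumed; swap in the model you finetune
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters; values mirror the PEFT sketch above and are illustrative.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```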
You can find a finetuning example here: https://huggingface.co/Finnish-NLP/Ahma-3B/blob/main/Finetune_Ahma_3B_example.ipynb
Along with a video: https://www.youtube.com/watch?v=6mbgn9XzpS4