Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,19 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
datasets:
|
3 |
+
- datacrunch/freformatted
|
4 |
+
---
|
5 |
+
|
6 |
+
## Whats this merge about
|
7 |
+
|
8 |
+
- Its a 500 step checkpoint of the following run
|
9 |
+
```
|
10 |
+
python ./trl/examples/scripts/sft_trainer.py --model_name meta-llama/Llama-2-7b-hf --dataset_name datacrunch/finnish_alpaca --load_in_4bit --use_peft --batch_size 4 --gradient_accumulation_steps 2
|
11 |
+
```
|
12 |
+
- Using the repo https://github.com/lvwerra/trl/blob/main/examples/scripts/sft_trainer.py
|
13 |
+
|
14 |
+
I am still figuring out an efficient way of doing this, in the meantime you can test it
|
15 |
+
|
16 |
+
- An example prompt you can try, that should return the Finnish response you need
|
17 |
+
```
|
18 |
+
"Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request. ### Instruction: Anna kolme vinkkiä terveenä pysymiseen. ###Response:"
|
19 |
+
```
|