gradjitta commited on
Commit
a2790a5
1 Parent(s): 6a7106d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - datacrunch/freformatted
4
+ ---
5
+
6
+ ## Whats this merge about
7
+
8
+ - Its a 500 step checkpoint of the following run
9
+ ```
10
+ python ./trl/examples/scripts/sft_trainer.py --model_name meta-llama/Llama-2-7b-hf --dataset_name datacrunch/finnish_alpaca --load_in_4bit --use_peft --batch_size 4 --gradient_accumulation_steps 2
11
+ ```
12
+ - Using the repo https://github.com/lvwerra/trl/blob/main/examples/scripts/sft_trainer.py
13
+
14
+ I am still figuring out an efficient way of doing this, in the meantime you can test it
15
+
16
+ - An example prompt you can try, that should return the Finnish response you need
17
+ ```
18
+ "Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request. ### Instruction: Anna kolme vinkkiä terveenä pysymiseen. ###Response:"
19
+ ```