Commit 452a9ed (noeloco)
Parent: 9d84f61

End of training

Files changed (2)
  1. README.md +15 -16
  2. adapter_model.bin +2 -2
README.md CHANGED
@@ -30,8 +30,7 @@ load_in_4bit: true
 strict: false
 
 datasets:
-  - path: /tmp/fizzbuzz-ft/datasets
-    data_files: /tmp/fizzbuzz-ft/datasets/training-set-alpaca.json
+  - path: noeloco/fizzbuzz-sft
     type: alpaca
     ds_type: json
 
@@ -46,7 +45,7 @@ sequence_len: 2048
 sample_packing: false
 pad_to_sequence_len: true
 
-adapter: qlora
+adapter: lora
 lora_model_dir:
 lora_r: 16
 lora_alpha: 8
@@ -102,7 +101,7 @@ special_tokens:
 
 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0295
+- Loss: 0.0210
 
 ## Model description
 
@@ -134,18 +133,18 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.0177 | 0.01 | 1 | 2.5549 |
-| 0.603 | 0.26 | 18 | 0.8667 |
-| 0.3026 | 0.51 | 36 | 0.2340 |
-| 0.0977 | 0.77 | 54 | 0.1274 |
-| 0.1101 | 1.03 | 72 | 0.1098 |
-| 0.0503 | 1.29 | 90 | 0.0469 |
-| 0.0753 | 1.54 | 108 | 0.0516 |
-| 0.2285 | 1.8 | 126 | 0.0192 |
-| 0.0647 | 2.06 | 144 | 0.0386 |
-| 0.0494 | 2.31 | 162 | 0.0334 |
-| 0.0552 | 2.57 | 180 | 0.0293 |
-| 0.0888 | 2.83 | 198 | 0.0295 |
+| 2.0829 | 0.01 | 1 | 2.5224 |
+| 0.6045 | 0.26 | 18 | 0.8178 |
+| 0.3357 | 0.51 | 36 | 0.2672 |
+| 0.1057 | 0.77 | 54 | 0.1210 |
+| 0.1046 | 1.03 | 72 | 0.0818 |
+| 0.052 | 1.29 | 90 | 0.0458 |
+| 0.0641 | 1.54 | 108 | 0.0363 |
+| 0.1952 | 1.8 | 126 | 0.0213 |
+| 0.0573 | 2.06 | 144 | 0.0362 |
+| 0.0346 | 2.31 | 162 | 0.0284 |
+| 0.0513 | 2.57 | 180 | 0.0221 |
+| 0.0865 | 2.83 | 198 | 0.0210 |
 
 
 ### Framework versions
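
The validation-loss rows removed and added in the README.md diff above can be compared mechanically. The following is a minimal sketch, assuming the (step, validation loss) pairs are copied by hand from the two result tables; the names `OLD_RUN`, `NEW_RUN`, `best_step`, and `final_loss` are hypothetical and not part of the training code.

```python
# (step, validation loss) pairs copied from the removed (-) table rows.
OLD_RUN = [
    (1, 2.5549), (18, 0.8667), (36, 0.2340), (54, 0.1274),
    (72, 0.1098), (90, 0.0469), (108, 0.0516), (126, 0.0192),
    (144, 0.0386), (162, 0.0334), (180, 0.0293), (198, 0.0295),
]

# (step, validation loss) pairs copied from the added (+) table rows.
NEW_RUN = [
    (1, 2.5224), (18, 0.8178), (36, 0.2672), (54, 0.1210),
    (72, 0.0818), (90, 0.0458), (108, 0.0363), (126, 0.0213),
    (144, 0.0362), (162, 0.0284), (180, 0.0221), (198, 0.0210),
]


def best_step(rows):
    """Return the (step, loss) pair with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])


def final_loss(rows):
    """Return the validation loss of the last logged evaluation."""
    return rows[-1][1]


if __name__ == "__main__":
    for name, rows in (("old", OLD_RUN), ("new", NEW_RUN)):
        step, loss = best_step(rows)
        print(f"{name}: best={loss} at step {step}, final={final_loss(rows)}")
```

In the removed table the lowest validation loss (0.0192 at step 126) occurs before the final evaluation, whose 0.0295 is the value reported under "Loss". In the added table the final evaluation at step 198 is also the lowest (0.0210).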
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ca1b3f41f48bd83f6939570330d3b5133250530794bf42ad3cc23a91023705b
-size 80115914
+oid sha256:85bb4b3e876da33f2bbc71ffbf9c5a7955242c2d50961739c57a68990b807677
+size 160069834
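
The adapter_model.bin change is a Git LFS pointer update, so only the object ID and size differ. The following is a minimal sketch, assuming the two pointer bodies are pasted in as strings; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not a Git LFS API.

```python
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4ca1b3f41f48bd83f6939570330d3b5133250530794bf42ad3cc23a91023705b
size 80115914
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:85bb4b3e876da33f2bbc71ffbf9c5a7955242c2d50961739c57a68990b807677
size 160069834
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a pointer file into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
ratio = new["size"] / old["size"]

if __name__ == "__main__":
    print(f"old size: {old['size']} bytes")
    print(f"new size: {new['size']} bytes")
    print(f"new/old ratio: {ratio:.3f}")
```

The new adapter file is just under twice the size of the old one. The diff itself does not state the cause, so any explanation (for example, a change in how the weights are stored) would be an assumption.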