DeepDream2045 commited on
Commit
32754b8
1 Parent(s): cd990b3

End of training

Browse files
Files changed (2) hide show
  1. README.md +4 -4
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -101,7 +101,7 @@ xformers_attention: true
101
 
102
  This model is a fine-tuned version of [NousResearch/Yarn-Llama-2-13b-128k](https://huggingface.co/NousResearch/Yarn-Llama-2-13b-128k) on the None dataset.
103
  It achieves the following results on the evaluation set:
104
- - Loss: 0.3135
105
 
106
  ## Model description
107
 
@@ -138,9 +138,9 @@ The following hyperparameters were used during training:
138
 
139
  | Training Loss | Epoch | Step | Validation Loss |
140
  |:-------------:|:------:|:----:|:---------------:|
141
- | 186.7866 | 0.0044 | 1 | 11.8610 |
142
- | 5.7838 | 0.1107 | 25 | 0.4496 |
143
- | 3.1469 | 0.2215 | 50 | 0.3135 |
144
 
145
 
146
  ### Framework versions
 
101
 
102
  This model is a fine-tuned version of [NousResearch/Yarn-Llama-2-13b-128k](https://huggingface.co/NousResearch/Yarn-Llama-2-13b-128k) on the None dataset.
103
  It achieves the following results on the evaluation set:
104
+ - Loss: 0.3082
105
 
106
  ## Model description
107
 
 
138
 
139
  | Training Loss | Epoch | Step | Validation Loss |
140
  |:-------------:|:------:|:----:|:---------------:|
141
+ | 186.8504 | 0.0044 | 1 | 11.8650 |
142
+ | 4.7856 | 0.1107 | 25 | 0.4165 |
143
+ | 3.5723 | 0.2215 | 50 | 0.3082 |
144
 
145
 
146
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3dfabc06bbbd50429a69f95ac6fb1104b4a615639ab71a034b39b0132d5c407a
3
  size 500897546
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5898d8fa2f9742dbe58c9b59258a6eb3efbe5d35c0df14e4e40f7b17c9a8d46
3
  size 500897546