bbytxt/5eb46825-b54e-4c8a-bb73-60acbff95428

Generated from Trainer

bbytxt committed on Dec 17, 2024

Commit d9e2776 · verified · 1 parent: 92c5895

End of training

Files changed (2)

README.md +3 -3
adapter_model.bin +1 -1

README.md CHANGED

@@ -109,7 +109,7 @@ xformers_attention: null
 This model is a fine-tuned version of [NousResearch/Yarn-Mistral-7b-128k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-128k) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0633
+- Loss: 1.0614
 ## Model description
@@ -144,8 +144,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 25.075        | 0.0031 | 1    | 1.5490          |
-| 17.9268       | 0.0773 | 25   | 1.0833          |
-| 19.1859       | 0.1546 | 50   | 1.0633          |
+| 17.915        | 0.0773 | 25   | 1.0824          |
+| 19.1739       | 0.1546 | 50   | 1.0614          |
 ### Framework versions

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3580985890e5c3207210c7739271cb82a895a44f217f08ebcdfea4f01d4bb7a3
+oid sha256:bc9c7e5dfb1216746b4fbd76a34339afd2d7e01034a4a2690666b25c7f83ac91
 size 335706186
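
The adapter_model.bin diff only changes a Git LFS pointer: the `oid` is the SHA-256 of the real file and `size` is its length in bytes. A downloaded copy can be checked against the pointer roughly as follows. This is a minimal sketch, not part of the commit; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into (sha256 hex digest, size in bytes)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    digest, size = parse_lfs_pointer(pointer_text)
    if os.path.getsize(path) != size:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so a file of several hundred MB is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest


# Pointer contents after this commit, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:bc9c7e5dfb1216746b4fbd76a34339afd2d7e01034a4a2690666b25c7f83ac91
size 335706186
"""
```

Calling `matches_pointer("adapter_model.bin", NEW_POINTER)` on a local copy (the path is an assumption) would return `True` only if the file is the one this commit points to. Since both pointers report the same size, only the hash distinguishes the old weights from the new ones.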