DeepDream2045
/

ed378998-4fc4-45f3-86eb-0ba0fd17d6e3

Generated from Trainer

Model card Files Files and versions Community

DeepDream2045 commited on Dec 15, 2024

Commit

2809e6d

·

verified ·

1 Parent(s): 5e1ad32

End of training

Files changed (2) hide show

README.md +3 -3
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -105,7 +105,7 @@ xformers_attention: true
 This model is a fine-tuned version of [NousResearch/Yarn-Solar-10b-32k](https://huggingface.co/NousResearch/Yarn-Solar-10b-32k) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6344
 ## Model description
@@ -143,8 +143,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 27.5824       | 0.0007 | 1    | 2.2696          |
-| 29.8551       | 0.0166 | 25   | 1.6893          |
-| 28.7613       | 0.0331 | 50   | 1.6344          |
 ### Framework versions

 This model is a fine-tuned version of [NousResearch/Yarn-Solar-10b-32k](https://huggingface.co/NousResearch/Yarn-Solar-10b-32k) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6362
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 27.5824       | 0.0007 | 1    | 2.2696          |
+| 29.8534       | 0.0166 | 25   | 1.6895          |
+| 28.7963       | 0.0331 | 50   | 1.6362          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ade4c8816db825cbd1030934c5933e04998f69d412d1713fe82d4ff7f7a3d16
 size 503559370

 version https://git-lfs.github.com/spec/v1
+oid sha256:a6361d274863de68b719d1266822facfb84a479065b525f2b92bb530a1c83567
 size 503559370