DeepDream2045
/

9a6f6d59-94a2-4723-9a5e-c0446b97f321

Generated from Trainer

Model card Files Files and versions Community

DeepDream2045 commited on Dec 13, 2024

Commit

32754b8

·

verified ·

1 Parent(s): cd990b3

End of training

Files changed (2) hide show

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -101,7 +101,7 @@ xformers_attention: true
 This model is a fine-tuned version of [NousResearch/Yarn-Llama-2-13b-128k](https://huggingface.co/NousResearch/Yarn-Llama-2-13b-128k) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3135
 ## Model description
@@ -138,9 +138,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 186.7866      | 0.0044 | 1    | 11.8610         |
-| 5.7838        | 0.1107 | 25   | 0.4496          |
-| 3.1469        | 0.2215 | 50   | 0.3135          |
 ### Framework versions

 This model is a fine-tuned version of [NousResearch/Yarn-Llama-2-13b-128k](https://huggingface.co/NousResearch/Yarn-Llama-2-13b-128k) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3082
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 186.8504      | 0.0044 | 1    | 11.8650         |
+| 4.7856        | 0.1107 | 25   | 0.4165          |
+| 3.5723        | 0.2215 | 50   | 0.3082          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3dfabc06bbbd50429a69f95ac6fb1104b4a615639ab71a034b39b0132d5c407a
 size 500897546

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5898d8fa2f9742dbe58c9b59258a6eb3efbe5d35c0df14e4e40f7b17c9a8d46
 size 500897546