End of training

Files changed (4)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1411
 ## Model description
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 4
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 2350
 ### Training results
@@ -143,8 +143,18 @@ The following hyperparameters were used during training:
 | 2.1867        | 0.36  | 2250 | 2.4132          |
 | 2.3178        | 0.36  | 2275 | 2.4120          |
 | 2.2948        | 0.37  | 2300 | 2.4071          |
-| 2.1932        | 0.37  | 2325 | 2.4067          |
-| 2.2373        | 0.38  | 2350 | 2.4121          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1373
 ## Model description
 - total_eval_batch_size: 4
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- training_steps: 2600
 ### Training results
 | 2.1867        | 0.36  | 2250 | 2.4132          |
 | 2.3178        | 0.36  | 2275 | 2.4120          |
 | 2.2948        | 0.37  | 2300 | 2.4071          |
+| 2.1954        | 0.37  | 2325 | 2.4108          |
+| 2.2368        | 0.38  | 2350 | 2.4105          |
+| 2.2714        | 0.38  | 2375 | 2.4060          |
+| 2.2808        | 0.38  | 2400 | 2.4097          |
+| 2.1327        | 0.39  | 2425 | 2.4075          |
+| 2.1245        | 0.39  | 2450 | 2.4101          |
+| 2.2168        | 0.4   | 2475 | 2.4119          |
+| 2.2988        | 0.4   | 2500 | 2.4106          |
+| 2.3049        | 0.4   | 2525 | 2.4084          |
+| 2.159         | 0.41  | 2550 | 2.4103          |
+| 2.183         | 0.41  | 2575 | 2.4105          |
+| 2.2598        | 0.42  | 2600 | 2.4103          |
 ### Framework versions
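
Reviewer note: the rows added in this hunk can be checked mechanically. The sketch below parses the table rows visible on the new side of the diff and picks the row with the lowest value in the last column. It assumes the four columns are training loss, epoch, step, and validation loss (the table header is not shown in this hunk), and the helper names `parse_rows` and `best_row` are hypothetical.

```python
# Rows copied from the new side of the README.md hunk above.
ROWS = """
| 2.1867        | 0.36  | 2250 | 2.4132          |
| 2.3178        | 0.36  | 2275 | 2.4120          |
| 2.2948        | 0.37  | 2300 | 2.4071          |
| 2.1954        | 0.37  | 2325 | 2.4108          |
| 2.2368        | 0.38  | 2350 | 2.4105          |
| 2.2714        | 0.38  | 2375 | 2.4060          |
| 2.2808        | 0.38  | 2400 | 2.4097          |
| 2.1327        | 0.39  | 2425 | 2.4075          |
| 2.1245        | 0.39  | 2450 | 2.4101          |
| 2.2168        | 0.4   | 2475 | 2.4119          |
| 2.2988        | 0.4   | 2500 | 2.4106          |
| 2.3049        | 0.4   | 2525 | 2.4084          |
| 2.159         | 0.41  | 2550 | 2.4103          |
| 2.183         | 0.41  | 2575 | 2.4105          |
| 2.2598        | 0.42  | 2600 | 2.4103          |
"""


def parse_rows(text):
    """Parse markdown table rows into (train_loss, epoch, step, val_loss) tuples.

    The column order is an assumption; the header row is outside this hunk.
    """
    parsed = []
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        if len(cells) != 4:
            continue  # skip anything that is not a four-column data row
        train_loss, epoch, step, val_loss = cells
        parsed.append((float(train_loss), float(epoch), int(step), float(val_loss)))
    return parsed


def best_row(parsed):
    """Return the row with the lowest value in the last (validation loss) column."""
    return min(parsed, key=lambda row: row[3])


if __name__ == "__main__":
    rows = parse_rows(ROWS)
    print("rows parsed:", len(rows))
    print("lowest last-column value:", best_row(rows))
```

Among the rows visible here, the lowest last-column value is 2.4060 at step 2375; earlier rows of the table are outside this hunk and are not considered.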

model-00001-of-00003.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a113d66a4be05676ae341bae839ddfd9e9dd52d76e92dda521ec3850bbbeaf4c
 size 4943162336

 version https://git-lfs.github.com/spec/v1
+oid sha256:10b554202030174ea6904d9f05952cb2bfb8dd8748c79b0987e2e0fba2b89f19
 size 4943162336

model-00002-of-00003.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d8904796973453fcfdba28b83a7a0f1352592155f1595a0e7034652b73eaf3e9
 size 4999819336

 version https://git-lfs.github.com/spec/v1
+oid sha256:f3be60e38ab8a979a51b987a3ab71010e7553a8b1f3ca94d3e6a3e71f6556c23
 size 4999819336

model-00003-of-00003.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c927970d1fa489791a695730e35f76d3f34391e921aa33b212ea1ef0309b63bc
 size 4540516344

 version https://git-lfs.github.com/spec/v1
+oid sha256:1463b312b5ded566aa2239c726eb6f4e08b82cc705e7854c17593e9d5faf28c2
 size 4540516344
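
Reviewer note: each `.safetensors` entry above is a Git LFS pointer (a `version` line, an `oid sha256:` digest, and a `size` in bytes), so only the digest changes while the shard sizes stay the same. A minimal sketch of checking a downloaded shard against its pointer follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage comment is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,          # "sha256" for the pointers in this diff
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte shards are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New-side pointer for model-00003-of-00003.safetensors, copied from the diff.
POINTER_3 = """
version https://git-lfs.github.com/spec/v1
oid sha256:1463b312b5ded566aa2239c726eb6f4e08b82cc705e7854c17593e9d5faf28c2
size 4540516344
"""

if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_3)
    print(pointer["algo"], pointer["size"])
    # Hypothetical local path; uncomment once the shard has been downloaded.
    # print(matches_pointer("model-00003-of-00003.safetensors", pointer))
```

Comparing the size first is a cheap early exit before hashing several gigabytes.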