End of training

Browse files

Files changed (9) hide show

README.md +22 -22
final_checkpoint/model-00001-of-00003.safetensors +1 -1
final_checkpoint/model-00002-of-00003.safetensors +1 -1
final_checkpoint/model-00003-of-00003.safetensors +1 -1
model-00001-of-00003.safetensors +1 -1
model-00002-of-00003.safetensors +1 -1
model-00003-of-00003.safetensors +1 -1
tokenizer.json +1 -6
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1550
 ## Model description
@@ -37,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-06
 - train_batch_size: 2
 - eval_batch_size: 1
 - seed: 42
@@ -52,26 +52,26 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.349         | 0.2667 | 50   | 0.3425          |
-| 0.2797        | 0.5333 | 100  | 0.2671          |
-| 0.1882        | 0.8    | 150  | 0.1831          |
-| 0.1593        | 1.0667 | 200  | 0.1748          |
-| 0.1609        | 1.3333 | 250  | 0.1586          |
-| 0.1544        | 1.6    | 300  | 0.1575          |
-| 0.1558        | 1.8667 | 350  | 0.1564          |
-| 0.1524        | 2.1333 | 400  | 0.1574          |
-| 0.1527        | 2.4    | 450  | 0.1568          |
-| 0.1564        | 2.6667 | 500  | 0.1554          |
-| 0.1523        | 2.9333 | 550  | 0.1563          |
-| 0.1519        | 3.2    | 600  | 0.1558          |
-| 0.1532        | 3.4667 | 650  | 0.1555          |
-| 0.1508        | 3.7333 | 700  | 0.1549          |
-| 0.1518        | 4.0    | 750  | 0.1550          |
-| 0.1507        | 4.2667 | 800  | 0.1551          |
-| 0.1501        | 4.5333 | 850  | 0.1550          |
-| 0.1477        | 4.8    | 900  | 0.1549          |
-| 0.1464        | 5.0667 | 950  | 0.1550          |
-| 0.1488        | 5.3333 | 1000 | 0.1550          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3111
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-07
 - train_batch_size: 2
 - eval_batch_size: 1
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.9785        | 0.2667 | 50   | 1.8984          |
+| 0.815         | 0.5333 | 100  | 0.6662          |
+| 0.4371        | 0.8    | 150  | 0.4306          |
+| 0.3721        | 1.0667 | 200  | 0.3807          |
+| 0.3439        | 1.3333 | 250  | 0.3367          |
+| 0.3251        | 1.6    | 300  | 0.3266          |
+| 0.3215        | 1.8667 | 350  | 0.3233          |
+| 0.3156        | 2.1333 | 400  | 0.3205          |
+| 0.3124        | 2.4    | 450  | 0.3183          |
+| 0.3165        | 2.6667 | 500  | 0.3161          |
+| 0.3128        | 2.9333 | 550  | 0.3130          |
+| 0.3093        | 3.2    | 600  | 0.3120          |
+| 0.311         | 3.4667 | 650  | 0.3109          |
+| 0.3073        | 3.7333 | 700  | 0.3112          |
+| 0.306         | 4.0    | 750  | 0.3115          |
+| 0.307         | 4.2667 | 800  | 0.3112          |
+| 0.3052        | 4.5333 | 850  | 0.3111          |
+| 0.3048        | 4.8    | 900  | 0.3105          |
+| 0.3034        | 5.0667 | 950  | 0.3111          |
+| 0.3057        | 5.3333 | 1000 | 0.3111          |
 ### Framework versions

final_checkpoint/model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5f0bd44b0aec034352c8f82114fed8acaae95d3058bfaf443a30be0eb4ca0ffe
 size 4943162240

 version https://git-lfs.github.com/spec/v1
+oid sha256:73cb93187755724d5577a051be4e5fef74b778d7afdbe51f40c8b679ed3fa99b
 size 4943162240

final_checkpoint/model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5168598622d24b96342b1732a9a8f0c3906f8abf0ded1b9c8fec445a1ae5e563
 size 4999819232

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c10f2f5904b01fc07c410cf56f4819a5c3c36d6f87523459cf506653fdc7d41
 size 4999819232

final_checkpoint/model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0597ec2e730048ec58afbdb17653d22f33989d255e8fa8e1c8f6071f49026128
 size 4540516256

 version https://git-lfs.github.com/spec/v1
+oid sha256:60a9b3c738d1e2d459799366526aae561a83de324fac79ca05058a8c5932c654
 size 4540516256

model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5f0bd44b0aec034352c8f82114fed8acaae95d3058bfaf443a30be0eb4ca0ffe
 size 4943162240

 version https://git-lfs.github.com/spec/v1
+oid sha256:73cb93187755724d5577a051be4e5fef74b778d7afdbe51f40c8b679ed3fa99b
 size 4943162240

model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5168598622d24b96342b1732a9a8f0c3906f8abf0ded1b9c8fec445a1ae5e563
 size 4999819232

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c10f2f5904b01fc07c410cf56f4819a5c3c36d6f87523459cf506653fdc7d41
 size 4999819232

model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0597ec2e730048ec58afbdb17653d22f33989d255e8fa8e1c8f6071f49026128
 size 4540516256

 version https://git-lfs.github.com/spec/v1
+oid sha256:60a9b3c738d1e2d459799366526aae561a83de324fac79ca05058a8c5932c654
 size 4540516256

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 50,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a0c76bc3fcd9269cf01f28b413e67acd85fed7194692ddbee11008ef00db1b3
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:bfec2f3deb1c823a48fc54f00cc972a71911c44148d2bd29a8372728e26edf5b
 size 5176