End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ai-forever/rugpt3medium_based_on_gpt2](https://huggingface.co/ai-forever/rugpt3medium_based_on_gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 9.1801
 ## Model description
@@ -49,14 +49,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 10.9006       | 0.37  | 20   | 10.5654         |
-| 10.3052       | 0.74  | 40   | 9.9389          |
-| 9.8311        | 1.1   | 60   | 9.6573          |
-| 9.6329        | 1.47  | 80   | 9.5473          |
-| 9.5378        | 1.84  | 100  | 9.4772          |
-| 9.466         | 2.21  | 120  | 9.3990          |
-| 9.3906        | 2.58  | 140  | 9.3004          |
-| 9.2874        | 2.94  | 160  | 9.1801          |
 ### Framework versions

 This model is a fine-tuned version of [ai-forever/rugpt3medium_based_on_gpt2](https://huggingface.co/ai-forever/rugpt3medium_based_on_gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 9.1812
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 10.8819       | 0.37  | 20   | 10.5196         |
+| 10.2462       | 0.74  | 40   | 9.9291          |
+| 9.8184        | 1.1   | 60   | 9.6638          |
+| 9.6334        | 1.47  | 80   | 9.5486          |
+| 9.5381        | 1.84  | 100  | 9.4843          |
+| 9.4918        | 2.21  | 120  | 9.4058          |
+| 9.3774        | 2.58  | 140  | 9.2999          |
+| 9.2801        | 2.94  | 160  | 9.1812          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f7d62f94326ceb501dda7949b4a31538b520a7913ee9a0ccedbf653599caf34d
 size 1423517184

 version https://git-lfs.github.com/spec/v1
+oid sha256:c9e649979d379e2e76ad7b7e92228930eb05aff9df66a61d2721cb790157eda6
 size 1423517184

runs/Dec28_06-55-41_782f0ab6a697/events.out.tfevents.1703746547.782f0ab6a697.686.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4791c3d1d9ea7d68e3b588cc6e8196f38330299b935e94c81433f87d824824d3
+size 8283

tokenizer.json CHANGED Viewed

@@ -4,18 +4,9 @@
     "direction": "Left",
     "max_length": 64,
     "strategy": "LongestFirst",
-    "stride": 10
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 64
-    },
-    "direction": "Left",
-    "pad_to_multiple_of": null,
-    "pad_id": 0,
-    "pad_type_id": 0,
-    "pad_token": "<pad>"
   },
   "added_tokens": [
     {
       "id": 0,

     "direction": "Left",
     "max_length": 64,
     "strategy": "LongestFirst",
+    "stride": 0
   },
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4468c13f6af905f827a766b673966a99fc6c77c374251ba294b7bf5c8a0ec57c
 size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:e93caf4f85bdf85b714651823fd49fea51d3b89d78a8f61830be568337a11e5d
 size 4600