Karzan/walamakan-t5-base

@@ -1,9 +1,7 @@
 ---
-base_model: Karzan/ckb-t5-base
 tags:
 - generated_from_trainer
-metrics:
-- bleu
 model-index:
 - name: walamakan-t5-base
   results: []
@@ -14,11 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
 # walamakan-t5-base
-This model is a fine-tuned version of [Karzan/ckb-t5-base](https://huggingface.co/Karzan/ckb-t5-base) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 3.5949
-- Bleu: 0.0
-- Gen Len: 19.0
 ## Model description
@@ -45,32 +39,13 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
-| No log        | 1.0   | 87   | 3.8199          | 0.0  | 19.0    |
-| No log        | 1.99  | 174  | 3.7846          | 0.0  | 19.0    |
-| No log        | 2.99  | 261  | 3.7542          | 0.0  | 19.0    |
-| No log        | 4.0   | 349  | 3.7276          | 0.0  | 19.0    |
-| No log        | 5.0   | 436  | 3.7072          | 0.0  | 19.0    |
-| 3.789         | 5.99  | 523  | 3.6894          | 0.0  | 19.0    |
-| 3.789         | 6.99  | 610  | 3.6707          | 0.0  | 19.0    |
-| 3.789         | 8.0   | 698  | 3.6597          | 0.0  | 19.0    |
-| 3.789         | 9.0   | 785  | 3.6480          | 0.0  | 19.0    |
-| 3.789         | 9.99  | 872  | 3.6374          | 0.0  | 19.0    |
-| 3.789         | 10.99 | 959  | 3.6291          | 0.0  | 19.0    |
-| 3.6041        | 12.0  | 1047 | 3.6200          | 0.0  | 19.0    |
-| 3.6041        | 13.0  | 1134 | 3.6133          | 0.0  | 19.0    |
-| 3.6041        | 13.99 | 1221 | 3.6092          | 0.0  | 19.0    |
-| 3.6041        | 14.99 | 1308 | 3.6037          | 0.0  | 19.0    |
-| 3.6041        | 16.0  | 1396 | 3.6002          | 0.0  | 19.0    |
-| 3.6041        | 17.0  | 1483 | 3.5988          | 0.0  | 19.0    |
-| 3.5111        | 17.99 | 1570 | 3.5962          | 0.0  | 19.0    |
-| 3.5111        | 18.99 | 1657 | 3.5952          | 0.0  | 19.0    |
-| 3.5111        | 19.94 | 1740 | 3.5949          | 0.0  | 19.0    |
 ### Framework versions

 ---
+base_model: Karzan/walamakan-t5-base
 tags:
 - generated_from_trainer
 model-index:
 - name: walamakan-t5-base
   results: []
 # walamakan-t5-base
+This model is a fine-tuned version of [Karzan/walamakan-t5-base](https://huggingface.co/Karzan/walamakan-t5-base) on an unknown dataset.
 ## Model description
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
+| No log        | 1.0   | 87   | 0.8475          | 0.0  | 19.0    |
 ### Framework versions
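
The removed training-results rows pair each logged epoch with an optimizer step, which allows a rough estimate of the training-set size. The sketch below fits steps per epoch from a handful of those rows and multiplies by `total_train_batch_size`. It assumes every optimizer step consumes one full batch of 32 examples, which the diff does not confirm; the function name `fit_steps_per_epoch` is hypothetical.

```python
# (epoch, step) pairs copied from the removed training-results rows.
ROWS = [(1.0, 87), (4.0, 349), (8.0, 698), (12.0, 1047), (16.0, 1396), (19.94, 1740)]

# Value listed under the training hyperparameters.
TOTAL_TRAIN_BATCH_SIZE = 32


def fit_steps_per_epoch(rows):
    """Least-squares slope through the origin for step = rate * epoch.

    The logged epoch values are rounded to two decimals, so a fit over
    several rows is less sensitive to rounding than any single row.
    """
    numerator = sum(epoch * step for epoch, step in rows)
    denominator = sum(epoch * epoch for epoch, _ in rows)
    return numerator / denominator


steps_per_epoch = fit_steps_per_epoch(ROWS)

# Assumption: one optimizer step == one full batch of 32 examples.
estimated_examples = steps_per_epoch * TOTAL_TRAIN_BATCH_SIZE

print(f"steps per epoch: about {steps_per_epoch:.2f}")
print(f"training examples: about {estimated_examples:.0f}")
```

The same rate also explains why the single-epoch run on the added side stops at step 87.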

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "Karzan/ckb-t5-base",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

 {
+  "_name_or_path": "Karzan/walamakan-t5-base",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8a14db0ab211869e72550d7c7a69ba8ad0bf95b5a0b7d21fd8667a9b1adb79ed
 size 990236853

 version https://git-lfs.github.com/spec/v1
+oid sha256:e86c33bcf4ccf7f51114547dfa7f0a6d4a9b3d9de946ead8601a0e10bf23a0e7
 size 990236853
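
The `pytorch_model.bin` entries above are Git LFS pointer files, not the weights themselves: each records a spec version, a SHA-256 object id, and a byte size. Only the `oid` changes in this commit; the size stays at 990236853 bytes. A minimal sketch of checking a downloaded file against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Hash in chunks so a ~1 GB checkpoint is never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for the new pytorch_model.bin, copied from the diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:e86c33bcf4ccf7f51114547dfa7f0a6d4a9b3d9de946ead8601a0e10bf23a0e7
size 990236853
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With a local copy of the weights, `matches_pointer("pytorch_model.bin", pointer)` would report whether the download corresponds to this commit rather than the previous one.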

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3fa8f85dc90dd4d5345aab91a93f1940c5d184a32929dabc18d9f22fb9f6c5e0
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:2345cf4362ed8ef200a0a9b9ae41652f8270b0a6215eac1e55feebfb022c9005
 size 4155