Mayypeeya
/

mt5_thaisum_model

@@ -1,5 +1,6 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
@@ -21,7 +22,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 0.1432
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,12 +32,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the thaisum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3540
-- Rouge1: 0.1432
-- Rouge2: 0.041
-- Rougel: 0.1423
-- Rougelsum: 0.142
-- Gen Len: 18.933
 ## Model description
@@ -55,9 +56,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -67,15 +68,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 2.4271        | 1.0   | 2500  | 0.3976          | 0.1306 | 0.0372 | 0.1302 | 0.1299    | 18.9665 |
-| 2.1992        | 2.0   | 5000  | 0.3720          | 0.1392 | 0.0376 | 0.1382 | 0.1384    | 18.922  |
-| 2.1687        | 3.0   | 7500  | 0.3599          | 0.1401 | 0.0391 | 0.1394 | 0.1389    | 18.9215 |
-| 2.1096        | 4.0   | 10000 | 0.3540          | 0.1432 | 0.041  | 0.1423 | 0.142     | 18.933  |
 ### Framework versions
-- Transformers 4.30.2
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
 - Tokenizers 0.13.3

 ---
 license: apache-2.0
+base_model: google/mt5-base
 tags:
 - generated_from_trainer
 datasets:
     metrics:
     - name: Rouge1
       type: rouge
+      value: 0.2017
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the thaisum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3039
+- Rouge1: 0.2017
+- Rouge2: 0.0806
+- Rougel: 0.2016
+- Rougelsum: 0.2017
+- Gen Len: 18.9995
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 2.0742        | 1.0   | 5000  | 0.3272          | 0.1713 | 0.055  | 0.1703 | 0.1716    | 18.9945 |
+| 1.7874        | 2.0   | 10000 | 0.3073          | 0.194  | 0.0742 | 0.1942 | 0.194     | 18.997  |
+| 1.6341        | 3.0   | 15000 | 0.3035          | 0.2002 | 0.0804 | 0.1999 | 0.2002    | 19.0    |
+| 1.4501        | 4.0   | 20000 | 0.3039          | 0.2017 | 0.0806 | 0.2016 | 0.2017    | 18.9995 |
 ### Framework versions
+- Transformers 4.31.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
 - Tokenizers 0.13.3