ubaada
/

long-t5-tglobal-base

@@ -17,10 +17,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9565
-- Rouge1: 0.1423
-- Rouge2: 0.0178
-- Rougel: 0.0928
 ## Model description
@@ -39,7 +39,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 8e-05
 - train_batch_size: 8
 - eval_batch_size: 1
 - seed: 42
@@ -50,7 +50,7 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 2
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
@@ -59,6 +59,12 @@ The following hyperparameters were used during training:
 | 1.5731        | 0.9996 | 600  | 1.9730          | 0.1342 | 0.0151 | 0.0912 |
 | 1.3694        | 1.9996 | 1200 | 1.9623          | 0.1371 | 0.0175 | 0.0909 |
 | 1.9561        | 2.9992 | 1800 | 1.9565          | 0.1423 | 0.0178 | 0.0928 |
 ### Framework versions

 This model is a fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9401
+- Rouge1: 0.1934
+- Rouge2: 0.0269
+- Rougel: 0.1151
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 8
 - eval_batch_size: 1
 - seed: 42
 - total_eval_batch_size: 2
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 9
 ### Training results
 | 1.5731        | 0.9996 | 600  | 1.9730          | 0.1342 | 0.0151 | 0.0912 |
 | 1.3694        | 1.9996 | 1200 | 1.9623          | 0.1371 | 0.0175 | 0.0909 |
 | 1.9561        | 2.9992 | 1800 | 1.9565          | 0.1423 | 0.0178 | 0.0928 |
+| 1.0882        | 3.9996 | 2400 | 1.9548          | 0.1417 | 0.0186 | 0.0900 |
+| 1.4872        | 4.9992 | 3000 | 1.9412          | 0.1581 | 0.0212 | 0.1006 |
+| 1.4126        | 5.9988 | 3600 | 1.9486          | 0.1589 | 0.0188 | 0.0986 |
+| 1.1634        | 7.0    | 4201 | 1.9464          | 0.1756 | 0.0229 | 0.1046 |
+| 0.9541        | 7.9996 | 4801 | 1.9401          | 0.1791 | 0.0243 | 0.1078 |
+| 0.9153        | 8.9975 | 5400 | 1.9401          | 0.1934 | 0.0269 | 0.1151 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:84f21eff3ce6deda24ca6e879a245eb12d871429288d8890407f15c1c1bb1a82
 size 990386200

 version https://git-lfs.github.com/spec/v1
+oid sha256:e40fbb8361d246d53f4edae35d42da61366d34201eec62f22ac6d4b04a2e99b2
 size 990386200

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:296154d90ec6ea346feed2cef6a2e1a56591d88c5f82648a51e3b732668f6e6b
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:c611d9f9ff8ec098da6eb03453782a6458e06a90779c1927d1d621248f690dfb
 size 6776