ahmeddbahaa/AraBART-finetuned-ar

@@ -5,22 +5,9 @@ tags:
 - generated_from_trainer
 datasets:
 - xlsum
-metrics:
-- rouge
 model-index:
 - name: AraBART-finetuned-ar
-  results:
-  - task:
-      name: Sequence-to-sequence Language Modeling
-      type: text2text-generation
-    dataset:
-      name: xlsum
-      type: xlsum
-      args: arabic
-    metrics:
-    - name: Rouge1
-      type: rouge
-      value: 2.2459
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [moussaKam/AraBART](https://huggingface.co/moussaKam/AraBART) on the xlsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3785
-- Rouge1: 2.2459
-- Rouge2: 0.0
-- Rougel: 2.2459
-- Rougelsum: 2.2459
-- Gen Len: 19.695
 ## Model description
@@ -55,35 +42,34 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
-- gradient_accumulation_steps: 16
-- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.6
 - num_epochs: 10
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 0.98  | 32   | 2.7774          | 1.0638 | 0.0    | 1.0638 | 1.182     | 19.5177 |
-| No log        | 1.98  | 64   | 2.4730          | 1.182  | 0.0    | 1.3002 | 1.182     | 19.8121 |
-| No log        | 2.98  | 96   | 2.4129          | 2.3641 | 0.3546 | 2.3641 | 2.3641    | 19.8298 |
-| No log        | 3.98  | 128  | 2.3724          | 2.1277 | 0.3546 | 2.1277 | 2.1277    | 19.8121 |
-| No log        | 4.98  | 160  | 2.3560          | 1.8913 | 0.3546 | 1.8913 | 1.8913    | 19.805  |
-| No log        | 5.98  | 192  | 2.3574          | 1.5366 | 0.0    | 1.5366 | 1.6548    | 19.7979 |
-| No log        | 6.98  | 224  | 2.3676          | 2.1277 | 0.3546 | 2.2459 | 2.1277    | 19.6348 |
-| No log        | 7.98  | 256  | 2.3656          | 2.0095 | 0.0    | 2.0095 | 2.0095    | 19.844  |
-| No log        | 8.98  | 288  | 2.3751          | 2.2459 | 0.0    | 2.3641 | 2.2459    | 19.6738 |
-| No log        | 9.98  | 320  | 2.3785          | 2.2459 | 0.0    | 2.2459 | 2.2459    | 19.695  |
 ### Framework versions
-- Transformers 4.17.0
 - Pytorch 1.10.0+cu111
-- Datasets 2.0.0
-- Tokenizers 0.11.6
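
The removed hyperparameters imply two derived quantities that can be checked by hand: the effective batch size and the number of warmup steps. The sketch below is a minimal check, not the Trainer's own code. It assumes a single device and assumes that the warmup length is the warmup ratio times the total step count, rounded up. The variable names are illustrative.

```python
import math

# Values copied from the removed hyperparameter list and results table.
train_batch_size = 4
gradient_accumulation_steps = 16
lr_scheduler_warmup_ratio = 0.6
total_steps = 320  # last "Step" value in the removed results table

# Assumption: one device, so the effective batch size is the per-device
# batch size times the number of accumulated steps.
effective_batch_size = train_batch_size * gradient_accumulation_steps

# Assumption: warmup length is ratio * total steps, rounded up.
warmup_steps = math.ceil(total_steps * lr_scheduler_warmup_ratio)

print(effective_batch_size)  # agrees with total_train_batch_size: 64
print(warmup_steps)
```

Under these assumptions, more than half of the 320 optimizer steps in the removed run were spent in warmup.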

 - generated_from_trainer
 datasets:
 - xlsum
 model-index:
 - name: AraBART-finetuned-ar
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [moussaKam/AraBART](https://huggingface.co/moussaKam/AraBART) on the xlsum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.7449
+- Rouge-1: 31.08
+- Rouge-2: 14.68
+- Rouge-l: 27.36
+- Gen Len: 19.64
+- Bertscore: 73.86
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 250
 - num_epochs: 10
+- label_smoothing_factor: 0.1
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge-1 | Rouge-2 | Rouge-l | Gen Len | Bertscore |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:-------:|:---------:|
+| 4.4318        | 1.0   | 2345  | 3.7996          | 28.93   | 13.2    | 25.56   | 19.51   | 73.17     |
+| 4.0338        | 2.0   | 4690  | 3.7483          | 30.29   | 14.24   | 26.73   | 19.5    | 73.59     |
+| 3.8586        | 3.0   | 7035  | 3.7281          | 30.44   | 14.44   | 26.92   | 19.75   | 73.58     |
+| 3.7289        | 4.0   | 9380  | 3.7204          | 30.55   | 14.49   | 26.88   | 19.66   | 73.73     |
+| 3.6245        | 5.0   | 11725 | 3.7199          | 30.73   | 14.63   | 27.11   | 19.69   | 73.68     |
+| 3.5392        | 6.0   | 14070 | 3.7221          | 30.85   | 14.65   | 27.21   | 19.7    | 73.77     |
+| 3.4694        | 7.0   | 16415 | 3.7286          | 31.08   | 14.8    | 27.41   | 19.62   | 73.84     |
+| 3.4126        | 8.0   | 18760 | 3.7384          | 31.06   | 14.77   | 27.41   | 19.64   | 73.82     |
+| 3.3718        | 9.0   | 21105 | 3.7398          | 31.18   | 14.89   | 27.49   | 19.67   | 73.87     |
+| 3.3428        | 10.0  | 23450 | 3.7449          | 31.19   | 14.88   | 27.44   | 19.68   | 73.87     |
 ### Framework versions
+- Transformers 4.18.0
 - Pytorch 1.10.0+cu111
+- Datasets 2.1.0
+- Tokenizers 0.12.1
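
The added hyperparameters combine `lr_scheduler_type: linear` with `lr_scheduler_warmup_steps: 250`. A minimal pure-Python sketch of such a schedule follows, assuming the usual shape: a linear ramp from zero to the peak rate over the warmup steps, then linear decay to zero at the last step. The function name and constants are illustrative and do not come from the training script; the total step count is the last `Step` value in the added results table.

```python
def linear_schedule_with_warmup(step, base_lr, warmup_steps, total_steps):
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Decay from base_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


BASE_LR = 5e-05       # learning_rate
WARMUP_STEPS = 250    # lr_scheduler_warmup_steps
TOTAL_STEPS = 23450   # final Step in the added results table

# Assumption: no gradient accumulation, since none is listed in the added
# hyperparameters. The last batch of an epoch may be partial, so this is
# an upper bound on examples seen per epoch.
STEPS_PER_EPOCH = 2345
TRAIN_BATCH_SIZE = 16
max_examples_per_epoch = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE

for step in (0, 125, 250, 11725, 23450):
    lr = linear_schedule_with_warmup(step, BASE_LR, WARMUP_STEPS, TOTAL_STEPS)
    print(f"step {step:>5}: lr = {lr:.3e}")
```

With these values the warmup covers roughly 1% of training, compared with 60% under the removed `lr_scheduler_warmup_ratio: 0.6` setting.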