End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,6 +1,5 @@
 ---
-base_model: t5-small
-license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
@@ -13,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
 # angika-to-english-translation
-This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1153
 ## Model description
@@ -35,20 +34,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.1274        | 1.0   | 2563 | 0.1180          |
-| 0.1251        | 2.0   | 5126 | 0.1158          |
-| 0.1249        | 3.0   | 7689 | 0.1153          |
 ### Framework versions

 ---
+base_model: ai4bharat/IndicBART
 tags:
 - generated_from_trainer
 model-index:
 # angika-to-english-translation
+This model is a fine-tuned version of [ai4bharat/IndicBART](https://huggingface.co/ai4bharat/IndicBART) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.8315
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 8.3922        | 0.9998 | 1281 | 7.8789          |
+| 7.2908        | 1.9996 | 2562 | 7.0937          |
+| 6.9378        | 2.9994 | 3843 | 6.8315          |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "_from_model_config": true,
-  "decoder_start_token_id": 0,
-  "eos_token_id": 1,
   "pad_token_id": 0,
   "transformers_version": "4.42.4"
 }

 {
   "_from_model_config": true,
+  "bos_token_id": 64000,
+  "eos_token_id": 64001,
   "pad_token_id": 0,
   "transformers_version": "4.42.4"
 }

runs/Aug24_09-21-33_70d685c0de53/events.out.tfevents.1724491329.70d685c0de53.1018.5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f95df4fa7aebb047a46b5ef27ea142c45461a69256cfacf902b08ffe2574382c
-size 7445

 version https://git-lfs.github.com/spec/v1
+oid sha256:018847db19117666a35a667b96415fcc6d4a1e1d13b2da7956f05aef91ad5c25
+size 8070