Azad commited on
Commit
53b8019
1 Parent(s): 69ea3de

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,5 @@
1
  ---
2
- base_model: t5-small
3
- license: apache-2.0
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -13,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # angika-to-english-translation
15
 
16
- This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.1153
19
 
20
  ## Model description
21
 
@@ -35,20 +34,22 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
- - train_batch_size: 16
39
- - eval_batch_size: 16
40
  - seed: 42
 
 
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - num_epochs: 3
44
 
45
  ### Training results
46
 
47
- | Training Loss | Epoch | Step | Validation Loss |
48
- |:-------------:|:-----:|:----:|:---------------:|
49
- | 0.1274 | 1.0 | 2563 | 0.1180 |
50
- | 0.1251 | 2.0 | 5126 | 0.1158 |
51
- | 0.1249 | 3.0 | 7689 | 0.1153 |
52
 
53
 
54
  ### Framework versions
 
1
  ---
2
+ base_model: ai4bharat/IndicBART
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
 
12
 
13
  # angika-to-english-translation
14
 
15
+ This model is a fine-tuned version of [ai4bharat/IndicBART](https://huggingface.co/ai4bharat/IndicBART) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 6.8315
18
 
19
  ## Model description
20
 
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 2e-05
37
+ - train_batch_size: 8
38
+ - eval_batch_size: 8
39
  - seed: 42
40
+ - gradient_accumulation_steps: 4
41
+ - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
  - num_epochs: 3
45
 
46
  ### Training results
47
 
48
+ | Training Loss | Epoch | Step | Validation Loss |
49
+ |:-------------:|:------:|:----:|:---------------:|
50
+ | 8.3922 | 0.9998 | 1281 | 7.8789 |
51
+ | 7.2908 | 1.9996 | 2562 | 7.0937 |
52
+ | 6.9378 | 2.9994 | 3843 | 6.8315 |
53
 
54
 
55
  ### Framework versions
generation_config.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "_from_model_config": true,
3
- "decoder_start_token_id": 0,
4
- "eos_token_id": 1,
5
  "pad_token_id": 0,
6
  "transformers_version": "4.42.4"
7
  }
 
1
  {
2
  "_from_model_config": true,
3
+ "bos_token_id": 64000,
4
+ "eos_token_id": 64001,
5
  "pad_token_id": 0,
6
  "transformers_version": "4.42.4"
7
  }
runs/Aug24_09-21-33_70d685c0de53/events.out.tfevents.1724491329.70d685c0de53.1018.5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f95df4fa7aebb047a46b5ef27ea142c45461a69256cfacf902b08ffe2574382c
3
- size 7445
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:018847db19117666a35a667b96415fcc6d4a1e1d13b2da7956f05aef91ad5c25
3
+ size 8070