sagorbert_nwp_finetuning_def_v3
This model is a fine-tuned version of sagorsarker/bangla-bert-base on the None dataset. It achieves the following results on the evaluation set:
- Loss: 2.7146
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 50
Training results
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
4.1717 | 1.0 | 1551 | 3.9996 |
3.7173 | 2.0 | 3102 | 3.6406 |
3.4565 | 3.0 | 4653 | 3.4235 |
3.2657 | 4.0 | 6204 | 3.2330 |
3.1522 | 5.0 | 7755 | 3.2134 |
3.0686 | 6.0 | 9306 | 3.1522 |
2.9315 | 7.0 | 10857 | 3.0937 |
2.8902 | 8.0 | 12408 | 3.0556 |
2.7995 | 9.0 | 13959 | 3.0475 |
2.7451 | 10.0 | 15510 | 2.9813 |
2.7015 | 11.0 | 17061 | 2.9560 |
2.6528 | 12.0 | 18612 | 2.9613 |
2.5797 | 13.0 | 20163 | 2.9195 |
2.5343 | 14.0 | 21714 | 2.8609 |
2.4927 | 15.0 | 23265 | 2.8933 |
2.4433 | 16.0 | 24816 | 2.8718 |
2.3995 | 17.0 | 26367 | 2.8405 |
2.3875 | 18.0 | 27918 | 2.8703 |
2.3171 | 19.0 | 29469 | 2.8371 |
2.319 | 20.0 | 31020 | 2.8027 |
2.2824 | 21.0 | 32571 | 2.7959 |
2.2633 | 22.0 | 34122 | 2.8165 |
2.2149 | 23.0 | 35673 | 2.7747 |
2.1812 | 24.0 | 37224 | 2.7879 |
2.1677 | 25.0 | 38775 | 2.7723 |
2.1521 | 26.0 | 40326 | 2.7887 |
2.14 | 27.0 | 41877 | 2.7839 |
2.059 | 28.0 | 43428 | 2.8150 |
2.0881 | 29.0 | 44979 | 2.7617 |
2.0583 | 30.0 | 46530 | 2.7491 |
2.0574 | 31.0 | 48081 | 2.7303 |
2.0416 | 32.0 | 49632 | 2.7490 |
1.9837 | 33.0 | 51183 | 2.7419 |
1.9747 | 34.0 | 52734 | 2.7409 |
1.9486 | 35.0 | 54285 | 2.7757 |
1.941 | 36.0 | 55836 | 2.7546 |
1.9549 | 37.0 | 57387 | 2.7046 |
1.9346 | 38.0 | 58938 | 2.7700 |
1.8979 | 39.0 | 60489 | 2.7033 |
1.9104 | 40.0 | 62040 | 2.7383 |
1.8989 | 41.0 | 63591 | 2.6837 |
1.8691 | 42.0 | 65142 | 2.7084 |
1.8492 | 43.0 | 66693 | 2.7000 |
1.8271 | 44.0 | 68244 | 2.6792 |
1.8723 | 45.0 | 69795 | 2.7325 |
1.8208 | 46.0 | 71346 | 2.6998 |
1.8218 | 47.0 | 72897 | 2.7490 |
1.8305 | 48.0 | 74448 | 2.7394 |
1.8067 | 49.0 | 75999 | 2.6545 |
1.7974 | 50.0 | 77550 | 2.6925 |
Framework versions
- Transformers 4.38.1
- Pytorch 2.1.0+cu121
- Datasets 2.17.1
- Tokenizers 0.15.2
- Downloads last month
- 9
Model tree for Hamza11/sagorbert_nwp_finetuning_def_v3
Base model
sagorsarker/bangla-bert-base