---
license: mit
base_model: flax-community/spanish-t5-small
tags:
- generated_from_trainer
metrics:
- rouge
model-index:
- name: augmented_t5_pictos
  results: []
---

# augmented_t5_pictos

This model is a fine-tuned version of [flax-community/spanish-t5-small](https://huggingface.co/flax-community/spanish-t5-small) on an unspecified dataset (the training dataset was not recorded in this card).
It achieves the following results on the evaluation set after the final epoch:

- Loss: 1.4052
- Rouge1: 42.0934
- Rouge2: 28.3804
- Rougel: 41.2489
- Rougelsum: 41.3148
- Gen Len: 8.4979

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:

- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
| 2.3625        | 1.0   | 527  | 2.0249          | 34.7282 | 19.5441 | 33.9362 | 33.9053   | 7.6303  |
| 1.9453        | 2.0   | 1054 | 1.7992          | 37.5587 | 22.5669 | 36.6904 | 36.7767   | 8.2671  |
| 1.7429        | 3.0   | 1581 | 1.6655          | 39.7178 | 24.1301 | 38.8293 | 38.8615   | 8.3462  |
| 1.6306        | 4.0   | 2108 | 1.5791          | 40.8591 | 26.0948 | 40.088  | 40.0812   | 7.9573  |
| 1.461         | 5.0   | 2635 | 1.5204          | 40.6998 | 25.5825 | 39.8972 | 39.9642   | 8.3568  |
| 1.4178        | 6.0   | 3162 | 1.4702          | 41.0675 | 26.4585 | 40.1879 | 40.2333   | 8.4679  |
| 1.3394        | 7.0   | 3689 | 1.4452          | 41.9649 | 27.1925 | 41.0945 | 41.1233   | 8.2329  |
| 1.2844        | 8.0   | 4216 | 1.4210          | 41.9633 | 27.7102 | 40.9804 | 41.0706   | 8.5406  |
| 1.2151        | 9.0   | 4743 | 1.4072          | 41.9336 | 28.0917 | 41.0597 | 41.1328   | 8.4722  |
| 1.215         | 10.0  | 5270 | 1.4052          | 42.0934 | 28.3804 | 41.2489 | 41.3148   | 8.4979  |

### Framework versions

- Transformers 4.34.1
- Pytorch 2.1.0+cu118
- Datasets 2.14.6
- Tokenizers 0.14.1
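
### Learning-rate schedule sketch

The card lists `lr_scheduler_type: linear` with `learning_rate: 2e-05` over 10 epochs of 527 steps each. The snippet below is a minimal sketch of what that schedule implies at each epoch boundary. It is not the training script used for this model. It assumes zero warmup steps (the card does not list any), and the names `linear_lr` and `max_training_examples` are hypothetical helpers introduced only for illustration. The dataset-size estimate additionally assumes a single device and no gradient accumulation, neither of which the card states.

```python
# Values copied from the model card.
BASE_LR = 2e-05
NUM_EPOCHS = 10
STEPS_PER_EPOCH = 527   # step count logged at epoch 1.0
TRAIN_BATCH_SIZE = 8

# Assumption: no warmup steps (the card does not list any).
WARMUP_STEPS = 0

TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH  # 5270, the step in the last table row


def linear_lr(step: int) -> float:
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


def max_training_examples() -> int:
    """Upper bound on training-set size.

    Assumption: one device and no gradient accumulation, so each step
    consumes at most TRAIN_BATCH_SIZE examples.
    """
    return STEPS_PER_EPOCH * TRAIN_BATCH_SIZE


if __name__ == "__main__":
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch:2d}  step {step:4d}  lr {linear_lr(step):.2e}")
    print("training examples (upper bound):", max_training_examples())
```

Under these assumptions the learning rate has fallen to half its initial value by the end of epoch 5 and reaches zero at step 5270, which may help explain why the validation loss and ROUGE scores change very little over the last few epochs.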