--- license: apache-2.0 tags: - generated_from_trainer model-index: - name: alltasks_m1-t1 results: [] --- # alltasks_m1-t1 This model is a fine-tuned version of [yuchenlin/BART0pp](https://huggingface.co/yuchenlin/BART0pp) on an unknown dataset. It achieves the following results on the evaluation set: - Loss: 1.8914 - Train Runtime: 12625.9615 - Train Samples Per Second: 57.001 - Train Steps Per Second: 0.792 - Train Loss: 1.6667 - Train Samples: 239899 - Gen Len: 9.9497 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 5e-05 - train_batch_size: 9 - eval_batch_size: 9 - seed: 42 - distributed_type: multi-GPU - num_devices: 8 - total_train_batch_size: 72 - total_eval_batch_size: 72 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lr_scheduler_type: linear - num_epochs: 3.0 ### Training results | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Accuracy | F1 | Recall | Precision | Gen Len | |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:--------:|:-------:|:-------:|:---------:|:-------:| | 1.9907 | 0.15 | 500 | 2.3435 | 50.3191 | 6.4838 | 49.7719 | 50.0456 | 55.5972 | 55.5972 | 55.5972 | 55.5972 | 8.8197 | | 1.9578 | 0.3 | 1000 | 2.0301 | 54.8237 | 7.033 | 54.3422 | 54.4676 | 61.3115 | 61.3115 | 61.3115 | 61.3115 | 8.0583 | | 1.8599 | 0.45 | 1500 | 1.9683 | 58.0535 | 6.4621 | 57.5215 | 57.7813 | 66.2295 | 66.2295 | 66.2295 | 66.2295 | 8.1403 | | 1.861 | 0.6 | 2000 | 1.9899 | 60.2053 | 6.6431 | 59.6317 | 59.8907 | 69.0867 | 69.0867 | 69.0867 | 69.0867 | 8.4773 | | 1.7464 | 0.75 | 2500 | 1.9600 | 61.3403 | 6.6424 | 60.8196 | 61.0684 | 70.726 | 70.726 | 70.726 | 70.726 | 8.4747 | | 1.8516 | 0.9 | 3000 | 1.9506 | 59.7834 | 6.4538 | 59.2387 | 59.5396 | 68.8993 | 68.8993 | 68.8993 | 68.8993 | 8.5043 | | 1.6371 | 1.05 | 3500 | 1.9415 | 60.9397 | 6.6405 | 60.3836 | 60.6176 | 70.1639 | 70.1639 | 70.1639 | 70.1639 | 8.1427 | | 1.643 | 1.2 | 4000 | 1.9433 | 62.7362 | 6.8939 | 62.1572 | 62.4167 | 72.4122 | 72.4122 | 72.4122 | 72.4122 | 7.9857 | | 1.6193 | 1.35 | 4500 | 1.9296 | 61.3662 | 6.7287 | 60.8375 | 61.1083 | 70.8197 | 70.8197 | 70.8197 | 70.8197 | 8.4563 | | 1.6593 | 1.5 | 5000 | 1.9060 | 63.089 | 6.7619 | 62.5142 | 62.8447 | 73.1616 | 73.1616 | 73.1616 | 73.1616 | 8.42 | | 1.6716 | 1.65 | 5500 | 1.9133 | 63.2106 | 6.7486 | 62.5549 | 62.9047 | 73.2553 | 73.2553 | 73.2553 | 73.2553 | 8.362 | | 1.5638 | 1.8 | 6000 | 1.8967 | 63.5146 | 6.9202 | 62.9517 | 63.1969 | 73.4895 | 73.4895 | 73.4895 | 73.4895 | 8.28 | | 1.5614 | 1.95 | 6500 | 1.8835 | 63.3545 | 6.9092 | 62.7955 | 63.0354 | 73.2084 | 73.2084 | 73.2084 | 73.2084 | 8.2333 | | 1.4675 | 2.1 | 7000 | 1.9220 | 63.465 | 6.7168 | 62.9135 | 63.2247 | 73.63 | 73.63 | 73.63 | 73.63 | 8.1323 | | 1.4402 | 2.25 | 7500 | 1.9425 | 64.0073 | 7.0859 | 63.4022 | 63.7246 | 73.8642 | 73.8642 | 73.8642 | 73.8642 | 8.1393 | | 1.4655 | 2.4 | 8000 | 1.9142 | 64.366 | 6.8629 | 63.7608 | 64.0938 | 74.5667 | 74.5667 | 74.5667 | 74.5667 | 8.1717 | | 1.4741 | 2.55 | 8500 | 1.9238 | 64.022 | 6.8364 | 63.4035 | 63.7259 | 74.192 | 74.192 | 74.192 | 74.192 | 8.1777 | | 1.4335 | 2.7 | 9000 | 1.9001 | 64.8286 | 6.9507 | 64.159 | 64.5065 | 75.0351 | 75.0351 | 75.0351 | 75.0351 | 8.1387 | | 1.5305 | 2.85 | 9500 | 1.8914 | 64.895 | 6.9613 | 64.2636 | 64.5959 | 75.1288 | 75.1288 | 75.1288 | 75.1288 | 8.2063 | ### Framework versions - Transformers 4.20.1 - Pytorch 1.11.0 - Datasets 2.3.2 - Tokenizers 0.12.1