---
license: apache-2.0
tags:
- generated_from_trainer
model-index:
- name: alltasks_m1-t1
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# alltasks_m1-t1

This model is a fine-tuned version of [yuchenlin/BART0pp](https://huggingface.co/yuchenlin/BART0pp) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 1.8914
- Train Runtime: 12625.9615
- Train Samples Per Second: 57.001
- Train Steps Per Second: 0.792
- Train Loss: 1.6667
- Train Samples: 239899
- Gen Len: 9.9497

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 9
- eval_batch_size: 9
- seed: 42
- distributed_type: multi-GPU
- num_devices: 8
- total_train_batch_size: 72
- total_eval_batch_size: 72
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3.0

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Accuracy | F1      | Recall  | Precision | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:--------:|:-------:|:-------:|:---------:|:-------:|
| 1.9907        | 0.15  | 500  | 2.3435          | 50.3191 | 6.4838 | 49.7719 | 50.0456   | 55.5972  | 55.5972 | 55.5972 | 55.5972   | 8.8197  |
| 1.9578        | 0.3   | 1000 | 2.0301          | 54.8237 | 7.033  | 54.3422 | 54.4676   | 61.3115  | 61.3115 | 61.3115 | 61.3115   | 8.0583  |
| 1.8599        | 0.45  | 1500 | 1.9683          | 58.0535 | 6.4621 | 57.5215 | 57.7813   | 66.2295  | 66.2295 | 66.2295 | 66.2295   | 8.1403  |
| 1.861         | 0.6   | 2000 | 1.9899          | 60.2053 | 6.6431 | 59.6317 | 59.8907   | 69.0867  | 69.0867 | 69.0867 | 69.0867   | 8.4773  |
| 1.7464        | 0.75  | 2500 | 1.9600          | 61.3403 | 6.6424 | 60.8196 | 61.0684   | 70.726   | 70.726  | 70.726  | 70.726    | 8.4747  |
| 1.8516        | 0.9   | 3000 | 1.9506          | 59.7834 | 6.4538 | 59.2387 | 59.5396   | 68.8993  | 68.8993 | 68.8993 | 68.8993   | 8.5043  |
| 1.6371        | 1.05  | 3500 | 1.9415          | 60.9397 | 6.6405 | 60.3836 | 60.6176   | 70.1639  | 70.1639 | 70.1639 | 70.1639   | 8.1427  |
| 1.643         | 1.2   | 4000 | 1.9433          | 62.7362 | 6.8939 | 62.1572 | 62.4167   | 72.4122  | 72.4122 | 72.4122 | 72.4122   | 7.9857  |
| 1.6193        | 1.35  | 4500 | 1.9296          | 61.3662 | 6.7287 | 60.8375 | 61.1083   | 70.8197  | 70.8197 | 70.8197 | 70.8197   | 8.4563  |
| 1.6593        | 1.5   | 5000 | 1.9060          | 63.089  | 6.7619 | 62.5142 | 62.8447   | 73.1616  | 73.1616 | 73.1616 | 73.1616   | 8.42    |
| 1.6716        | 1.65  | 5500 | 1.9133          | 63.2106 | 6.7486 | 62.5549 | 62.9047   | 73.2553  | 73.2553 | 73.2553 | 73.2553   | 8.362   |
| 1.5638        | 1.8   | 6000 | 1.8967          | 63.5146 | 6.9202 | 62.9517 | 63.1969   | 73.4895  | 73.4895 | 73.4895 | 73.4895   | 8.28    |
| 1.5614        | 1.95  | 6500 | 1.8835          | 63.3545 | 6.9092 | 62.7955 | 63.0354   | 73.2084  | 73.2084 | 73.2084 | 73.2084   | 8.2333  |
| 1.4675        | 2.1   | 7000 | 1.9220          | 63.465  | 6.7168 | 62.9135 | 63.2247   | 73.63    | 73.63   | 73.63   | 73.63     | 8.1323  |
| 1.4402        | 2.25  | 7500 | 1.9425          | 64.0073 | 7.0859 | 63.4022 | 63.7246   | 73.8642  | 73.8642 | 73.8642 | 73.8642   | 8.1393  |
| 1.4655        | 2.4   | 8000 | 1.9142          | 64.366  | 6.8629 | 63.7608 | 64.0938   | 74.5667  | 74.5667 | 74.5667 | 74.5667   | 8.1717  |
| 1.4741        | 2.55  | 8500 | 1.9238          | 64.022  | 6.8364 | 63.4035 | 63.7259   | 74.192   | 74.192  | 74.192  | 74.192    | 8.1777  |
| 1.4335        | 2.7   | 9000 | 1.9001          | 64.8286 | 6.9507 | 64.159  | 64.5065   | 75.0351  | 75.0351 | 75.0351 | 75.0351   | 8.1387  |
| 1.5305        | 2.85  | 9500 | 1.8914          | 64.895  | 6.9613 | 64.2636 | 64.5959   | 75.1288  | 75.1288 | 75.1288 | 75.1288   | 8.2063  |


### Framework versions

- Transformers 4.20.1
- Pytorch 1.11.0
- Datasets 2.3.2
- Tokenizers 0.12.1