File size: 3,916 Bytes

bfae21f

---
license: mit
base_model: facebook/bart-large-cnn
tags:
- generated_from_trainer
datasets:
- clupubhealth
metrics:
- rouge
model-index:
- name: bart-cnn-pubhealth-expanded-hi-grad
  results:
  - task:
      name: Sequence-to-sequence Language Modeling
      type: text2text-generation
    dataset:
      name: clupubhealth
      type: clupubhealth
      config: expanded
      split: test
      args: expanded
    metrics:
    - name: Rouge1
      type: rouge
      value: 28.8807
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# bart-cnn-pubhealth-expanded-hi-grad

This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the clupubhealth dataset.
It achieves the following results on the evaluation set:
- Loss: 2.1939
- Rouge1: 28.8807
- Rouge2: 8.9567
- Rougel: 19.5591
- Rougelsum: 20.6726
- Gen Len: 66.99

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 950
- total_train_batch_size: 15200
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
| 3.4166        | 0.49  | 2    | 2.4019          | 22.0991 | 4.6789 | 15.1628 | 17.4382   | 75.065  |
| 3.2194        | 0.98  | 4    | 2.3372          | 25.0981 | 6.6975 | 17.4606 | 19.2018   | 71.225  |
| 3.0969        | 1.47  | 6    | 2.2979          | 26.4747 | 7.1948 | 18.2262 | 19.6241   | 67.19   |
| 3.0313        | 1.96  | 8    | 2.3038          | 26.8637 | 7.5831 | 18.2923 | 19.6327   | 66.875  |
| 2.9753        | 2.44  | 10   | 2.2976          | 27.8942 | 8.3434 | 19.095  | 20.6248   | 67.975  |
| 2.9296        | 2.93  | 12   | 2.2602          | 28.1255 | 8.6477 | 19.0575 | 20.7787   | 68.515  |
| 2.8681        | 3.42  | 14   | 2.2341          | 28.0812 | 8.598  | 19.3391 | 20.7526   | 68.285  |
| 2.867         | 3.91  | 16   | 2.2246          | 28.3624 | 8.7921 | 19.5552 | 21.1147   | 68.225  |
| 2.8157        | 4.4   | 18   | 2.2178          | 28.8197 | 8.8423 | 19.3606 | 20.698    | 69.08   |
| 2.8007        | 4.89  | 20   | 2.2149          | 28.34   | 8.5084 | 18.8293 | 20.1169   | 68.255  |
| 2.7797        | 5.38  | 22   | 2.2123          | 28.2156 | 8.4891 | 19.3472 | 20.5036   | 67.525  |
| 2.7563        | 5.87  | 24   | 2.2083          | 27.8927 | 8.3783 | 19.1194 | 20.2498   | 68.365  |
| 2.736         | 6.36  | 26   | 2.2035          | 28.2588 | 8.2345 | 18.9418 | 20.2931   | 68.335  |
| 2.7208        | 6.85  | 28   | 2.2020          | 28.2471 | 8.599  | 19.3465 | 20.5104   | 68.44   |
| 2.713         | 7.33  | 30   | 2.2022          | 28.1863 | 8.5142 | 19.194  | 20.2467   | 68.3    |
| 2.7135        | 7.82  | 32   | 2.2013          | 28.462  | 8.6346 | 19.2465 | 20.4812   | 68.195  |
| 2.6987        | 8.31  | 34   | 2.1988          | 28.9168 | 8.8888 | 19.6491 | 20.7796   | 67.275  |
| 2.6978        | 8.8   | 36   | 2.1965          | 28.7303 | 8.9879 | 19.5924 | 20.6943   | 67.31   |
| 2.6769        | 9.29  | 38   | 2.1946          | 28.7956 | 8.9652 | 19.545  | 20.7352   | 67.33   |
| 2.6821        | 9.78  | 40   | 2.1939          | 28.8807 | 8.9567 | 19.5591 | 20.6726   | 66.99   |


### Framework versions

- Transformers 4.31.0
- Pytorch 2.0.1+cu117
- Datasets 2.7.1
- Tokenizers 0.13.2