Sumanth2390
commited on
Commit
•
871bf26
1
Parent(s):
70c4b46
Update README.md
Browse files
README.md
CHANGED
@@ -19,13 +19,17 @@ should probably proofread and complete it, then remove this comment. -->
|
|
19 |
|
20 |
This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the None dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
-
|
23 |
-
|
24 |
-
|
25 |
-
|
26 |
-
|
27 |
-
|
28 |
-
|
|
|
|
|
|
|
|
|
29 |
## Model description
|
30 |
|
31 |
More information needed
|
@@ -51,22 +55,16 @@ The following hyperparameters were used during training:
|
|
51 |
- total_train_batch_size: 8
|
52 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
53 |
- lr_scheduler_type: linear
|
54 |
-
- num_epochs:
|
55 |
|
56 |
### Training results
|
57 |
|
58 |
-
|
59 |
-
|
60 |
-
|
61 |
-
|
62 |
-
|
63 |
-
|
64 |
-
| No log | 5.0 | 470 | 1.0107 | 57.8272 | 31.5716 | 41.9775 | 54.5114 | 135.1733 |
|
65 |
-
| 1.1609 | 6.0 | 564 | 0.9775 | 58.561 | 32.5462 | 42.9577 | 55.1653 | 133.5533 |
|
66 |
-
| 1.1609 | 7.0 | 658 | 0.9683 | 59.0592 | 33.8153 | 43.918 | 56.0493 | 135.3267 |
|
67 |
-
| 1.1609 | 8.0 | 752 | 0.9626 | 60.4587 | 35.8511 | 45.9511 | 57.3658 | 134.38 |
|
68 |
-
| 1.1609 | 9.0 | 846 | 0.9623 | 60.3938 | 35.8996 | 45.7161 | 57.2104 | 135.2333 |
|
69 |
-
| 1.1609 | 10.0 | 940 | 0.9653 | 61.2091 | 36.8979 | 46.3962 | 58.3082 | 135.6733 |
|
70 |
|
71 |
|
72 |
### Framework versions
|
|
|
19 |
|
20 |
This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the None dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
+
eval_loss: 0.8548385500907898,
|
23 |
+
eval_rouge1: 66.4768,
|
24 |
+
eval_rouge2: 48.5059,
|
25 |
+
eval_rougeL: 55.6107,
|
26 |
+
eval_rougeLsum: 64.379,
|
27 |
+
eval_gen_len: 135.19,
|
28 |
+
eval_runtime: 106.4023,
|
29 |
+
eval_samples_per_second: 0.94,
|
30 |
+
eval_steps_per_second: 0.235,
|
31 |
+
epoch: 5.0
|
32 |
+
|
33 |
## Model description
|
34 |
|
35 |
More information needed
|
|
|
55 |
- total_train_batch_size: 8
|
56 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
57 |
- lr_scheduler_type: linear
|
58 |
+
- num_epochs: 5
|
59 |
|
60 |
### Training results
|
61 |
|
62 |
+
Epoch Training Loss Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
|
63 |
+
1 No log 0.968213 59.682700 35.068600 44.651000 56.618200 137.666700
|
64 |
+
2 No log 0.961468 61.080300 37.609500 47.390200 58.380500 134.193300
|
65 |
+
3 No log 0.965955 62.082900 39.734400 48.736800 59.302500 135.833300
|
66 |
+
4 No log 0.975513 63.494900 42.147500 50.690800 60.831800 134.246700
|
67 |
+
5 No log 0.983745 64.556600 43.555200 51.977700 61.979700 134.180000
|
|
|
|
|
|
|
|
|
|
|
|
|
68 |
|
69 |
|
70 |
### Framework versions
|