Sumanth2390 committed 871bf26 (1 parent: 70c4b46)

Update README.md

Files changed (1)
  1. README.md +18 -20
README.md CHANGED
@@ -19,13 +19,17 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.9653
- - Rouge1: 61.2091
- - Rouge2: 36.8979
- - Rougel: 46.3962
- - Rougelsum: 58.3082
- - Gen Len: 135.6733
-
+ eval_loss: 0.8548385500907898,
+ eval_rouge1: 66.4768,
+ eval_rouge2: 48.5059,
+ eval_rougeL: 55.6107,
+ eval_rougeLsum: 64.379,
+ eval_gen_len: 135.19,
+ eval_runtime: 106.4023,
+ eval_samples_per_second: 0.94,
+ eval_steps_per_second: 0.235,
+ epoch: 5.0
+
  ## Model description
 
  More information needed
@@ -51,22 +55,16 @@ The following hyperparameters were used during training:
  - total_train_batch_size: 8
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 5
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
- | No log | 1.0 | 94 | 1.3755 | 53.9112 | 25.5975 | 36.8507 | 50.0306 | 132.7733 |
- | No log | 2.0 | 188 | 1.2081 | 55.5956 | 27.4849 | 37.7785 | 51.7906 | 137.1267 |
- | No log | 3.0 | 282 | 1.1149 | 55.714 | 28.3629 | 39.0763 | 52.439 | 137.62 |
- | No log | 4.0 | 376 | 1.0564 | 56.6202 | 29.789 | 39.9223 | 53.3054 | 135.1733 |
- | No log | 5.0 | 470 | 1.0107 | 57.8272 | 31.5716 | 41.9775 | 54.5114 | 135.1733 |
- | 1.1609 | 6.0 | 564 | 0.9775 | 58.561 | 32.5462 | 42.9577 | 55.1653 | 133.5533 |
- | 1.1609 | 7.0 | 658 | 0.9683 | 59.0592 | 33.8153 | 43.918 | 56.0493 | 135.3267 |
- | 1.1609 | 8.0 | 752 | 0.9626 | 60.4587 | 35.8511 | 45.9511 | 57.3658 | 134.38 |
- | 1.1609 | 9.0 | 846 | 0.9623 | 60.3938 | 35.8996 | 45.7161 | 57.2104 | 135.2333 |
- | 1.1609 | 10.0 | 940 | 0.9653 | 61.2091 | 36.8979 | 46.3962 | 58.3082 | 135.6733 |
+ Epoch Training Loss Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
+ 1 No log 0.968213 59.682700 35.068600 44.651000 56.618200 137.666700
+ 2 No log 0.961468 61.080300 37.609500 47.390200 58.380500 134.193300
+ 3 No log 0.965955 62.082900 39.734400 48.736800 59.302500 135.833300
+ 4 No log 0.975513 63.494900 42.147500 50.690800 60.831800 134.246700
+ 5 No log 0.983745 64.556600 43.555200 51.977700 61.979700 134.180000
 
 
  ### Framework versions