lewtun HF staff commited on
Commit
4c1fa82
1 Parent(s): cc842da

Add evaluation results on the plain_text config and validation split of launch/gov_report

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and validation split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-7b7f8a-16126219).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -177,6 +177,39 @@ model-index:
177
  type: gen_len
178
  value: 233.8107
179
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
180
  ---
181
 
182
  # BigBirdPegasus model (large)
 
177
  type: gen_len
178
  value: 233.8107
179
  verified: true
180
+ - task:
181
+ type: summarization
182
+ name: Summarization
183
+ dataset:
184
+ name: launch/gov_report
185
+ type: launch/gov_report
186
+ config: plain_text
187
+ split: validation
188
+ metrics:
189
+ - name: ROUGE-1
190
+ type: rouge
191
+ value: 19.5926
192
+ verified: true
193
+ - name: ROUGE-2
194
+ type: rouge
195
+ value: 2.119
196
+ verified: true
197
+ - name: ROUGE-L
198
+ type: rouge
199
+ value: 13.7779
200
+ verified: true
201
+ - name: ROUGE-LSUM
202
+ type: rouge
203
+ value: 18.2027
204
+ verified: true
205
+ - name: loss
206
+ type: loss
207
+ value: 6.766763210296631
208
+ verified: true
209
+ - name: gen_len
210
+ type: gen_len
211
+ value: 244.2084
212
+ verified: true
213
  ---
214
 
215
  # BigBirdPegasus model (large)