Commit
•
ebf7bad
1
Parent(s):
39e7680
Add evaluation results on the 3.0.0 config and test split of cnn_dailymail
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the 3.0.0 config and test split of the [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset by
@Raj
P Sini, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-cnn_dailymail-3.0.0-52cdb7-47832145227).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=cnn_dailymail).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=cnn_dailymail).
README.md
CHANGED
@@ -18,30 +18,30 @@ model-index:
|
|
18 |
split: test
|
19 |
metrics:
|
20 |
- type: rouge
|
21 |
-
value: 24.
|
22 |
name: ROUGE-1
|
23 |
verified: true
|
24 |
-
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.
|
25 |
- type: rouge
|
26 |
-
value: 11.
|
27 |
name: ROUGE-2
|
28 |
verified: true
|
29 |
-
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.
|
30 |
- type: rouge
|
31 |
-
value: 19.
|
32 |
name: ROUGE-L
|
33 |
verified: true
|
34 |
-
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.
|
35 |
- type: rouge
|
36 |
-
value: 22.
|
37 |
name: ROUGE-LSUM
|
38 |
verified: true
|
39 |
-
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.
|
40 |
- type: loss
|
41 |
-
value: 2.
|
42 |
name: loss
|
43 |
verified: true
|
44 |
-
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.
|
45 |
- type: gen_len
|
46 |
value: 18.9993
|
47 |
name: gen_len
|
|
|
18 |
split: test
|
19 |
metrics:
|
20 |
- type: rouge
|
21 |
+
value: 24.1585
|
22 |
name: ROUGE-1
|
23 |
verified: true
|
24 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2Q0Nzk3ZTNkNTFjMTM2YjliNzcxYTVlMDgyNDE4MzZjNzgzZjgzYjI1NWFjZTE2YjE4MWE3NGRiNGZiMmVhNyIsInZlcnNpb24iOjF9.H2oS1cN5A3wY8oFZTVtCMwnbDPAdUhNwjTSDocqQinhDq7aSee_AvIVn-7m84Ke8qaMTAvHB9e56MDAAVT8XBA
|
25 |
- type: rouge
|
26 |
+
value: 11.0688
|
27 |
name: ROUGE-2
|
28 |
verified: true
|
29 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGIyMmYzZTFhNjgwMmU5YWQ1MTZjM2ZlNjEwYmVmODkyMGQwZDQ2MjM1YmRkYjM2NTEyNjE5N2ExYzc0ZTcyYSIsInZlcnNpb24iOjF9.6GtmrXTD0EnrXx02enbLdbeiLh--I9u0GfrPdXZ_CKHeYgpFs0Gk1F0c75QBfGoMilodGymS15A9Bjvt00baBw
|
30 |
- type: rouge
|
31 |
+
value: 19.7293
|
32 |
name: ROUGE-L
|
33 |
verified: true
|
34 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzc4MGQyNmYwNDk5NDE0MDk2ZjE2NmVkZDIwN2NmYzQxZTI0NWZhZjkxOGFkMWZmNjQ5NzRkODViNzg5Zjc5MiIsInZlcnNpb24iOjF9.rOgFJeHsW74nQiKc3DPoMIB9aWKqWTRtnweYP3DCp4duJN5jq32PPNyXo3EYuskGgTSp4KWwf7-Hl2MYwDrSCQ
|
35 |
- type: rouge
|
36 |
+
value: 22.6394
|
37 |
name: ROUGE-LSUM
|
38 |
verified: true
|
39 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTA4M2JlZDliMmFlZDgwM2E2MDZjY2ZjZGUwZTcxNDM0NGU3NzdlYzJlZTEzNDEyZDE0OWFiMjUzMmYwNjRhNyIsInZlcnNpb24iOjF9.Mq9ltLQ5YAZfLLaGsPtSOe6KCRLRwjT_2nSAH9KWvOiyagJ16F5xQ1m9uUx9mhiu_UOmpjDaAtD3y4AOy4L0Dg
|
40 |
- type: loss
|
41 |
+
value: 2.516355514526367
|
42 |
name: loss
|
43 |
verified: true
|
44 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOGQwNTIyZmU5ZjU3OWM1NGMwYzJiYTA0ZGVmOTA2MjcxYzZmZDRjZDViZDg0NGNlOWNjODkxYTc1ZTJhMmYyMiIsInZlcnNpb24iOjF9.mh6ZVu82CFnb5g92Uj-99wjyvoSQQI-gO-PDBdH4JZyc8mVPJYzV-S7jyXwC_XsOfD1OsR9XKTxM1NUirfBKAw
|
45 |
- type: gen_len
|
46 |
value: 18.9993
|
47 |
name: gen_len
|