autoevaluator HF staff commited on
Commit
dd110a0
1 Parent(s): a1b9025

Add evaluation results on the autoevaluate--squad-sample config and test split of autoevaluate/squad-sample

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 馃憢!\
Your model has been evaluated on the autoevaluate--squad-sample config and test split of the [autoevaluate/squad-sample](https://huggingface.co/datasets/autoevaluate/squad-sample) dataset by

@lewtun
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-autoevaluate__squad-sample-autoevaluate__squad-sample-778ba0-17436362).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=autoevaluate/squad-sample).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=autoevaluate/squad-sample).

Files changed (1) hide show
  1. README.md +25 -1
README.md CHANGED
@@ -6,7 +6,31 @@ datasets:
6
  - squad
7
  model-index:
8
  - name: extractive-question-answering
9
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
6
  - squad
7
  model-index:
8
  - name: extractive-question-answering
9
+ results:
10
+ - task:
11
+ type: question-answering
12
+ name: Question Answering
13
+ dataset:
14
+ name: autoevaluate/squad-sample
15
+ type: autoevaluate/squad-sample
16
+ config: autoevaluate--squad-sample
17
+ split: test
18
+ metrics:
19
+ - type: f1
20
+ value: 76.9929
21
+ name: F1
22
+ verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWU5MzZiMjc4Yzk0ZWE3MjQ2NzRkZTNhYjUwNTg4NjQxMWRjNDMzMzM3ZTk1N2ZlZWFmMGMzNzI5NDA5YmE5NyIsInZlcnNpb24iOjF9.Rhdvpq_Kxl0KQbQGm9zS-ZD_hAMAvaiJjN81vcF5FHUbkbr08YEJt7R0GCCogo8XTGTwjD3IVx4vhkp2YihHCA
24
+ - type: exact_match
25
+ value: 70.0
26
+ name: Exact Match
27
+ verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTM0MjdlMjU1ZDc2NzgyMWVmNDU4Mjk2M2Y0NzliNDc4MmFlMGViNmU0YjFmMzkyNWI2NzAyYzI0ZmVhMjc2NSIsInZlcnNpb24iOjF9.-ffZWh0VpGUjxDXbZVRMnA7UzLVLx2j9YlcdnzgS96exKQe4nrEnfqVTh_zMATXWvYmpGwG0r9H3vyl2DBgIBg
29
+ - type: loss
30
+ value: 1.203058123588562
31
+ name: loss
32
+ verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMDYxMWRhNDVhN2UzZWIyZDhhNWU1ZjljNDMwNmI3MzE0ODIwZTk3NzY4ZTY1Mzk3NGQ5MzVkNWJiNzg2MTIxMiIsInZlcnNpb24iOjF9.fa-apsWqPIIJ_vtZigRkpWeC3Nn3bel_1BV97hVQiqahIz-5000jw1dqjVWWrU7Srq1RLmPS0q1mnNW-6wrjAA
34
  ---
35
 
36
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You