lewtun HF staff commited on
Commit
b044405
1 Parent(s): c67df39

Add evaluation results on the qnli config of glue

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the qnli config of the [glue](https://huggingface.co/datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-glue-91d4fe29-14115933).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -22,6 +22,39 @@ model-index:
22
  - name: Accuracy
23
  type: accuracy
24
  value: 0.9150649826102873
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
22
  - name: Accuracy
23
  type: accuracy
24
  value: 0.9150649826102873
25
+ - task:
26
+ type: natural-language-inference
27
+ name: Natural Language Inference
28
+ dataset:
29
+ name: glue
30
+ type: glue
31
+ config: qnli
32
+ split: validation
33
+ metrics:
34
+ - name: Accuracy
35
+ type: accuracy
36
+ value: 0.914881933003844
37
+ verified: true
38
+ - name: Precision
39
+ type: precision
40
+ value: 0.9195906432748538
41
+ verified: true
42
+ - name: Recall
43
+ type: recall
44
+ value: 0.9112640347700108
45
+ verified: true
46
+ - name: AUC
47
+ type: auc
48
+ value: 0.9718281171793548
49
+ verified: true
50
+ - name: F1
51
+ type: f1
52
+ value: 0.9154084045843187
53
+ verified: true
54
+ - name: loss
55
+ type: loss
56
+ value: 0.21421395242214203
57
+ verified: true
58
  ---
59
 
60
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You