lewtun HF staff committed on
Commit
81ef52c
1 Parent(s): 35cdaef

Add evaluation results on the plain_text config of anli

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config of the [anli](https://huggingface.co/datasets/anli) dataset by @MoritzLaurer, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-anli-plain_text-c507f2-14355972).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=anli).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=anli).

Files changed (1)
  1. README.md +57 -2
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- language:
+ language:
  - en
  license: mit
  tags:
@@ -12,7 +12,62 @@ datasets:
  - anli
  - fever
  pipeline_tag: zero-shot-classification
-
+ model-index:
+ - name: MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli
+   results:
+   - task:
+       type: natural-language-inference
+       name: Natural Language Inference
+     dataset:
+       name: anli
+       type: anli
+       config: plain_text
+       split: test_r3
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.495
+       verified: true
+     - name: Precision Macro
+       type: precision
+       value: 0.4984740618243923
+       verified: true
+     - name: Precision Micro
+       type: precision
+       value: 0.495
+       verified: true
+     - name: Precision Weighted
+       type: precision
+       value: 0.4984357572868885
+       verified: true
+     - name: Recall Macro
+       type: recall
+       value: 0.49461028192371476
+       verified: true
+     - name: Recall Micro
+       type: recall
+       value: 0.495
+       verified: true
+     - name: Recall Weighted
+       type: recall
+       value: 0.495
+       verified: true
+     - name: F1 Macro
+       type: f1
+       value: 0.4942810999491704
+       verified: true
+     - name: F1 Micro
+       type: f1
+       value: 0.495
+       verified: true
+     - name: F1 Weighted
+       type: f1
+       value: 0.4944671868893595
+       verified: true
+     - name: loss
+       type: loss
+       value: 1.8788293600082397
+       verified: true
  ---
  # DeBERTa-v3-base-mnli-fever-anli
  ## Model description
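
In the metrics added by this commit, Accuracy, Precision Micro, Recall Micro, F1 Micro and Recall Weighted all report the same value, 0.495. That is expected for single-label multiclass classification: every wrong prediction counts as exactly one false positive (for the predicted class) and one false negative (for the true class), so the micro-averaged scores and the support-weighted recall all reduce to accuracy. The sketch below illustrates this on a small hypothetical label set; the function name `micro_and_weighted` and the toy labels are assumptions made for illustration and are not the evaluator's actual code or data.

```python
from collections import Counter


def micro_and_weighted(y_true, y_pred):
    """Compute accuracy, micro-averaged P/R/F1 and weighted recall.

    Illustrative helper only (hypothetical name); assumes single-label
    predictions and at least one correct prediction.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for true_label, pred_label in zip(y_true, y_pred):
        if true_label == pred_label:
            tp[true_label] += 1
        else:
            fp[pred_label] += 1  # predicted class gains a false positive
            fn[true_label] += 1  # true class gains a false negative

    n = len(y_true)
    total_tp = sum(tp.values())
    total_fp = sum(fp.values())
    total_fn = sum(fn.values())

    accuracy = total_tp / n
    micro_precision = total_tp / (total_tp + total_fp)
    micro_recall = total_tp / (total_tp + total_fn)
    micro_f1 = (
        2 * micro_precision * micro_recall / (micro_precision + micro_recall)
    )

    # Weighted recall: per-class recall weighted by class support.
    support = Counter(y_true)
    weighted_recall = sum(
        (support[c] / n) * (tp[c] / support[c]) for c in support
    )

    return {
        "accuracy": accuracy,
        "precision_micro": micro_precision,
        "recall_micro": micro_recall,
        "f1_micro": micro_f1,
        "recall_weighted": weighted_recall,
    }


# Toy NLI-style labels (hypothetical, not taken from anli).
y_true = ["entailment", "neutral", "contradiction",
          "entailment", "neutral", "contradiction"]
y_pred = ["entailment", "neutral", "neutral",
          "contradiction", "neutral", "contradiction"]

scores = micro_and_weighted(y_true, y_pred)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

The macro-averaged and precision-weighted figures in the diff differ slightly from 0.495 because they average per-class scores, which do not collapse to accuracy in the same way.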