Add evaluation results on the plain_text config of anli
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config of the [anli](https://huggingface.co/datasets/anli) dataset by
@MoritzLaurer
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-anli-plain_text-c507f2-14355972).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=anli).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=anli).
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
- en
|
4 |
license: mit
|
5 |
tags:
|
@@ -12,7 +12,62 @@ datasets:
|
|
12 |
- anli
|
13 |
- fever
|
14 |
pipeline_tag: zero-shot-classification
|
15 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
---
|
17 |
# DeBERTa-v3-base-mnli-fever-anli
|
18 |
## Model description
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
- en
|
4 |
license: mit
|
5 |
tags:
|
|
|
12 |
- anli
|
13 |
- fever
|
14 |
pipeline_tag: zero-shot-classification
|
15 |
+
model-index:
|
16 |
+
- name: MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli
|
17 |
+
results:
|
18 |
+
- task:
|
19 |
+
type: natural-language-inference
|
20 |
+
name: Natural Language Inference
|
21 |
+
dataset:
|
22 |
+
name: anli
|
23 |
+
type: anli
|
24 |
+
config: plain_text
|
25 |
+
split: test_r3
|
26 |
+
metrics:
|
27 |
+
- name: Accuracy
|
28 |
+
type: accuracy
|
29 |
+
value: 0.495
|
30 |
+
verified: true
|
31 |
+
- name: Precision Macro
|
32 |
+
type: precision
|
33 |
+
value: 0.4984740618243923
|
34 |
+
verified: true
|
35 |
+
- name: Precision Micro
|
36 |
+
type: precision
|
37 |
+
value: 0.495
|
38 |
+
verified: true
|
39 |
+
- name: Precision Weighted
|
40 |
+
type: precision
|
41 |
+
value: 0.4984357572868885
|
42 |
+
verified: true
|
43 |
+
- name: Recall Macro
|
44 |
+
type: recall
|
45 |
+
value: 0.49461028192371476
|
46 |
+
verified: true
|
47 |
+
- name: Recall Micro
|
48 |
+
type: recall
|
49 |
+
value: 0.495
|
50 |
+
verified: true
|
51 |
+
- name: Recall Weighted
|
52 |
+
type: recall
|
53 |
+
value: 0.495
|
54 |
+
verified: true
|
55 |
+
- name: F1 Macro
|
56 |
+
type: f1
|
57 |
+
value: 0.4942810999491704
|
58 |
+
verified: true
|
59 |
+
- name: F1 Micro
|
60 |
+
type: f1
|
61 |
+
value: 0.495
|
62 |
+
verified: true
|
63 |
+
- name: F1 Weighted
|
64 |
+
type: f1
|
65 |
+
value: 0.4944671868893595
|
66 |
+
verified: true
|
67 |
+
- name: loss
|
68 |
+
type: loss
|
69 |
+
value: 1.8788293600082397
|
70 |
+
verified: true
|
71 |
---
|
72 |
# DeBERTa-v3-base-mnli-fever-anli
|
73 |
## Model description
|