Commit
•
84ee1a6
1
Parent(s):
e5350ef
Add evaluation results on the plain_text config and dev_r1 split of anli
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and dev_r1 split of the [anli](https://huggingface.co/datasets/anli) dataset by
@ctkang
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-anli-plain_text-f2dca1-2066067125).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=anli).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=anli).
README.md
CHANGED
@@ -174,6 +174,59 @@ model-index:
|
|
174 |
type: loss
|
175 |
value: 0.3283354640007019
|
176 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
177 |
---
|
178 |
# DeBERTa-v3-base-mnli-fever-anli
|
179 |
## Model description
|
|
|
174 |
type: loss
|
175 |
value: 0.3283354640007019
|
176 |
verified: true
|
177 |
+
- task:
|
178 |
+
type: natural-language-inference
|
179 |
+
name: Natural Language Inference
|
180 |
+
dataset:
|
181 |
+
name: anli
|
182 |
+
type: anli
|
183 |
+
config: plain_text
|
184 |
+
split: dev_r1
|
185 |
+
metrics:
|
186 |
+
- name: Accuracy
|
187 |
+
type: accuracy
|
188 |
+
value: 0.737
|
189 |
+
verified: true
|
190 |
+
- name: Precision Macro
|
191 |
+
type: precision
|
192 |
+
value: 0.737681071614645
|
193 |
+
verified: true
|
194 |
+
- name: Precision Micro
|
195 |
+
type: precision
|
196 |
+
value: 0.737
|
197 |
+
verified: true
|
198 |
+
- name: Precision Weighted
|
199 |
+
type: precision
|
200 |
+
value: 0.7376755842752241
|
201 |
+
verified: true
|
202 |
+
- name: Recall Macro
|
203 |
+
type: recall
|
204 |
+
value: 0.7369675064285843
|
205 |
+
verified: true
|
206 |
+
- name: Recall Micro
|
207 |
+
type: recall
|
208 |
+
value: 0.737
|
209 |
+
verified: true
|
210 |
+
- name: Recall Weighted
|
211 |
+
type: recall
|
212 |
+
value: 0.737
|
213 |
+
verified: true
|
214 |
+
- name: F1 Macro
|
215 |
+
type: f1
|
216 |
+
value: 0.7366853496239583
|
217 |
+
verified: true
|
218 |
+
- name: F1 Micro
|
219 |
+
type: f1
|
220 |
+
value: 0.737
|
221 |
+
verified: true
|
222 |
+
- name: F1 Weighted
|
223 |
+
type: f1
|
224 |
+
value: 0.7366990292378379
|
225 |
+
verified: true
|
226 |
+
- name: loss
|
227 |
+
type: loss
|
228 |
+
value: 0.9349392056465149
|
229 |
+
verified: true
|
230 |
---
|
231 |
# DeBERTa-v3-base-mnli-fever-anli
|
232 |
## Model description
|