Intel
/

bert-base-uncased-sparse-1_2

Inference Endpoints

Model card Files Files and versions Community

ofirzaf commited on Jun 24, 2021

Commit

eee4f16

•

1 Parent(s): 8048390

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -13,6 +13,8 @@ The model can be used for fine-tuning to downstream tasks with sparsity already
 To keep the sparsity a mask should be added to each sparse weight blocking the optimizer from updating the zeros.
 ## Evaluation Results
 | Task | MNLI-m (Acc) | MNLI-mm (Acc) | QQP (Acc/F1) | QNLI (Acc) | SST-2 (Acc) | STS-B (Pears/Spear) | SQuADv1.1 (Acc/F1) |
 |------|--------------|---------------|--------------|------------|-------------|---------------------|--------------------|
 |      | 83.3         | 83.9          |   90.8/87.6  |    90.4    |     91.3    |      88.8/88.3      |      80.5/88.2     |

 To keep the sparsity a mask should be added to each sparse weight blocking the optimizer from updating the zeros.
 ## Evaluation Results
+We get the following results on the tasks development set, all results are mean of 5 different seeded models:
 | Task | MNLI-m (Acc) | MNLI-mm (Acc) | QQP (Acc/F1) | QNLI (Acc) | SST-2 (Acc) | STS-B (Pears/Spear) | SQuADv1.1 (Acc/F1) |
 |------|--------------|---------------|--------------|------------|-------------|---------------------|--------------------|
 |      | 83.3         | 83.9          |   90.8/87.6  |    90.4    |     91.3    |      88.8/88.3      |      80.5/88.2     |