Update README.md
README.md
CHANGED
@@ -1,7 +1,6 @@
 ---
 datasets:
 - adsabs/WIESP2022-NER
-- jxhzxn/contributions-ner-cs
 language:
 - en
 tags:
@@ -10,4 +9,7 @@ tags:
 ---
 
 PCSciBERT_cased was initialized from the cased variant of SciBERT (https://huggingface.co/allenai/scibert_scivocab_cased) and pre-trained on texts from 1,560,661 research articles in the physics and computer science domains on arXiv.
-The tokenizer for PCSciBERT_cased uses the same vocabulary as allenai/scibert_scivocab_cased.
+The tokenizer for PCSciBERT_cased uses the same vocabulary as allenai/scibert_scivocab_cased.
+
+The model was also evaluated on downstream named entity recognition using the adsabs/WIESP2022-NER and CS-NER (https://github.com/jd-coderepos/contributions-ner-cs/tree/main) datasets. Overall, PCSciBERT_cased achieved higher micro F1 scores than SciBERT (cased) on both the WIESP and CS-NER datasets.\\
+It improves on SciBERT (cased) by 0.69% on the CS-NER test set and by 1.49% on the WIESP test set.
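The README reports micro F1 for named entity recognition but does not say how it was computed. As a rough illustration only, the sketch below shows one common entity-level definition: true positives, false positives, and false negatives are pooled over all sentences before precision and recall are taken. The `Entity` tuple layout, the `micro_f1` helper, and the example labels are assumptions made for this sketch and are not taken from the PCSciBERT evaluation code.

```python
from typing import Iterable, Set, Tuple

# Hypothetical entity representation: (start_token, end_token, label).
Entity = Tuple[int, int, str]


def micro_f1(gold: Iterable[Set[Entity]], pred: Iterable[Set[Entity]]) -> float:
    """Entity-level micro F1: counts are pooled over all sentences first."""
    tp = fp = fn = 0
    for gold_entities, pred_entities in zip(gold, pred):
        tp += len(gold_entities & pred_entities)  # exact span and label match
        fp += len(pred_entities - gold_entities)  # predicted but not in gold
        fn += len(gold_entities - pred_entities)  # in gold but not predicted
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Two toy sentences with made-up labels, for illustration only.
gold = [{(0, 2, "Organization"), (5, 6, "Person")}, {(1, 3, "Telescope")}]
pred = [{(0, 2, "Organization")}, {(1, 3, "Telescope"), (4, 5, "Person")}]
score = micro_f1(gold, pred)
print(f"micro F1 = {score:.4f}")
```

Because the counts are pooled before averaging, frequent entity types weigh more heavily in micro F1 than rare ones. Whether the reported 0.69% and 1.49% gains are absolute differences in this score or relative changes is not stated in the README.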