jmzk96
/

PCSciBERT_cased

computer science

Inference Endpoints

Model card Files Files and versions Community

jmzk96 commited on Jul 3, 2023

Commit

eff432b

•

1 Parent(s): 8507163

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -1,7 +1,6 @@
 ---
 datasets:
 - adsabs/WIESP2022-NER
-- jxhzxn/contributions-ner-cs
 language:
 - en
 tags:
@@ -10,4 +9,7 @@ tags:
 ---
 PCSciBERT_cased was initiated with the cased variant of SciBERT (https://huggingface.co/allenai/scibert_scivocab_cased) and pre-trained on texts from 1,560,661 research articles of the physics and computer science domain in arXiv.
-The tokenizer for PCSciBERT_cased uses the same vocabulary from allenai/scibert_scivocab_cased.

 ---
 datasets:
 - adsabs/WIESP2022-NER
 language:
 - en
 tags:
 ---
 PCSciBERT_cased was initiated with the cased variant of SciBERT (https://huggingface.co/allenai/scibert_scivocab_cased) and pre-trained on texts from 1,560,661 research articles of the physics and computer science domain in arXiv.
+The tokenizer for PCSciBERT_cased uses the same vocabulary from allenai/scibert_scivocab_cased.
+The model was also evaluated on its downstream performance in named entity recognition using the adsabs/WIESP2022-NER and CS-NER (https://github.com/jd-coderepos/contributions-ner-cs/tree/main) dataset. Overall, PCSciBERT_cased achieved higher micro F1 scores for both WIESP and CS-NER datasets.\\
+It improves the performance of SciBERT(cased) on CS-NER test dataset by 0.69% and on WIESP test dataset by 1.49%.