metadata
datasets:
- adsabs/WIESP2022-NER
- jxhzxn/contributions-ner-cs
language:
- en
tags:
- physics
- computer science
PCSciBERT_cased was initiated with the cased variant of SciBERT (https://huggingface.co/allenai/scibert_scivocab_cased) and pre-trained on texts from 1,560,661 research articles of the physics and computer science domain in arXiv. The tokenizer for PCSciBERT_cased uses the same vocabulary from allenai/scibert_scivocab_cased.