---
datasets:
  - adsabs/WIESP2022-NER
  - jxhzxn/contributions-ner-cs
language:
  - en
tags:
  - physics
  - computer science
---

PCSciBERT_cased was initialized from the cased variant of SciBERT (https://huggingface.co/allenai/scibert_scivocab_cased) and then pre-trained on text from 1,560,661 arXiv research articles in the physics and computer science domains. The tokenizer for PCSciBERT_cased uses the same vocabulary as allenai/scibert_scivocab_cased.
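
A minimal sketch of how the model might be loaded with the `transformers` library, pairing the SciBERT cased vocabulary with the PCSciBERT_cased weights as described above. The repository id `jmzk96/PCSciBERT_cased` is an assumption (this card only gives the model name), and the helper names `load_pcscibert` and `embed` are hypothetical; adjust them to your setup.

```python
# Assumed Hub repository id; the model card itself only states the model name.
MODEL_ID = "jmzk96/PCSciBERT_cased"
# Stated in the model card: the vocabulary is that of cased SciBERT.
TOKENIZER_ID = "allenai/scibert_scivocab_cased"


def load_pcscibert(model_id=MODEL_ID, tokenizer_id=TOKENIZER_ID):
    """Return (tokenizer, model). Downloads files from the Hub on first call."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def embed(text, tokenizer, model):
    """Encode one string and return the encoder's last hidden states."""
    import torch

    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        outputs = model(**inputs)
    # Shape: (1, number_of_tokens, hidden_size)
    return outputs.last_hidden_state
```

Calling `tokenizer, model = load_pcscibert()` followed by `embed("We study the Higgs boson decay.", tokenizer, model)` should return one contextual vector per token. For downstream tasks such as NER on the datasets listed in the metadata, a task-specific head (for example via `AutoModelForTokenClassification`) would need to be fine-tuned on top.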