adorkin's picture
Update metadata
f15e982 verified
metadata
language: et
license: cc-by-sa-4.0
inference: false
base_model:
  - EMBEDDIA/est-roberta
pipeline_tag: token-classification
tags:
  - NER

est-roberta-hist-ner

Model description

est-roberta-hist-ner is an Est-RoBERTa based model fine-tuned for named entity recognition in Estonian 19th century parish court records (for details, see this repository). The following types of entities are recognized: person names (PER), ambiguous locations-organizations (LOC_ORG), locations (LOC), organizations (ORG) and MISC (miscellaneous names).

How to use

Recommended usage of the model is with approriate pre- and postprocessing by EstNLTK. For an usage example, see this tutorial: https://github.com/soras/vk_ner_lrec_2022/blob/main/using_bert_ner_tagger.ipynb

Citation

If you use this model in your work, please cite us as follows:

@InProceedings{orasmaa-EtAl:2022:LREC,
  author    = {Orasmaa, Siim  and  Muischnek, Kadri  and  Poska, Kristjan  and  Edela, Anna},
  title     = {Named Entity Recognition in Estonian 19th Century Parish Court Records},
  booktitle      = {Proceedings of the Language Resources and Evaluation Conference},
  month          = {June},
  year           = {2022},
  address        = {Marseille, France},
  publisher      = {European Language Resources Association},
  pages     = {5304--5313},
  url       = {https://aclanthology.org/2022.lrec-1.568}
}