metadata
language: et
license: cc-by-sa-4.0
inference: false
base_model:
- EMBEDDIA/est-roberta
pipeline_tag: token-classification
tags:
- NER
est-roberta-hist-ner
Model description
est-roberta-hist-ner is an Est-RoBERTa-based model fine-tuned for named entity recognition in 19th-century Estonian parish court records (for details, see this repository). It recognizes the following entity types: person names (PER), ambiguous locations-organizations (LOC_ORG), locations (LOC), organizations (ORG), and miscellaneous names (MISC).
How to use
The recommended way to use the model is with appropriate pre- and postprocessing by EstNLTK. For a usage example, see this tutorial: https://github.com/soras/vk_ner_lrec_2022/blob/main/using_bert_ner_tagger.ipynb
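The tutorial above covers the full EstNLTK workflow. As a rough illustration of what the postprocessing step involves, the sketch below merges word-level tags into entity phrases. It assumes the model emits BIO-style labels such as B-PER and I-PER, which this card does not state. The function name group_bio_tags and the sample tokens are hypothetical, and this is not the EstNLTK implementation.

```python
from typing import List, Optional, Tuple

# Entity types listed in the model description above.
ENTITY_TYPES = {"PER", "LOC_ORG", "LOC", "ORG", "MISC"}


def group_bio_tags(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge word-level BIO tags into (entity_text, entity_type) pairs.

    Assumption: tags look like "B-PER", "I-PER" or "O" (BIO scheme).
    """
    entities: List[Tuple[str, str]] = []
    current_words: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Store the entity collected so far, if any, and reset the buffer.
        nonlocal current_words, current_type
        if current_words and current_type is not None:
            entities.append((" ".join(current_words), current_type))
        current_words, current_type = [], None

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "B" and label in ENTITY_TYPES:
            # A B- tag always starts a new entity.
            flush()
            current_words, current_type = [token], label
        elif prefix == "I" and current_words and label == current_type:
            # An I- tag of the same type continues the open entity.
            current_words.append(token)
        else:
            # "O", unknown labels, or a stray I- tag close the open entity.
            flush()
    flush()
    return entities


# Made-up sample input, for illustration only.
tokens = ["Kohtumees", "Jaan", "Tamm", "Tartust"]
tags = ["O", "B-PER", "I-PER", "B-LOC"]
print(group_bio_tags(tokens, tags))
```

Stray I- tags that do not continue an open entity are dropped here, which is a conservative choice; a real pipeline may prefer to treat them as entity starts.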
Citation
If you use this model in your work, please cite us as follows:
@InProceedings{orasmaa-EtAl:2022:LREC,
author = {Orasmaa, Siim and Muischnek, Kadri and Poska, Kristjan and Edela, Anna},
title = {Named Entity Recognition in Estonian 19th Century Parish Court Records},
booktitle = {Proceedings of the Language Resources and Evaluation Conference},
month = {June},
year = {2022},
address = {Marseille, France},
publisher = {European Language Resources Association},
pages = {5304--5313},
url = {https://aclanthology.org/2022.lrec-1.568}
}