---
language: et
license: cc-by-sa-4.0
inference: false
base_model:
- EMBEDDIA/est-roberta
pipeline_tag: token-classification
tags:
- NER
---

# est-roberta-hist-ner

## Model description

est-roberta-hist-ner is an [Est-RoBERTa](https://huggingface.co/EMBEDDIA/est-roberta)-based model fine-tuned for named entity recognition in 19th-century Estonian parish court records (for details, see [this repository](https://github.com/soras/vk_ner_lrec_2022)). It recognizes the following entity types: person names (PER), ambiguous locations-organizations (LOC_ORG), locations (LOC), organizations (ORG), and miscellaneous names (MISC).

## How to use

The model is best used with the appropriate pre- and postprocessing provided by EstNLTK. For a usage example, see this tutorial: [https://github.com/soras/vk_ner_lrec_2022/blob/main/using_bert_ner_tagger.ipynb](https://github.com/soras/vk_ner_lrec_2022/blob/main/using_bert_ner_tagger.ipynb)

## Citation

If you use this model in your work, please cite us as follows:

    @InProceedings{orasmaa-EtAl:2022:LREC,
      author    = {Orasmaa, Siim and Muischnek, Kadri and Poska, Kristjan and Edela, Anna},
      title     = {Named Entity Recognition in Estonian 19th Century Parish Court Records},
      booktitle = {Proceedings of the Language Resources and Evaluation Conference},
      month     = {June},
      year      = {2022},
      address   = {Marseille, France},
      publisher = {European Language Resources Association},
      pages     = {5304--5313},
      url       = {https://aclanthology.org/2022.lrec-1.568}
    }
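
As a rough illustration of the kind of postprocessing mentioned under "How to use", the sketch below merges token-level tags into entity spans for the five entity types listed in the model description. It is not the EstNLTK implementation and not the authors' method: the `group_entities` helper is hypothetical, the BIO-style labels (`B-PER`, `I-PER`, `O`) are an assumption about the tagging scheme that this card does not state, and the example tokens are made up. For real use, follow the linked tutorial.

```python
from typing import List, Tuple

# Entity types listed in the model description.
ENTITY_TYPES = {"PER", "LOC_ORG", "LOC", "ORG", "MISC"}


def group_entities(tokens: List[str], labels: List[str]) -> List[Tuple[str, str]]:
    """Merge consecutive B-/I- tagged tokens into (text, type) spans.

    Assumes BIO-style labels such as "B-PER", "I-PER" and "O";
    this scheme is an assumption, not something the model card states.
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type = ""

    def flush() -> None:
        # Close the span that is currently open, if any.
        if current_tokens:
            spans.append((" ".join(current_tokens), current_type))

    for token, label in zip(tokens, labels):
        # Split only on the first hyphen, so "B-LOC_ORG" -> ("B", "LOC_ORG").
        prefix, _, entity_type = label.partition("-")
        if prefix == "B" and entity_type in ENTITY_TYPES:
            flush()
            current_tokens = [token]
            current_type = entity_type
        elif prefix == "I" and current_tokens and entity_type == current_type:
            current_tokens.append(token)
        else:
            # "O", an unknown label, or an "I-" tag that does not continue
            # the open span: close the span and skip this token.
            flush()
            current_tokens = []
            current_type = ""
    flush()
    return spans


# Made-up example tokens and labels, for illustration only.
tokens = ["Jaan", "Tamm", "kaebas", "Tartu", "kohtus"]
labels = ["B-PER", "I-PER", "O", "B-LOC", "O"]
print(group_entities(tokens, labels))
# prints [('Jaan Tamm', 'PER'), ('Tartu', 'LOC')]
```

Splitting the label on the first hyphen only keeps the underscore-containing type `LOC_ORG` intact.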