|
--- |
|
language: et |
|
license: cc-by-sa-4.0 |
|
inference: false |
|
base_model: |
|
- EMBEDDIA/est-roberta |
|
pipeline_tag: token-classification |
|
tags: |
|
- NER |
|
--- |
|
|
|
# est-roberta-hist-ner |
|
|
|
## Model description |
|
|
|
est-roberta-hist-ner is an [Est-RoBERTa](https://huggingface.co/EMBEDDIA/est-roberta) based model fine-tuned for named entity recognition in Estonian 19th century parish court records (for details, see [this repository](https://github.com/soras/vk_ner_lrec_2022)). |
|
The following types of entities are recognized: person names (PER), ambiguous locations-organizations (LOC_ORG), locations (LOC), organizations (ORG) and MISC (miscellaneous names). |
|
|
|
## How to use |
|
|
|
Recommended usage of the model is with approriate pre- and postprocessing by EstNLTK. |
|
For an usage example, see this tutorial: [https://github.com/soras/vk\_ner\_lrec\_2022/blob/main/using\_bert\_ner\_tagger.ipynb](https://github.com/soras/vk_ner_lrec_2022/blob/main/using_bert_ner_tagger.ipynb) |
|
|
|
## Citation |
|
|
|
If you use this model in your work, please cite us as follows: |
|
|
|
@InProceedings{orasmaa-EtAl:2022:LREC, |
|
author = {Orasmaa, Siim and Muischnek, Kadri and Poska, Kristjan and Edela, Anna}, |
|
title = {Named Entity Recognition in Estonian 19th Century Parish Court Records}, |
|
booktitle = {Proceedings of the Language Resources and Evaluation Conference}, |
|
month = {June}, |
|
year = {2022}, |
|
address = {Marseille, France}, |
|
publisher = {European Language Resources Association}, |
|
pages = {5304--5313}, |
|
url = {https://aclanthology.org/2022.lrec-1.568} |
|
} |