pprokopidis
/

elNER18-bert-base-greek-uncased-v1-bs8-e150-lr5e-06

Token Classification

sequence-tagger-model


pprokopidis committed on Oct 2

Commit

3f44aca

•

1 Parent(s): c3bb900

add README.md

Files changed (1)

README.md +89 -3

README.md CHANGED

@@ -1,3 +1,89 @@
----
-license: cc-by-nc-2.0
----

+---
+language:
+- el
+license: cc-by-nc-2.0
+tags:
+- flair
+- token-classification
+- sequence-tagger-model
+base_model:
+- nlpaueb/bert-base-greek-uncased-v1
+---
+# Greek Named Entity Model Fine-Tuned on the elNER Dataset
+This Greek NER model was fine-tuned by researchers at the [Institute for Language and Speech Processing/Athena RC](https://www.ilsp.gr) on the [elNER-18 dataset](https://dl.acm.org/doi/10.1145/3411408.3411437), using [nlpaueb/bert-base-greek-uncased-v1](https://huggingface.co/nlpaueb/bert-base-greek-uncased-v1) as the backbone language model.
+## Dataset
+The [elNER-18 dataset](https://dl.acm.org/doi/10.1145/3411408.3411437) consists of 21K sentences, 623K tokens and 94K annotated named entities for 18 NE classes.
+The following 18 named entity classes are annotated:
+|Class|Count|
+|:---|:---|
+|ORG|10944|
+|PERSON|8774|
+|CARDINAL|7343|
+|GPE|6781|
+|DATE|6338|
+|ORDINAL|1438|
+|PERCENT|1437|
+|LOC|1404|
+|NORP|1396|
+|MONEY|1012|
+|TIME|1011|
+|EVENT|962|
+|PRODUCT|668|
+|WORK_OF_ART|608|
+|FAC|567|
+|QUANTITY|565|
+|LAW|235|
+|LANGUAGE|55|
+## Fine-Tuning
+[Flair version 0.14](https://github.com/flairNLP/flair/releases/tag/v0.14.0) was used for fine-tuning.
+<!-- A hyper-parameter search is to be performed. Right now we have results with the following parameters. -->
+The model was trained with the following hyper-parameters:
+* Batch Size: `8`
+* Learning Rate: `5e-05`
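The setup described above can be sketched roughly as follows. This is a minimal sketch, not the authors' training script: it follows the standard Flair transformer fine-tuning recipe, and the corpus layout, the tagger options (no CRF, no RNN, last layer, first-subtoken pooling) and the `fine_tune` helper are all assumptions. The batch size and learning rate are the values listed above; the epoch count is not stated in the README and is only inferred from `e150` in the repository name. The repository name also contains `lr5e-06`, so the training log is the place to confirm the learning rate actually used.

```python
# Batch size and learning rate are the values listed in the README.
# max_epochs is NOT stated there; it is inferred from "e150" in the
# repository name and is an assumption.
HYPERPARAMS = {"mini_batch_size": 8, "learning_rate": 5e-05, "max_epochs": 150}
BASE_MODEL = "nlpaueb/bert-base-greek-uncased-v1"


def fine_tune(data_folder: str, output_dir: str) -> None:
    """Fine-tune a Flair SequenceTagger on a CoNLL-style column corpus.

    Hypothetical helper: `data_folder` is assumed to hold train/dev/test
    files with one token per line (token in column 0, NER tag in column 1).
    """
    # Imported lazily so the constants above can be read without Flair installed.
    from flair.datasets import ColumnCorpus
    from flair.embeddings import TransformerWordEmbeddings
    from flair.models import SequenceTagger
    from flair.trainers import ModelTrainer

    corpus = ColumnCorpus(data_folder, {0: "text", 1: "ner"})
    label_dict = corpus.make_label_dictionary(label_type="ner", add_unk=False)

    # Assumed embedding options; the README does not specify them.
    embeddings = TransformerWordEmbeddings(
        model=BASE_MODEL,
        layers="-1",
        subtoken_pooling="first",
        fine_tune=True,
        use_context=False,
    )

    # A plain linear tagging head on top of the transformer (assumption).
    tagger = SequenceTagger(
        hidden_size=256,
        embeddings=embeddings,
        tag_dictionary=label_dict,
        tag_type="ner",
        use_crf=False,
        use_rnn=False,
        reproject_embeddings=False,
    )

    ModelTrainer(tagger, corpus).fine_tune(output_dir, **HYPERPARAMS)
```

Calling `fine_tune("path/to/elner18", "out/")` would start training; both paths are placeholders.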
+## Results
+- F-score (micro): 0.9169
+- F-score (macro): 0.8735
+- Accuracy: 0.8634
+|Class|precision|recall|f1-score|support|
+|:---|:---|:---|:---|:---|
+|ORG|0.8928|0.8761|0.8844|1388|
+|PERSON|0.9578|0.9724|0.9651|1051|
+|CARDINAL|0.9395|0.9550|0.9472|911|
+|GPE|0.9292|0.9528|0.9408|826|
+|DATE|0.9436|0.9391|0.9414|838|
+|PERCENT|0.9903|0.9951|0.9927|206|
+|LOC|0.8011|0.7921|0.7966|178|
+|ORDINAL|0.9529|0.9419|0.9474|172|
+|NORP|0.8944|0.9007|0.8975|141|
+|TIME|0.9000|0.9197|0.9097|137|
+|EVENT|0.6912|0.7231|0.7068|130|
+|MONEY|0.9818|0.9730|0.9774|111|
+|PRODUCT|0.7191|0.7711|0.7442|83|
+|WORK_OF_ART|0.8272|0.7976|0.8121|84|
+|FAC|0.6757|0.6494|0.6623|77|
+|QUANTITY|0.8507|0.8769|0.8636|65|
+|LAW|0.8400|0.7500|0.7925|28|
+|LANGUAGE|1.0000|0.8889|0.9412|9|
+| | | | | |
+|micro avg|0.9150|0.9187|0.9169|6435|
+|macro avg|0.8771|0.8708|0.8735|6435|
+|weighted avg|0.9150|0.9187|0.9167|6435|
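As a sanity check on the table above, the summary rows can be recomputed from the per-class rows. The sketch below copies the reported precision, recall, F1 and support values and derives the per-class F1 (harmonic mean of precision and recall), the macro average (unweighted mean of per-class F1) and the weighted average (support-weighted mean). Because the inputs are rounded to four decimals, the results should agree with the reported figures only up to rounding. The helper names are illustrative.

```python
# (precision, recall, f1, support) per class, copied from the results table.
REPORT = {
    "ORG": (0.8928, 0.8761, 0.8844, 1388),
    "PERSON": (0.9578, 0.9724, 0.9651, 1051),
    "CARDINAL": (0.9395, 0.9550, 0.9472, 911),
    "GPE": (0.9292, 0.9528, 0.9408, 826),
    "DATE": (0.9436, 0.9391, 0.9414, 838),
    "PERCENT": (0.9903, 0.9951, 0.9927, 206),
    "LOC": (0.8011, 0.7921, 0.7966, 178),
    "ORDINAL": (0.9529, 0.9419, 0.9474, 172),
    "NORP": (0.8944, 0.9007, 0.8975, 141),
    "TIME": (0.9000, 0.9197, 0.9097, 137),
    "EVENT": (0.6912, 0.7231, 0.7068, 130),
    "MONEY": (0.9818, 0.9730, 0.9774, 111),
    "PRODUCT": (0.7191, 0.7711, 0.7442, 83),
    "WORK_OF_ART": (0.8272, 0.7976, 0.8121, 84),
    "FAC": (0.6757, 0.6494, 0.6623, 77),
    "QUANTITY": (0.8507, 0.8769, 0.8636, 65),
    "LAW": (0.8400, 0.7500, 0.7925, 28),
    "LANGUAGE": (1.0000, 0.8889, 0.9412, 9),
}


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def total_support(report: dict) -> int:
    """Number of gold entities across all classes."""
    return sum(row[3] for row in report.values())


def macro_f1(report: dict) -> float:
    """Unweighted mean of the per-class F1 scores."""
    return sum(row[2] for row in report.values()) / len(report)


def weighted_f1(report: dict) -> float:
    """Mean of the per-class F1 scores, weighted by class support."""
    return sum(row[2] * row[3] for row in report.values()) / total_support(report)


if __name__ == "__main__":
    print("support:", total_support(REPORT))
    print("macro F1:", round(macro_f1(REPORT), 4))
    print("weighted F1:", round(weighted_f1(REPORT), 4))
```

The gap between the micro and macro F-scores reflects the class imbalance: frequent classes such as ORG and PERSON dominate the micro average, while low-support classes such as FAC and EVENT pull the macro average down.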
+## Files
+The Flair [training log](training.log) has also been uploaded to the model hub.
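For readers who want to try the model, a minimal usage sketch with Flair follows. It assumes the repository can be loaded directly from the Hugging Face Hub with `SequenceTagger.load`, as is usual for models tagged `flair`; the `tag_entities` helper and the example sentence are illustrative and not part of the README.

```python
# Repository id as shown on the model page.
MODEL_ID = "pprokopidis/elNER18-bert-base-greek-uncased-v1-bs8-e150-lr5e-06"


def tag_entities(text: str, model_id: str = MODEL_ID) -> list:
    """Return (entity text, label, confidence) triples for one sentence.

    Hypothetical helper; downloads the model on first use.
    """
    # Imported lazily so this module can be imported without Flair installed.
    from flair.data import Sentence
    from flair.models import SequenceTagger

    tagger = SequenceTagger.load(model_id)
    sentence = Sentence(text)
    tagger.predict(sentence)

    # Use the tagger's own label type rather than hard-coding a name.
    label_type = tagger.label_type
    return [
        (span.text, span.get_label(label_type).value, span.get_label(label_type).score)
        for span in sentence.get_spans(label_type)
    ]


if __name__ == "__main__":
    # Example sentence chosen for illustration only.
    for text, label, score in tag_entities("Ο Γιώργος Σεφέρης γεννήθηκε στη Σμύρνη το 1900."):
        print(f"{text}\t{label}\t{score:.4f}")
```

For repeated calls it would be better to load the tagger once and reuse it, since `SequenceTagger.load` is the expensive step.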