|
--- |
|
license: cc-by-nc-sa-3.0 |
|
language: |
|
- bg |
|
metrics: |
|
- accuracy |
|
library_name: spacy |
|
pipeline_tag: token-classification |
|
model-index: |
|
- name: bg_news_lg |
|
results: |
|
- task: |
|
name: NER |
|
type: token-classification |
|
metrics: |
|
- name: NER Precision |
|
type: precision |
|
value: 0.9090909091 |
|
- name: NER Recall |
|
type: recall |
|
value: 0.8773461371 |
|
- name: NER F Score |
|
type: f_score |
|
value: 0.8929364727 |
|
- task: |
|
name: TAG |
|
type: token-classification |
|
metrics: |
|
- name: TAG (XPOS) Accuracy |
|
type: accuracy |
|
value: 0.9565136361 |
|
- task: |
|
name: POS |
|
type: token-classification |
|
metrics: |
|
- name: POS (UPOS) Accuracy |
|
type: accuracy |
|
value: 0.9858305885 |
|
- task: |
|
name: MORPH |
|
type: token-classification |
|
metrics: |
|
- name: Morph (UFeats) Accuracy |
|
type: accuracy |
|
value: 0.9645896751 |
|
- task: |
|
name: LEMMA |
|
type: token-classification |
|
metrics: |
|
- name: Lemma Accuracy |
|
type: accuracy |
|
value: 0.9271110013 |
|
- task: |
|
name: UNLABELED_DEPENDENCIES |
|
type: token-classification |
|
metrics: |
|
- name: Unlabeled Attachment Score (UAS) |
|
type: f_score |
|
value: 0.9016132533 |
|
- task: |
|
name: LABELED_DEPENDENCIES |
|
type: token-classification |
|
metrics: |
|
- name: Labeled Attachment Score (LAS) |
|
type: f_score |
|
value: 0.8459063799 |
|
- task: |
|
name: SENTS |
|
type: token-classification |
|
metrics: |
|
- name: Sentences F-Score |
|
type: f_score |
|
value: 0.939802336 |
|
tags: |
|
- ner |
|
- named entity recognition |
|
- spacy |
|
--- |
|
| Feature | Description | |
|
| --- | --- | |
|
| **Name** | `bg_news_trf` | |
|
| **Version** | `3.5.4` | |
|
| **spaCy** | `>=3.5.4,<3.6.0` | |
|
| **Default Pipeline** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` | |
|
| **Components** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` | |
|
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) | |
|
| **Sources** | [UD_Bulgarian-BTB](https://github.com/UniversalDependencies/UD_Bulgarian-BTB) (Kiril Simov and Petya Osenova) | |
|
| **License** | CC-BY-NC-SA-3.0 | |
|
| **Author** | [Ivaylo Sakelariev](https://github.com/sakelariev) | |
|
|
|
Bulgarian large sized pipeline for BGspaCy. Components: tok2vec, tagger, morphologizer, lemmatizer, parser, ner |