---
license: cc-by-nc-sa-3.0
language:
- bg
metrics:
- accuracy
library_name: spacy
pipeline_tag: token-classification
model-index:
- name: bg_news_trf
  results:
  - task:
      name: NER
      type: token-classification
    metrics:
    - name: NER Precision
      type: precision
      value: 0.8890829694
    - name: NER Recall
      type: recall
      value: 0.8886948931
    - name: NER F Score
      type: f_score
      value: 0.8888888889
  - task:
      name: TAG
      type: token-classification
    metrics:
    - name: TAG (XPOS) Accuracy
      type: accuracy
      value: 0.9702076246
  - task:
      name: POS
      type: token-classification
    metrics:
    - name: POS (UPOS) Accuracy
      type: accuracy
      value: 0.9897910505
  - task:
      name: MORPH
      type: token-classification
    metrics:
    - name: Morph (UFeats) Accuracy
      type: accuracy
      value: 0.9764380425
  - task:
      name: LEMMA
      type: token-classification
    metrics:
    - name: Lemma Accuracy
      type: accuracy
      value: 0.9404442198
  - task:
      name: UNLABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Unlabeled Attachment Score (UAS)
      type: f_score
      value: 0.9349327787
  - task:
      name: LABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Labeled Attachment Score (LAS)
      type: f_score
      value: 0.8934417103
  - task:
      name: SENTS
      type: token-classification
    metrics:
    - name: Sentences F-Score
      type: f_score
      value: 0.9241131567
tags:
- ner
- named entity recognition
- spacy
---
| Feature | Description |
| --- | --- |
| **Name** | `bg_news_trf` |
| **Version** | `3.5.4` |
| **spaCy** | `>=3.5.4,<3.6.0` |
| **Default Pipeline** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` |
| **Components** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | [UD_Bulgarian-BTB](https://github.com/UniversalDependencies/UD_Bulgarian-BTB) (Kiril Simov and Petya Osenova), [BERT multilingual base model (uncased)](https://huggingface.co/bert-base-multilingual-uncased) (Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova) |
| **License** | CC-BY-NC-SA-3.0 |
| **Author** | [Ivaylo Sakelariev](https://github.com/sakelariev) |

Bulgarian transformer pipeline for BGspaCy. Components: transformer, tagger, morphologizer, parser, trainable_lemmatizer, ner.
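
A minimal usage sketch, assuming the pipeline has already been installed as a Python package under the name `bg_news_trf` and that the installed spaCy version satisfies the range in the table above. The helper names (`load_pipeline`, `extract_entities`, `describe_tokens`) and the sample sentence are illustrative and not part of the model; the entity labels returned depend on the model's own label scheme, which this card does not list.

```python
from typing import Callable, List, Tuple


def load_pipeline(name: str = "bg_news_trf"):
    """Load the installed pipeline by package name (assumed to be installed)."""
    # spaCy is imported lazily so the helpers below work with any
    # spaCy-compatible pipeline object passed in by the caller.
    import spacy

    return spacy.load(name)


def extract_entities(nlp: Callable, text: str) -> List[Tuple[str, str]]:
    """Return (entity text, entity label) pairs found by the ner component."""
    doc = nlp(text)
    return [(ent.text, ent.label_) for ent in doc.ents]


def describe_tokens(nlp: Callable, text: str) -> List[Tuple[str, str, str, str]]:
    """Return (token, lemma, UPOS tag, dependency label) for each token."""
    doc = nlp(text)
    return [(tok.text, tok.lemma_, tok.pos_, tok.dep_) for tok in doc]


def main() -> None:
    # Hypothetical example sentence: "Sofia is the capital of Bulgaria."
    nlp = load_pipeline()
    text = "София е столицата на България."
    print(extract_entities(nlp, text))
    for row in describe_tokens(nlp, text):
        print(row)
```

Calling `main()` runs the whole pipeline once on the sample sentence. The helpers take the pipeline object as an argument so that the model is loaded only once and reused across texts, since loading a transformer pipeline is comparatively slow.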