---
license: cc-by-nc-sa-3.0
language:
- bg
metrics:
- accuracy
library_name: spacy
pipeline_tag: token-classification
model-index:
- name: bg_news_lg
  results:
  - task:
      name: NER
      type: token-classification
    metrics:
    - name: NER Precision
      type: precision
      value: 0.9090909091
    - name: NER Recall
      type: recall
      value: 0.8773461371
    - name: NER F Score
      type: f_score
      value: 0.8929364727
  - task:
      name: TAG
      type: token-classification
    metrics:
    - name: TAG (XPOS) Accuracy
      type: accuracy
      value: 0.9565136361
  - task:
      name: POS
      type: token-classification
    metrics:
    - name: POS (UPOS) Accuracy
      type: accuracy
      value: 0.9858305885
  - task:
      name: MORPH
      type: token-classification
    metrics:
    - name: Morph (UFeats) Accuracy
      type: accuracy
      value: 0.9645896751
  - task:
      name: LEMMA
      type: token-classification
    metrics:
    - name: Lemma Accuracy
      type: accuracy
      value: 0.9271110013
  - task:
      name: UNLABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Unlabeled Attachment Score (UAS)
      type: f_score
      value: 0.9016132533
  - task:
      name: LABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Labeled Attachment Score (LAS)
      type: f_score
      value: 0.8459063799
  - task:
      name: SENTS
      type: token-classification
    metrics:
    - name: Sentences F-Score
      type: f_score
      value: 0.939802336
tags:
- ner
- named entity recognition
- spacy
---
| Feature | Description |
| --- | --- |
| **Name** | `bg_news_trf` |
| **Version** | `3.5.4` |
| **spaCy** | `>=3.5.4,<3.6.0` |
| **Default Pipeline** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` |
| **Components** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `ner` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | [UD_Bulgarian-BTB](https://github.com/UniversalDependencies/UD_Bulgarian-BTB) (Kiril Simov and Petya Osenova) |
| **License** | CC-BY-NC-SA-3.0 |
| **Author** | [Ivaylo Sakelariev](https://github.com/sakelariev) |
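
The **spaCy** row above pins the pipeline to `>=3.5.4,<3.6.0`. As a rough illustration of what that range means, here is a minimal sketch that checks a version string against it. The helper names `parse_version` and `satisfies_pin` are hypothetical and not part of spaCy; the sketch only compares the first three numeric components and ignores pre-release suffixes.

```python
from __future__ import annotations

import re


def parse_version(text: str) -> tuple[int, int, int]:
    """Extract the leading 'major.minor.patch' numbers, e.g. '3.5.4' -> (3, 5, 4)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"not a major.minor.patch version: {text!r}")
    major, minor, patch = (int(part) for part in match.groups())
    return (major, minor, patch)


def satisfies_pin(installed: str, lower: str = "3.5.4", upper: str = "3.6.0") -> bool:
    """Return True when lower <= installed < upper (the range from the table)."""
    return parse_version(lower) <= parse_version(installed) < parse_version(upper)
```

With spaCy installed, `satisfies_pin(spacy.__version__)` gives a quick indication of whether the environment falls inside the pinned range before the pipeline is loaded.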

Large Bulgarian pipeline for BGspaCy. Components: `tok2vec`, `tagger`, `morphologizer`, `lemmatizer`, `parser`, `ner`.
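
A minimal usage sketch, assuming the pipeline package is already installed in the current environment under the name shown in the table (`bg_news_trf`); substitute the name of the package you actually installed if it differs. The helpers `load_pipeline` and `extract_entities` are hypothetical wrappers around the standard `spacy.load` call and the `Doc.ents` attribute, and the entity labels returned depend on how the `ner` component was trained.

```python
from typing import List, Tuple

# Assumption: package name taken from the table above.
PIPELINE_NAME = "bg_news_trf"


def load_pipeline(name: str = PIPELINE_NAME):
    """Load an installed spaCy pipeline by package name."""
    import spacy  # imported lazily so this module can be imported without spaCy

    return spacy.load(name)


def extract_entities(nlp, text: str) -> List[Tuple[str, str]]:
    """Run the pipeline on text and return (entity text, entity label) pairs."""
    doc = nlp(text)
    return [(ent.text, ent.label_) for ent in doc.ents]
```

For example, `extract_entities(load_pipeline(), "София е столицата на България.")` returns the named entities found by the `ner` component. The same `doc` object also exposes `token.pos_`, `token.lemma_`, and `token.dep_` for the other components.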