data-silence
committed on
Commit • c0bbe77
1 Parent(s): 580c8bd
Update README.md
README.md CHANGED
@@ -29,7 +29,8 @@ should probably proofread and complete it, then remove this comment. -->
 # Model description
 
 This model is a fine-tuned version of [sentence-transformers/LaBSE](https://huggingface.co/sentence-transformers/LaBSE) on my [news dataset](https://huggingface.co/datasets/data-silence/rus_news_classifier).
-The goal of this model was to create a universal model for categorizing Russian-language news that would preserve the ability of the basic LaBSE model to generate multi-lingual text embeddings in a single vector space.
+The goal of this model was to create a universal model for categorizing Russian-language news that would preserve the ability of the basic LaBSE model to generate multi-lingual text embeddings in a single vector space.
+It should be noted that the model allows to classify news articles in other languages available in LaBSE, but the quality of such classification will be worse than Russian-language news texts.
 The learning news dataset is a well-balanced sample of recent news from the last five years.
 
 It achieves the following results on the evaluation set: