Update README.md
#1
by
alession
- opened
README.md
CHANGED
@@ -1,3 +1,6 @@
|
|
1 |
---
|
2 |
license: eupl-1.1
|
3 |
---
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: eupl-1.1
|
3 |
---
|
4 |
+
SeTABERTa is a new multilingual language model pretained from scratch using various Open Access text repositories: EU legislation, research articles, EU public documents and US patents.
|
5 |
+
2/3 of training data is English. The other part of data covers EU24 languages.
|
6 |
+
The model was trained on JRC Big Data Platform. The model can be fine-tuned for other tasks.
|