alession committed on
Commit
955c988
1 Parent(s): 9f802ee

Update README.md

Added model description

Files changed (1)
  1. README.md +3 -0
README.md CHANGED
@@ -1,3 +1,6 @@
  ---
  license: eupl-1.1
  ---
+ SeTABERTa is a new multilingual language model pretrained from scratch on several Open Access text repositories: EU legislation, research articles, EU public documents, and US patents.
+ Two thirds of the training data is English; the remainder covers the EU24 languages.
+ The model was trained on the JRC Big Data Platform. It can be fine-tuned for other tasks.