
BERTuit model, as presented in the article "BERTuit: Understanding Spanish language in Twitter through a native transformer".

Before tokenization, replace user tags and URLs with "" and "" respectively.
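
The replacement step could be sketched in Python roughly as follows. This is a minimal sketch, not the authors' preprocessing code: the function name `mask_tweet`, the regular expressions, and the idea of passing the placeholder strings as arguments are all assumptions made for illustration. The placeholder strings are left to the caller on purpose, so the values the model expects have to be supplied.

```python
import re

# Assumed patterns (not from the model card): a URL starts with http(s):// or www.,
# and a user tag is "@" followed by word characters. The URL alternative is tried
# first, so an "@" that sits inside a URL is not treated as a user tag.
_MASK_PATTERN = re.compile(r"(?P<url>https?://\S+|www\.\S+)|(?P<user>@\w+)")


def mask_tweet(text: str, user_token: str, url_token: str) -> str:
    """Replace every URL with url_token and every user tag with user_token.

    Both placeholder strings are supplied by the caller. The text is scanned
    once, so an inserted placeholder is never matched again.
    """

    def _replace(match: re.Match) -> str:
        # Returning the token from a function keeps it literal
        # (no backslash or group-reference expansion).
        return url_token if match.group("url") is not None else user_token

    return _MASK_PATTERN.sub(_replace, text)
```

A call such as `mask_tweet("@ana mira https://t.co/abc ya", "USER", "URL")` yields `"USER mira URL ya"`; here `"USER"` and `"URL"` are example values only, not the model's placeholders. Note that the URL pattern also swallows punctuation that directly follows a link.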

Tokenize text with the base RobertaTokenizer class from the transformers library.
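
Loading the tokenizer might look like the sketch below, assuming the Hugging Face `transformers` package is installed. The helper names `load_tokenizer` and `encode` are hypothetical, and `model_path` stands for wherever the BERTuit files live (a local directory or a Hub repository id); this card does not state that identifier, so it is left as a parameter.

```python
from typing import List


def load_tokenizer(model_path: str):
    """Load the tokenizer files found at model_path with RobertaTokenizer.

    model_path is an assumption of this sketch: a local directory or a
    Hub repository id that contains the BERTuit tokenizer files.
    """
    # Imported here so the module can be loaded without transformers installed.
    from transformers import RobertaTokenizer

    return RobertaTokenizer.from_pretrained(model_path)


def encode(tokenizer, texts: List[str]):
    """Tokenize already-preprocessed texts, padding to the longest one in the batch."""
    return tokenizer(texts, padding=True, truncation=True)
```

The texts passed to `encode` are expected to have had user tags and URLs replaced beforehand, as described above.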