robertuito-base-uncased
WORK IN PROGRESS
RoBERTa model trained on tweets.
For the time being, please use this function before feeding it to the model. We still need to create a proper tokenizer for this model
Masked LM
To test the masked LM, take into account that space is encoded inside SentencePiece's tokens. So, if you want to test
Este es un día<mask>
don't put a space between día
and <mask>