Pablo committed
Commit • ae788a5
Parent(s): e951a81
Further format improvements
app.py
CHANGED
@@ -52,12 +52,13 @@ st.sidebar.image(LOGO)
 # Body
 st.markdown(
 """
-BERTIN is a series of BERT-based models for Spanish.
+BERTIN is a series of BERT-based models for Spanish.
+
 The models are trained with Flax and using TPUs sponsored by Google since this is part of the
 [Flax/Jax Community Week](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104)
 organised by HuggingFace.
 
-All models are variations of RoBERTa-base trained from scratch in Spanish using the mc4 dataset
+All models are variations of **RoBERTa-base** trained from scratch in **Spanish** using the **mc4 dataset**.
 We reduced the dataset size to 50 million documents to keep training times shorter, and also to be able to bias training examples based on their perplexity.
 
 The idea is to favour examples with perplexities that are neither too small (short, repetitive texts) or too long (potentially poor quality).
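The perplexity-biased sampling the diff text describes can be sketched as follows. This is a minimal illustration only: the function names, perplexity thresholds, and keep probabilities are hypothetical and are not taken from BERTIN's actual implementation.

```python
import random

def keep_probability(perplexity, low=50.0, high=500.0):
    """Favour documents whose perplexity falls inside a middle band.

    Very low perplexity suggests short, repetitive text; very high
    perplexity suggests potentially poor-quality text. The band edges
    here are illustrative placeholders.
    """
    if low <= perplexity <= high:
        return 1.0  # mid-band documents are always kept
    return 0.1      # extremes are heavily down-sampled, not dropped outright

def sample_documents(docs, rng=random.random):
    """docs: iterable of (text, perplexity) pairs.

    Keeps each document with probability keep_probability(perplexity),
    biasing the training sample toward mid-perplexity examples.
    """
    return [text for text, ppl in docs if rng() < keep_probability(ppl)]
```

With a fixed random draw of 0.5, only the mid-band document survives; with a draw of 0.05, even the extremes pass their 0.1 keep probability.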