yhavinga commited on
Commit
ca47dc3
1 Parent(s): 0efe2a4

Autoupdate README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -4
README.md CHANGED
@@ -119,10 +119,7 @@ The `ul2-base-dutch-english` T5 model was pre-trained simultaneously on a combin
119
  including the `full_en_nl` config of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
120
  crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), the English subset of Wikipedia (2022-03-01),
121
  and a subset of "mc4_nl_cleaned"
122
- containing only texts from Dutch and Belgian newspapers. This last dataset is oversampled to bias the model
123
- towards descriptions of events in the Netherlands and Belgium.
124
-
125
-
126
 
127
  ## Training procedure
128
 
 
119
  including the `full_en_nl` config of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
120
  crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), the English subset of Wikipedia (2022-03-01),
121
  and a subset of "mc4_nl_cleaned"
122
+ containing only texts from Dutch newspapers.
 
 
 
123
 
124
  ## Training procedure
125