Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ See also the fine-tuned [t5-base-dutch-demo](https://huggingface.co/flax-communi
|
|
17 |
|
18 |
## Dataset
|
19 |
|
20 |
-
This model was trained on a cleaned version of
|
21 |
See the `clean` directory for the clean script.
|
22 |
|
23 |
* Documents that contained words from a selection of the Dutch and English [List of Dirty Naught Obscene and Otherwise Bad Words](https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words) are removed
|
@@ -29,4 +29,4 @@ See the `clean` directory for the clean script.
|
|
29 |
|
30 |
## Training
|
31 |
|
32 |
-
The model was trained for 63000 steps with a batch size of 128, ending
|
|
|
17 |
|
18 |
## Dataset
|
19 |
|
20 |
+
This model was trained on a cleaned version of the Dutch part of [mC4](https://huggingface.co/datasets/mc4).
|
21 |
See the `clean` directory for the clean script.
|
22 |
|
23 |
* Documents that contained words from a selection of the Dutch and English [List of Dirty Naught Obscene and Otherwise Bad Words](https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words) are removed
|
|
|
29 |
|
30 |
## Training
|
31 |
|
32 |
+
The model was trained for 63000 steps with a batch size of 128, ending with an evaluation loss of 1.79 and accuracy of 0.64.
|