Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
flax-community
/
t5-base-dutch
like
4
Follow
Flax Community
318
Text2Text Generation
Transformers
PyTorch
google-tensorflow
TensorFlow
JAX
TensorBoard
yhavinga/mc4_nl_cleaned
t5
seq2seq
lm-head
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
29fb4ca
t5-base-dutch
2 contributors
History:
44 commits
yhavinga
Add latest log and script
29fb4ca
over 3 years ago
clean
Saving weights and logs of step 900
over 3 years ago
runs
Add latest log and script
over 3 years ago
.gitattributes
Safe
737 Bytes
initial commit
over 3 years ago
.gitignore
Safe
38 Bytes
Saving weights and logs of step 300
over 3 years ago
Determine_batch_size.ipynb
Safe
6.45 kB
Saving weights and logs of step 16000
over 3 years ago
Load_preprocessed_dataset.ipynb
Safe
13.4 kB
Update weights and scripts
over 3 years ago
Load_token_group_dataset.ipynb
Safe
34.1 kB
Update scripts. Add dataset load check notebook
over 3 years ago
config.json
Safe
1.42 kB
Update scripts to work around collator valueerror. Update weights
over 3 years ago
create_config.py
Safe
148 Bytes
Add config, tokenizer and training script
over 3 years ago
flax_model.msgpack
Safe
892 MB
LFS
Update weights
over 3 years ago
flax_to_pt.py
Safe
292 Bytes
Update weights and scripts
over 3 years ago
opt_state.msgpack
Safe
1.99 MB
LFS
Update weights
over 3 years ago
pytorch_model.bin
Safe
892 MB
LFS
Update weights and scripts
over 3 years ago
run_t5.sh
Safe
2.07 kB
Add latest log and script
over 3 years ago
run_t5_mlm_flax.py
Safe
73 Bytes
Add config, tokenizer and training script
over 3 years ago
run_t5_mlm_flax_custom_dataset.py
Safe
44.7 kB
Update weights and scripts
over 3 years ago
streaming_dataset_filter_test.py
Safe
2.96 kB
Update weights and scripts
over 3 years ago
t5_tokenizer_model.py
Safe
76 Bytes
Add config, tokenizer and training script
over 3 years ago
tf_model.h5
Safe
892 MB
LFS
Update weights and scripts
over 3 years ago
tokenizer.json
Safe
1.03 MB
Retrain tokenizer for case sensitive
over 3 years ago
train_tokenizer.py
Safe
2 kB
Retrain tokenizer for case sensitive
over 3 years ago
training_state.json
Safe
15 Bytes
Update weights
over 3 years ago