ai-forever/FRED-T5-1.7B

sberbank-ai committed on Jan 23, 2023

Commit c9bafea • 1 Parent(s): 1e3415b

Update README.md

Files changed (1)

README.md +1 -1

@@ -16,7 +16,7 @@ It trained on Russian language corpus (300GB).   Dataset is the same as for ruT5
 Bbpe tokenizer. 50257 + special tokens 107. Prefix tokens: '\<LM\>', '\<SC1>',.. '\<SC6>'
-First half of the time model trained on the small part of all datasets (1%,3GB) and without prefixes in each task.
+First half of the time model trained on the small part of all datasets (1%,3GB) and without prefixes in each tasks.
 For RSG we trained as described in the T5 paper. First, we trained multitask for all tasks. Then we took the best checkpoint for the task and trained it further.
 RSG submit here https://russiansuperglue.com/login/submit_info/1936
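
The README context in this diff lists a base vocabulary of 50257 tokens plus 107 special tokens, and seven prefix tokens (`<LM>`, `<SC1>` through `<SC6>`). A minimal Python sketch of how those pieces fit together is given here. The helper names `total_vocab_size` and `add_prefix` are hypothetical, and joining the prefix to the text with no separator is an assumption, not something the diff states. The backslashes in the README (`'\<LM\>'`) are taken to be Markdown escapes rather than part of the token strings, which is also an assumption.

```python
# Illustrative sketch only: the counts and token strings come from the README
# text in the diff; every function name here is hypothetical.

BASE_VOCAB_SIZE = 50257      # "Bbpe tokenizer. 50257"
NUM_SPECIAL_TOKENS = 107     # "+ special tokens 107"

# Prefix tokens named in the README: <LM>, <SC1> .. <SC6>.
PREFIX_TOKENS = ["<LM>"] + [f"<SC{i}>" for i in range(1, 7)]


def total_vocab_size() -> int:
    """Return the base vocabulary size plus the special-token count."""
    return BASE_VOCAB_SIZE + NUM_SPECIAL_TOKENS


def add_prefix(text: str, prefix: str = "<LM>") -> str:
    """Prepend one of the listed prefix tokens to the input text.

    Assumption: the prefix is concatenated directly, with no separator.
    """
    if prefix not in PREFIX_TOKENS:
        raise ValueError(f"unknown prefix token: {prefix!r}")
    return prefix + text


if __name__ == "__main__":
    print(total_vocab_size())
    print(PREFIX_TOKENS)
    print(add_prefix("example input", "<SC1>"))
```

A string built this way would then be passed to the model's tokenizer. Which prefix suits which task is not described in this diff, so the choice is left to the caller.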