metadata
license: apache-2.0
datasets:
  - allenai/c4
language:
  - en

nanoT5-base-65kBPE-v2

This is a "raw" pretrained model intended to be fine-tuned on downstream tasks.
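Since the checkpoint is meant as a starting point for fine-tuning, a minimal loading sketch may help. It assumes the model is published on the Hugging Face Hub under the repo id `pszemraj/nanoT5-base-65kBPE-v2` (inferred from the card title and author, not stated in this card) and that it loads with the standard `transformers` seq2seq auto classes. The helper name `load_for_finetuning` is hypothetical.

```python
# Assumption: repo id inferred from the card title and author name;
# adjust it if the checkpoint lives elsewhere.
MODEL_ID = "pszemraj/nanoT5-base-65kBPE-v2"


def load_for_finetuning(model_id: str = MODEL_ID):
    """Return (tokenizer, model) as a starting point for fine-tuning.

    Assumes the checkpoint is compatible with the generic seq2seq
    auto classes in transformers.
    """
    # Imported here so the module can be imported without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    return tokenizer, model
```

Calling `load_for_finetuning()` downloads the weights and returns objects that can be passed to a fine-tuning loop of your choice. Because the model is only pretrained, its generations are unlikely to be useful before fine-tuning.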

training code: https://github.com/pszemraj/nanoT5/tree/any-tokenizer

plots

more details are under checkpoints/

loss

(figure: loss plot)

gradients

(figure: gradients plot)

weights

(figure: weights plot)