monika_nano_10m / README.md
922CA's picture
Update README.md
8581451 verified
metadata
license: other
datasets:
  - 922-CA/MoCha_v1

Pretrained toy model, based off Monika (DDLC). Data in text version instead of jsonl. Made with Andrej Karpathy's NanoGPT, ~2023. All default parameters are used from Shakespeare example except for iters (1000 instead of 5000).