monika_nano_10m / README.md
922CA's picture
Update README.md
8581451 verified
---
license: other
datasets:
- 922-CA/MoCha_v1
---
Pretrained toy model, based off Monika (DDLC). Data in text version instead of jsonl.
Made with Andrej Karpathy's NanoGPT, ~2023.
All default parameters are used from Shakespeare example except for iters (1000 instead of 5000).