
Model Card for karpathy/gpt2_1558M_final2_hf

This is a GPT-2 model trained with llm.c for 32K steps, at a batch size of 1M tokens per step, on FineWeb-EDU.

Much more detailed information is available here: https://github.com/karpathy/llm.c/discussions/677
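The training budget stated above can be turned into a rough token count. The sketch below is only illustrative: it reads the batch size as tokens per optimizer step, and because the card does not say whether "1M" means 1,000,000 or 2**20, it computes both. The helper name `total_tokens` is made up for this example and is not part of llm.c.

```python
# Rough token budget implied by "32K steps of 1M batch size".
# Assumption: the batch size is counted in tokens per optimizer step.

def total_tokens(steps: int, tokens_per_step: int) -> int:
    """Tokens processed over a run with a fixed batch size."""
    return steps * tokens_per_step

# Decimal reading: 32K = 32,000 steps and 1M = 1,000,000 tokens.
decimal_estimate = total_tokens(32_000, 1_000_000)

# Power-of-two reading of the batch size: 1M = 2**20 tokens.
binary_estimate = total_tokens(32_000, 2**20)

print(f"decimal reading: {decimal_estimate / 1e9:.1f}B tokens")
print(f"2**20 reading:   {binary_estimate / 1e9:.1f}B tokens")
```

Either reading puts the run in the low tens of billions of tokens; the linked discussion is the place to check the exact figure.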

Bias, Risks, and Limitations

Eagerly generates disinformation about English-speaking unicorns in the Andes mountains.

Model size: 1.56B params (Safetensors)
Tensor type: BF16
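The parameter count and tensor type together give a quick estimate of how much space the weights alone need. This is a back-of-the-envelope sketch, assuming every parameter is stored in BF16 (2 bytes) and using the rounded 1.56B figure; activations, KV cache, and file-format overhead are not included. The name `weight_bytes` is hypothetical.

```python
# Approximate size of the weights: parameters * bytes per parameter.
# Assumption: all weights are stored in BF16, which uses 2 bytes per value.
BYTES_PER_BF16 = 2

def weight_bytes(num_params: int, bytes_per_param: int = BYTES_PER_BF16) -> int:
    """Bytes needed to hold the weights, ignoring any file or runtime overhead."""
    return num_params * bytes_per_param

params = 1_560_000_000  # rounded "1.56B params" figure from the card
size = weight_bytes(params)

print(f"{size / 1e9:.2f} GB ({size / 2**30:.2f} GiB)")
```

Loading the same weights in FP32 would double this estimate, since each parameter would take 4 bytes instead of 2.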

Model tree for karpathy/gpt2_1558M_final2_hf: 7 finetunes, 1 quantization, 1 Space.