Standard roberta-large model fine-tuned for one pass over the entire Pile dataset.

See Test-time training on nearest neighbors for large language models for details.

Downloads last month
112
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Dataset used to train socialfoundations/roberta-large-pile-lr2e-5-bs16-8gpu-1700000