
Dataset used to train SantaCoder

#43
by nihaljn - opened

Which of the two datasets, The Stack (v1.1) or The Stack Dedup (v1.1), was used to train SantaCoder?

The SantaCoder repo links to the former, but can this be confirmed?
