This collection contains the models and datasets used in the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"