🇮🇹 BLOOM-IT
Collection
Models and datasets for BLOOM adapted to Italian.
•
10 items
•
Updated
The model is obtained by performing language adaptation on the original bloom-1b7 model. In detail, we continued the pre-training on Italian-specific data without adaptation of the vocabulary. We use about 2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024). The model is trained for one epoch using LoRA and SFT.
2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024).
LoRA and SFT.
BibTeX:
APA:
Pierpaolo Basile, University of Bari Aldo Moro, Italy.
Pierpaolo Basile, University of Bari Aldo Moro, Italy.