This model was trained on 24GB of RTX A500 on zicsx/mC4-Hindi-Cleaned-3.0 dataset (1%) for 3 hours.
We used Hugging Face PEFT-LoRA PyTorch for training.
Transtokenization process in --
Base model