metadata

license: mit
tags:
  - generated_from_trainer
model-index:
  - name: indic-gpt
    results: []

indic-gpt

This model is a fine-tuned version of gpt2 on an Indian-language corpus (https://ai4bharat.iitm.ac.in/corpora). A sample of the dataset is available at https://huggingface.co/datasets/aashay96/indic-gpt. Validation loss over the course of training is reported in the training results table below.

Model description

The model is trained on multiple Indian languages: Assamese, Bengali, Gujarati, Kannada, Malayalam, Telugu, Tamil, Odia, and Punjabi.

More information needed

TBD - Evaluation on indic_glue

Check the notebook!

The hyperparameters used during training are not listed.

Training results

Training Loss	Epoch	Step	Validation Loss
3.3653	0.3	500	2.2985
2.2079	0.61	1000	2.0401
2.0396	0.91	1500	1.9482
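
The validation losses above can be read as perplexities, which some readers find easier to interpret. The following is a minimal sketch, assuming the reported loss is the mean per-token cross-entropy in nats (the usual convention for causal language model training with the Hugging Face Trainer, but not confirmed by this card). The names `validation_loss` and `perplexity` are illustrative only.

```python
import math

# (step -> validation loss) pairs copied from the training results table.
validation_loss = {500: 2.2985, 1000: 2.0401, 1500: 1.9482}


def perplexity(loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(loss_nats)


# Assumption: the losses are natural-log cross-entropies averaged over tokens.
perplexities = {step: perplexity(loss) for step, loss in validation_loss.items()}

for step, ppl in perplexities.items():
    print(f"step {step}: validation perplexity ~ {ppl:.2f}")
```

Under that assumption, the perplexity is measured over tokens of the model's tokenizer, not over words or characters, so it is not directly comparable with models that use a different tokenizer.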