Telugu Question-Answering model trained on Tydiqa dataset from Google

How to use

Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu

from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
                                          clean_text=False,
                                          handle_chinese_chars=False,
                                          strip_accents=False,
                                          wordpieces_prefix='##')
nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
result = nlp({'question': question, 'context': context})

Training data

I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa

PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.

Downloads last month: 10

Safetensors

Model size

0.1B params

Tensor type

I64

F32

kuppuluri
/

telugu_bertu_tydiqa

Telugu Question-Answering model trained on Tydiqa dataset from Google

How to use

Training data

Space using kuppuluri/telugu_bertu_tydiqa 1