license: mit
datasets:
- davmel/ka_homonym_disambiguation
language:
- ka
This model determines which sense of the Georgian homonym "ბარი" is intended at the position marked by the [MASK] token.
It is a simple Transformer model fine-tuned on a dataset of 6,000 hand-classified sentences.
I masked the homonym in each sentence and replaced it with a synonym matching the sense in which it was used. For example, I replaced "ბარი" with "დაბლობი" (lowland) where the homonym referred to a lowland.
The model predicts "თო" when it interprets the homonym as "shovel", "დაბ" when it interprets it as "lowland", and "კაფე" when it interprets it as "cafe".
I fine-tuned it from a pre-trained Transformer model, which I downloaded from: https://huggingface.co/Davit6174/georgian-distilbert-mlm
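
The token-to-sense mapping described above could be applied to the model's output roughly as follows. This is a minimal sketch, not the exact code I used: it assumes the model is loaded through the `fill-mask` pipeline of the `transformers` library, and the helper names `pick_sense` and `disambiguate`, as well as the `model_id` argument, are hypothetical. Pass this model's repository id on the Hub as `model_id`.

```python
# Predicted token -> sense of "ბარი", as listed in this model card.
TOKEN_TO_SENSE = {
    "თო": "shovel",
    "დაბ": "lowland",
    "კაფე": "cafe",
}


def pick_sense(predictions):
    """Return (sense, score) for the highest-scoring known token, or None.

    `predictions` is a list of dicts with "token_str" and "score" keys,
    which is the shape the fill-mask pipeline returns for a single [MASK].
    """
    for pred in sorted(predictions, key=lambda p: p["score"], reverse=True):
        token = pred["token_str"].strip()
        if token in TOKEN_TO_SENSE:
            return TOKEN_TO_SENSE[token], pred["score"]
    return None


def disambiguate(sentence, model_id):
    """Run the model on a sentence containing exactly one [MASK] token.

    `model_id` is a placeholder (assumption): the Hub id of this model.
    """
    from transformers import pipeline  # imported lazily; requires `transformers`

    fill_mask = pipeline("fill-mask", model=model_id)
    return pick_sense(fill_mask(sentence))


# Usage (replace the placeholders with a real sentence and this model's id):
# result = disambiguate("... [MASK] ...", "<this-model-id>")
# if result is not None:
#     sense, score = result
```

`pick_sense` returns `None` when none of the top predictions is one of the three expected tokens, so such cases can be handled explicitly instead of being forced into a sense.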