---
library_name: peft
tags:
- translation
- code
- instruct
- gemma
datasets:
- cfilt/iitb-english-hindi
base_model: google/gemma-2-2b-it
license: apache-2.0
---
### Finetuning Overview:
**Model Used:** google/gemma-2-2b-it

**Dataset:** cfilt/iitb-english-hindi
#### Dataset Insights:
The IIT Bombay English-Hindi corpus contains an English-Hindi parallel corpus as well as a monolingual Hindi corpus collected from various sources. It has been used in the Workshop on Asian Language Translation shared task since 2016 for the Hindi-to-English and English-to-Hindi language pairs, and as a pivot language pair for Hindi-to-Japanese and Japanese-to-Hindi translation.
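The corpus can be pulled directly from the Hugging Face Hub for inspection. The sketch below assumes the standard `datasets` translation format (one `en`/`hi` pair per record):

```python
# Load the corpus and peek at one sentence pair.
# Field names assume the standard "translation" format ({"en": ..., "hi": ...}).
from datasets import load_dataset

ds = load_dataset("cfilt/iitb-english-hindi", split="train")
example = ds[0]["translation"]
print(example["en"])
print(example["hi"])
```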
#### Finetuning Details:
Using [MonsterAPI](https://monsterapi.ai)'s [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm), this finetuning:
- Was cost-effective.
- Completed in 1 hour and 33 minutes for 0.1 epochs.
- Cost `$1.91` in total.
#### Hyperparameters & Additional Details:
- **Epochs:** 0.1
- **Total Finetuning Cost:** $1.91
- **Model Path:** google/gemma-2-2b-it
- **Learning Rate:** 0.001
- **Data Split:** 100% Train
- **Gradient Accumulation Steps:** 16
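
For reference, a rough `transformers`/`peft` equivalent of these settings is sketched below. The actual run used MonsterAPI's hosted finetuner, and the LoRA rank/alpha values shown are assumptions, not taken from this card:

```python
# Rough local equivalent of the listed hyperparameters, for reference only.
from transformers import TrainingArguments
from peft import LoraConfig

training_args = TrainingArguments(
    output_dir="gemma-2-2b-it-hindi-translation",  # placeholder output path
    num_train_epochs=0.1,                          # Epochs: 0.1
    learning_rate=1e-3,                            # Learning Rate: 0.001
    gradient_accumulation_steps=16,                # Gradient Accumulation Steps: 16
)

lora_config = LoraConfig(
    r=16,                   # assumed; not listed on this card
    lora_alpha=32,          # assumed; not listed on this card
    task_type="CAUSAL_LM",
)
```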
##### Prompt Template
```
<bos><start_of_turn>user
{PROMPT}<end_of_turn>
<start_of_turn>model
{OUTPUT} <end_of_turn>
<eos>
```
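
A minimal inference sketch that applies this template with the adapter loaded through `peft`; the adapter id below is a placeholder for this model's actual Hub id:

```python
# Load the base model with the LoRA adapter via peft and apply the prompt template above.
import torch
from transformers import AutoTokenizer
from peft import AutoPeftModelForCausalLM

adapter_id = "path-or-repo-id-of-this-adapter"  # placeholder: replace with this model's Hub id

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b-it")
model = AutoPeftModelForCausalLM.from_pretrained(
    adapter_id, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "Translate the following sentence to Hindi: The weather is pleasant today."
text = f"<bos><start_of_turn>user\n{prompt}<end_of_turn>\n<start_of_turn>model\n"

# The template already includes <bos>, so skip the tokenizer's special tokens.
inputs = tokenizer(text, return_tensors="pt", add_special_tokens=False).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```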
Training loss:
![training loss](train-loss.png "Training loss")