ritika-kumar commited on
Commit
ec881af
1 Parent(s): 486e62c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -1
README.md CHANGED
@@ -1,4 +1,49 @@
1
  ---
 
 
2
  datasets:
3
  - ai4bharat/samanantar
4
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
+ base_model: Helsinki/opus-mt-en-fr
4
  datasets:
5
  - ai4bharat/samanantar
6
+ language:
7
+ - en
8
+ - hi
9
+ metrics:
10
+ - bleu
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # Finetuning
17
+
18
+ This model is a fine-tuned version of [Helsinki/opus-mt-en-fr]([https://huggingface.co/Helsinki-NLP/opus-mt-en-mul](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr)) on the samanantar dataset.
19
+ source group: English
20
+ target group: Hindi
21
+ model: transformer
22
+
23
+ ## Model description
24
+
25
+ Helsinki/opus-mt-en-fr finetuned for translation task in Hindi language
26
+
27
+ ## Training and evaluation data
28
+
29
+ ai4bharath/samanantar
30
+
31
+ ### Training hyperparameters
32
+
33
+ The following hyperparameters were used during training:
34
+ - learning_rate: 2e-5
35
+ - warmup_steps: 500
36
+ - gradient_accumulation_steps: 2
37
+ - total_train_batch_size: 16
38
+ - num_epochs: 3
39
+
40
+ ### Benchamark Evaluation
41
+ - BLEU score on Tatoeba: 21.93456688254154
42
+ - BLUE score on IN-22: 15.545328465618004
43
+
44
+ ### Framework versions
45
+
46
+ - Transformers 4.42.3
47
+ - Pytorch 2.1.2
48
+ - Datasets 2.20.0
49
+ - Tokenizers 0.19.1