JairamKanna committed on
Commit
27f48e1
·
1 Parent(s): 7f6ea64

Create README.md

Files changed (1)
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ language:
+ - ta
+ metrics:
+ - wer
+ library_name: transformers
+ pipeline_tag: automatic-speech-recognition
+ ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+ This model is a fine-tuned version of the Amrrs/wav2vec2-large-xlsr-53-tamil model, trained on Tamil speech data.
+
+ ## Model Details
+
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+ This model can be used for Tamil automatic speech recognition, or as a starting point for further fine-tuning of the wav2vec2-large model on Tamil.
+
+ #### Preprocessing [optional]
+
+ * Resampling to 16000 Hz
+ * Log-Mel Spectrogram extraction
+
+
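The resampling step listed above can be illustrated with a minimal, dependency-free sketch. The function name `resample_linear` is hypothetical and is not taken from this repository. It uses plain linear interpolation with no anti-aliasing filter, so it only shows the idea of mapping audio to 16000 Hz; a real pipeline would more likely rely on an audio library's resampler.

```python
def resample_linear(samples, orig_sr, target_sr=16000):
    """Naive linear-interpolation resampler (illustrative only, no low-pass filter)."""
    if orig_sr == target_sr:
        return list(samples)
    # Number of output samples covering the same duration at the new rate.
    n_out = int(len(samples) * target_sr / orig_sr)
    out = []
    for i in range(n_out):
        # Fractional position of output sample i in the input signal.
        pos = i * orig_sr / target_sr
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


# One second of silence recorded at 48 kHz becomes 16000 samples.
one_second = [0.0] * 48000
resampled = resample_linear(one_second, orig_sr=48000)
```

Because the sketch skips low-pass filtering, downsampling real speech this way can introduce aliasing; it is meant only to make the sample-rate arithmetic concrete.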
+ #### Training Hyperparameters
+
+ training_args = TrainingArguments(
+     output_dir="armms-xlsr",
+     group_by_length=True,
+     per_device_train_batch_size=4,
+     gradient_accumulation_steps=3,
+     evaluation_strategy="steps",
+     num_train_epochs=30,
+     gradient_checkpointing=True,
+     fp16=True,
+     seed=42,
+     save_steps=200,
+     eval_steps=200,
+     logging_steps=200,
+     learning_rate=3e-4,
+     warmup_steps=500,
+     save_total_limit=2,
+     push_to_hub=True,
+ )
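To make the interaction between these hyperparameters concrete, the sketch below works out the effective batch size and the learning rate during warmup. It assumes a single training device (`num_devices = 1` is an assumption, the README does not state the hardware) and a linear warmup from zero, which is the default scheduler behaviour in `transformers` when none is specified. The helper `warmup_lr` is a hypothetical name used only for illustration.

```python
# Values copied from the TrainingArguments above.
per_device_train_batch_size = 4
gradient_accumulation_steps = 3
learning_rate = 3e-4
warmup_steps = 500
eval_steps = 200

num_devices = 1  # assumption: hardware is not stated in the README

# Examples consumed per optimizer update.
effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)

# Training examples seen between two consecutive evaluations/checkpoints.
examples_between_evals = effective_batch_size * eval_steps


def warmup_lr(step):
    """Learning rate during linear warmup; constant peak value afterwards.

    The decay phase after warmup is omitted because it depends on the total
    number of training steps, which the README does not give.
    """
    if step < warmup_steps:
        return learning_rate * step / warmup_steps
    return learning_rate


print(effective_batch_size)    # 12
print(examples_between_evals)  # 2400
```

Under these assumptions, each optimizer step sees 12 examples, and the peak learning rate of 3e-4 is reached only after the first 500 optimizer steps.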