|
--- |
|
language: |
|
- ta |
|
metrics: |
|
- wer |
|
pipeline_tag: automatic-speech-recognition |
|
library_name: adapter-transformers |
|
--- |
|
# Model Card for Model ID |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
|
|
This model is a version of XLS-R fine-tuned for automatic speech recognition on Tamil speech data from Tamil Vulnerable Speech Recognition. |
|
## Model Details |
|
|
|
### Model Description |
|
|
|
<!-- Provide a longer summary of what this model is. --> |
|
|
|
|
|
- **Model type:** Transformer-based model |
|
- **Language(s):** Tamil |
|
- **Finetuned from model [optional]:** XLS-R |
|
|
|
### Model Sources [optional] |
|
|
|
<!-- Provide the basic links for the model. --> |
|
|
|
- **Repository:** [More Information Needed] |
|
- **Paper [optional]:** [More Information Needed] |
|
- **Demo [optional]:** [More Information Needed] |
|
|
|
## Uses |
|
|
|
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> |
|
The model performs speech-to-text transcription for Tamil. |
|
### Direct Use |
|
|
|
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. --> |
|
|
|
[More Information Needed] |
|
|
|
### Downstream Use [optional] |
|
|
|
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app --> |
|
|
|
Tamil speech recognition. |
|
|
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. --> |
|
Tamil vulnerable speech dataset. |
|
|
|
|
|
#### Preprocessing [optional] |
|
|
|
All .wav files are resampled to 16,000 Hz, and a log-Mel spectrogram is extracted from each. |
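
The preprocessing described here can be sketched roughly as follows. This is an illustrative NumPy-only example, not the pipeline actually used for this model: the function names are hypothetical, the frame settings (`n_fft=400`, `hop=160`, `n_mels=80`) are assumed values that this card does not state, and the linear-interpolation resampler is a simplification (a production pipeline would normally use a band-limited resampler from an audio library).

```python
import numpy as np

def resample_linear(wave, orig_sr, target_sr=16000):
    """Resample a 1-D signal by linear interpolation (simplified: no anti-aliasing)."""
    if orig_sr == target_sr:
        return wave.astype(np.float32)
    n_out = int(round(len(wave) * target_sr / orig_sr))
    old_t = np.arange(len(wave)) / orig_sr   # timestamps of the input samples
    new_t = np.arange(n_out) / target_sr     # timestamps of the output samples
    return np.interp(new_t, old_t, wave).astype(np.float32)

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=np.float64) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=np.float64) / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters evenly spaced on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    hz_points = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(wave, sr=16000, n_fft=400, hop=160, n_mels=80):
    """Return log-mel features of shape (n_frames, n_mels). All defaults are assumptions."""
    if len(wave) < n_fft:
        wave = np.pad(wave, (0, n_fft - len(wave)))
    n_frames = 1 + (len(wave) - n_fft) // hop
    window = np.hanning(n_fft)
    # Slice the signal into overlapping windowed frames.
    frames = np.stack([wave[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2    # power spectrum per frame
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T   # project onto mel filters
    return np.log(mel + 1e-10)                          # small offset avoids log(0)
```

To use it, load a .wav file into a 1-D float array with any audio reader, pass it with its original sampling rate to `resample_linear`, and feed the result to `log_mel_spectrogram`.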
|
|
|
|
|
|