ehcalabres commited on
Commit
874fe61
1 Parent(s): ebe7999

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -11,9 +11,13 @@ model_index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- # wav2vec2-lg-xlsr-en-speech-emotion-recognition
15
 
16
- This model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) for a Speech Emotion Recognition (SER) task.
 
 
 
 
17
 
18
  The dataset used to fine-tune the original pre-trained model is the [RAVDESS dataset](https://zenodo.org/record/1188976#.YO6yI-gzaUk). This dataset provides 1440 samples of recordings from actors performing on 8 different emotions in English, which are:
19
 
@@ -72,6 +76,10 @@ The following hyperparameters were used during training:
72
  | 0.4581 | 2.72 | 390 | 0.4719 | 0.8467 |
73
  | 0.3967 | 2.93 | 420 | 0.5023 | 0.8223 |
74
 
 
 
 
 
75
 
76
  ### Framework versions
77
 
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ # Speech Emotion Recognition By Fine-Tuning Wav2Vec 2.0
15
 
16
+
17
+ __Important:__ This model it's not yet implementable due to missing built-in functions in HuggingFace for speech classification tasks. I'm working on the instructions of how to use it and a repository where the code will be available soon. Thank you anyway!
18
+
19
+
20
+ The model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) for a Speech Emotion Recognition (SER) task.
21
 
22
  The dataset used to fine-tune the original pre-trained model is the [RAVDESS dataset](https://zenodo.org/record/1188976#.YO6yI-gzaUk). This dataset provides 1440 samples of recordings from actors performing on 8 different emotions in English, which are:
23
 
 
76
  | 0.4581 | 2.72 | 390 | 0.4719 | 0.8467 |
77
  | 0.3967 | 2.93 | 420 | 0.5023 | 0.8223 |
78
 
79
+ ## Contact
80
+
81
+ Any doubt, contact me on [Twitter](https://twitter.com/ehcalabres) (GitHub repo soon).
82
+
83
 
84
  ### Framework versions
85