This is a speech emotion recognition model, created by fine-tuning the Wav2Vec2 XLSR model pre-trained for English.
The model was fine-tuned on the RAVDESS dataset, which can be found here.
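
RAVDESS encodes each recording's emotion in its filename as the third of seven hyphen-separated fields, so labels for fine-tuning can be read directly from the file paths. The sketch below shows one way to do that. The helper name `ravdess_emotion` and the example path are hypothetical, and the label set this model was actually trained on may differ (for example, some classes may have been merged or dropped).

```python
from pathlib import Path

# Emotion codes used in RAVDESS filenames (third hyphen-separated field).
RAVDESS_EMOTIONS = {
    "01": "neutral",
    "02": "calm",
    "03": "happy",
    "04": "sad",
    "05": "angry",
    "06": "fearful",
    "07": "disgust",
    "08": "surprised",
}


def ravdess_emotion(path):
    """Return the emotion label encoded in a RAVDESS-style filename.

    Hypothetical helper for illustration; not taken from this model's
    training code.
    """
    fields = Path(path).stem.split("-")
    if len(fields) != 7:
        raise ValueError(f"not a RAVDESS-style filename: {path}")
    return RAVDESS_EMOTIONS[fields[2]]


# Example path is illustrative only.
print(ravdess_emotion("Actor_12/03-01-06-01-02-01-12.wav"))  # prints "fearful"
```

Reading labels from filenames avoids maintaining a separate annotation file, at the cost of depending on the dataset's naming scheme staying intact.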