rmayormartins commited on
Commit
006c80e
·
verified ·
1 Parent(s): cda3f5a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +107 -3
README.md CHANGED
@@ -1,3 +1,107 @@
1
- ---
2
- license: ecl-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: ecl-2.0
3
+ datasets:
4
+ - mozilla-foundation/common_voice_11_0
5
+ language:
6
+ - en
7
+ - pt
8
+ metrics:
9
+ - accuracy
10
+ library_name: transformers
11
+ tags:
12
+ - code
13
+ ---
14
+
15
+ # Speech Portuguese (Brazilian) Accent Classifier
16
+
17
+ 🎙️🤖🇧🇷
18
+
19
+ This project is a speech accent classifier that distinguishes between Portuguese (Brazilian) and other accents.
20
+
21
+ ## Project Overview
22
+
23
+ This application uses a trained model to classify speech accents into two categories:
24
+ 1. Portuguese (Brazilian)
25
+ 2. Other
26
+
27
+ The model is based on the author's work [results] and utilizes the Portuguese portion of the Common Voice dataset (version 11.0) from Mozilla Foundation.
28
+
29
+ ## Dataset
30
+
31
+ The project uses the Portuguese subset of the Common Voice dataset:
32
+ - Dataset: "mozilla-foundation/common_voice_11_0", "pt"
33
+
34
+ Brazilian accents included in the dataset:
35
+ - Português do Brasil, Região Sul do Brasil
36
+ - Paulistano
37
+ - Paulista, Brasileiro
38
+ - Carioca
39
+ - Mato Grosso
40
+ - Mineiro
41
+ - Interior Paulista
42
+ - Gaúcho
43
+ - Nordestino
44
+ - And various regional mixes
45
+
46
+ ## Model and Processor
47
+
48
+ The project utilizes the following model and processor:
49
+ - Base Model: "facebook/wav2vec2-base-960h"
50
+ - Processor: Wav2Vec2Processor.from_pretrained
51
+
52
+ ## Model Versions
53
+
54
+ We have trained three versions of the model with different configurations:
55
+
56
+ 1. **Results 1**:
57
+ - Epochs: 3
58
+ - Training samples: 1000
59
+ - Validation samples: 200
60
+
61
+ 2. **Results 2**:
62
+ - Epochs: 10
63
+ - Training samples: 1000
64
+ - Validation samples: 500
65
+
66
+ 3. **Results 3**:
67
+ - Epochs: 20
68
+ - Training samples: 5000
69
+ - Validation samples: 1000
70
+
71
+ All models were trained using high RAM GPU on Google Colab Pro.
72
+
73
+ ## File Structure
74
+
75
+ Each version of the model includes the following files:
76
+ results/
77
+ ├── config.json
78
+ ├── preprocessor_config.json
79
+ ├── model.safetensors
80
+ ├── special_tokens_map.json
81
+ ├── tokenizer_config.json
82
+ └── vocab.json
83
+
84
+
85
+ ## How to Use
86
+
87
+ Test with recording or uploading an audio file. To test, I recommend short sentences.
88
+
89
+ ## License
90
+
91
+ This project is licensed under the Eclipse Public License 2.0 (ECL-2.0).
92
+
93
+ ## Developer Information
94
+
95
+ Developed by Ramon Mayor Martins (2024)
96
+ - Email: rmayormartins@gmail.com
97
+ - Homepage: https://rmayormartins.github.io/
98
+ - Twitter: @rmayormartins
99
+ - GitHub: https://github.com/rmayormartins
100
+
101
+ ## Acknowledgements
102
+
103
+ Special thanks to Instituto Federal de Santa Catarina (Federal Institute of Santa Catarina) IFSC-São José-Brazil.
104
+
105
+ ## Contact
106
+
107
+ For any queries or suggestions, please contact the developer using the information provided above.