GaelleLaperriere
commited on
Commit
•
5af308a
1
Parent(s):
e1a60d9
Update README.md
Browse files
README.md
CHANGED
@@ -16,14 +16,14 @@ metrics:
|
|
16 |
- cer
|
17 |
- cher
|
18 |
model-index:
|
19 |
-
- name: slu-wav2vec2-ctc-MEDIA-
|
20 |
results:
|
21 |
- task:
|
22 |
name: Spoken Language Understanding
|
23 |
type: spoken-language-understanding
|
24 |
dataset:
|
25 |
name: MEDIA
|
26 |
-
type:
|
27 |
config: fr
|
28 |
split: test
|
29 |
args:
|
@@ -31,13 +31,13 @@ model-index:
|
|
31 |
metrics:
|
32 |
- name: Test ChER
|
33 |
type: cher
|
34 |
-
value: 7.
|
35 |
- name: Test CER
|
36 |
type: cer
|
37 |
-
value:
|
38 |
- name: Test CVER
|
39 |
type: cver
|
40 |
-
value:
|
41 |
---
|
42 |
|
43 |
<iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=large&v=2" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
|
@@ -54,7 +54,7 @@ The performance of the model is the following:
|
|
54 |
|
55 |
| Release | Test ChER | Test CER | Test CVER | GPUs |
|
56 |
|:-------------:|:--------------:|:--------------:|:--------------:|:--------:|
|
57 |
-
| 22-02-23 | 7.
|
58 |
|
59 |
## Pipeline description
|
60 |
|
@@ -80,7 +80,7 @@ Please notice that we encourage you to read our tutorials and learn more about
|
|
80 |
```python
|
81 |
from speechbrain.pretrained import EncoderASR
|
82 |
|
83 |
-
asr_model = EncoderASR.from_hparams(source="speechbrain/slu-wav2vec2-ctc-MEDIA-
|
84 |
asr_model.transcribe_file('speechbrain/asr-wav2vec2-commonvoice-fr/example-fr.wav')
|
85 |
|
86 |
```
|
@@ -104,7 +104,7 @@ pip install -e .
|
|
104 |
- [Media ASR (ELRA-S0272)](https://catalogue.elra.info/en-us/repository/browse/ELRA-S0272/)
|
105 |
- [Media SLU (ELRA-E0024)](https://catalogue.elra.info/en-us/repository/browse/ELRA-E0024/)
|
106 |
- [channels.csv and concepts_full_relax.csv](https://drive.google.com/drive/u/1/folders/1z2zFZp3c0NYLFaUhhghhBakGcFdXVRyf)
|
107 |
-
4. Modify placeholders in hparams/
|
108 |
```bash
|
109 |
data_folder = !PLACEHOLDER
|
110 |
channels_path = !PLACEHOLDER
|
@@ -113,10 +113,10 @@ concepts_path = !PLACEHOLDER
|
|
113 |
5. Run Training:
|
114 |
```bash
|
115 |
cd recipes/MEDIA/SLU/CTC/
|
116 |
-
python train_hf_wav2vec.py hparams/
|
117 |
```
|
118 |
|
119 |
-
You can find our training results (models, logs, etc) [here](https://drive.google.com/drive/folders/
|
120 |
|
121 |
### Limitations
|
122 |
The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
|
|
|
16 |
- cer
|
17 |
- cher
|
18 |
model-index:
|
19 |
+
- name: slu-wav2vec2-ctc-MEDIA-full
|
20 |
results:
|
21 |
- task:
|
22 |
name: Spoken Language Understanding
|
23 |
type: spoken-language-understanding
|
24 |
dataset:
|
25 |
name: MEDIA
|
26 |
+
type: MEDIA_slu_full
|
27 |
config: fr
|
28 |
split: test
|
29 |
args:
|
|
|
31 |
metrics:
|
32 |
- name: Test ChER
|
33 |
type: cher
|
34 |
+
value: 7.78
|
35 |
- name: Test CER
|
36 |
type: cer
|
37 |
+
value: 24.88
|
38 |
- name: Test CVER
|
39 |
type: cver
|
40 |
+
value: 35.77
|
41 |
---
|
42 |
|
43 |
<iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=large&v=2" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
|
|
|
54 |
|
55 |
| Release | Test ChER | Test CER | Test CVER | GPUs |
|
56 |
|:-------------:|:--------------:|:--------------:|:--------------:|:--------:|
|
57 |
+
| 22-02-23 | 7.78 | 24.88 | 35.77 | 1xV100 32GB |
|
58 |
|
59 |
## Pipeline description
|
60 |
|
|
|
80 |
```python
|
81 |
from speechbrain.pretrained import EncoderASR
|
82 |
|
83 |
+
asr_model = EncoderASR.from_hparams(source="speechbrain/slu-wav2vec2-ctc-MEDIA-full", savedir="pretrained_models/slu-wav2vec2-ctc-MEDIA-full")
|
84 |
asr_model.transcribe_file('speechbrain/asr-wav2vec2-commonvoice-fr/example-fr.wav')
|
85 |
|
86 |
```
|
|
|
104 |
- [Media ASR (ELRA-S0272)](https://catalogue.elra.info/en-us/repository/browse/ELRA-S0272/)
|
105 |
- [Media SLU (ELRA-E0024)](https://catalogue.elra.info/en-us/repository/browse/ELRA-E0024/)
|
106 |
- [channels.csv and concepts_full_relax.csv](https://drive.google.com/drive/u/1/folders/1z2zFZp3c0NYLFaUhhghhBakGcFdXVRyf)
|
107 |
+
4. Modify placeholders in hparams/train_hf_wav2vec_full.yaml:
|
108 |
```bash
|
109 |
data_folder = !PLACEHOLDER
|
110 |
channels_path = !PLACEHOLDER
|
|
|
113 |
5. Run Training:
|
114 |
```bash
|
115 |
cd recipes/MEDIA/SLU/CTC/
|
116 |
+
python train_hf_wav2vec.py hparams/train_hf_wav2vec_full.yaml
|
117 |
```
|
118 |
|
119 |
+
You can find our training results (models, logs, etc) [here](https://drive.google.com/drive/folders/1cZRRQ3Cc_ucXXlwYSJqB7pOl6KwZdo84?usp=share_link).
|
120 |
|
121 |
### Limitations
|
122 |
The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
|