Pedro Cuenca
commited on
Commit
•
9d57df5
1
Parent(s):
a77390b
* Update README - final version of eval script.
Browse files
README.md
CHANGED
@@ -121,8 +121,8 @@ def replace_additional(batch):
|
|
121 |
|
122 |
import librosa
|
123 |
def speech_file_to_array_fn(batch):
|
124 |
-
speech_array,
|
125 |
-
batch["speech"] = librosa.resample(speech_array.squeeze().numpy(),
|
126 |
return batch
|
127 |
|
128 |
# One-pass mapping function
|
@@ -217,5 +217,7 @@ I had previously used the `transformers` library as an end user, just to try Ber
|
|
217 |
|
218 |
* The WER metric crashed on large datasets. I evaluated on a small sample (also, it's faster) and wrote an accumulative version of wer that runs on fixed memory. I'd like to verify whether this change makes sense to be used inside the training loop.
|
219 |
|
|
|
|
|
220 |
* When using `num_proc` inside a notebook, I could not see progress bars. This is surely some permissions issue in my computer. I still need to find it out.
|
221 |
|
|
|
121 |
|
122 |
import librosa
|
123 |
def speech_file_to_array_fn(batch):
|
124 |
+
speech_array, sample_rate = torchaudio.load(batch["path"])
|
125 |
+
batch["speech"] = librosa.resample(speech_array.squeeze().numpy(), sample_rate, 16_000)
|
126 |
return batch
|
127 |
|
128 |
# One-pass mapping function
|
|
|
217 |
|
218 |
* The WER metric crashed on large datasets. I evaluated on a small sample (also, it's faster) and wrote an accumulative version of wer that runs on fixed memory. I'd like to verify whether this change makes sense to be used inside the training loop.
|
219 |
|
220 |
+
* `torchaudio` deadlocks when using multiple processes. `librosa` works fine. To be investigated.
|
221 |
+
|
222 |
* When using `num_proc` inside a notebook, I could not see progress bars. This is surely some permissions issue in my computer. I still need to find it out.
|
223 |
|