Pedro Cuenca commited on
Commit
9d57df5
1 Parent(s): a77390b

* Update README - final version of eval script.

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -121,8 +121,8 @@ def replace_additional(batch):
121
 
122
  import librosa
123
  def speech_file_to_array_fn(batch):
124
- speech_array, _ = torchaudio.load(batch["path"])
125
- batch["speech"] = librosa.resample(speech_array.squeeze().numpy(), 48_000, 16_000)
126
  return batch
127
 
128
  # One-pass mapping function
@@ -217,5 +217,7 @@ I had previously used the `transformers` library as an end user, just to try Ber
217
 
218
  * The WER metric crashed on large datasets. I evaluated on a small sample (also, it's faster) and wrote an accumulative version of wer that runs on fixed memory. I'd like to verify whether this change makes sense to be used inside the training loop.
219
 
 
 
220
  * When using `num_proc` inside a notebook, I could not see progress bars. This is surely some permissions issue in my computer. I still need to find it out.
221
 
 
121
 
122
  import librosa
123
  def speech_file_to_array_fn(batch):
124
+ speech_array, sample_rate = torchaudio.load(batch["path"])
125
+ batch["speech"] = librosa.resample(speech_array.squeeze().numpy(), sample_rate, 16_000)
126
  return batch
127
 
128
  # One-pass mapping function
 
217
 
218
  * The WER metric crashed on large datasets. I evaluated on a small sample (also, it's faster) and wrote an accumulative version of wer that runs on fixed memory. I'd like to verify whether this change makes sense to be used inside the training loop.
219
 
220
+ * `torchaudio` deadlocks when using multiple processes. `librosa` works fine. To be investigated.
221
+
222
  * When using `num_proc` inside a notebook, I could not see progress bars. This is surely some permissions issue in my computer. I still need to find it out.
223