\n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"The following columns in the training set don't have a corresponding argument in `WhisperForConditionalGeneration.forward` and have been ignored: input_length. If input_length are not expected by `WhisperForConditionalGeneration.forward`, you can safely ignore this message.\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-af3fa5ae-6544-4948-a2e4-93c28572bb0a.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i89df3375-5ef7-4e49-bf31-c22a4b6d021a.mp3 failed to download. Data may be missing.\n"
]
},
{
"data": {
"text/html": [
"\n",
" \n",
" \n",
"
\n",
" [ 115/5000 18:30 < 13:20:13, 0.10 it/s, Epoch 56.00/9223372036854775807]\n",
"
\n",
" \n",
" \n",
" \n",
" Step | \n",
" Training Loss | \n",
" Validation Loss | \n",
"
\n",
" \n",
" \n",
" \n",
"
"
],
"text/plain": [
""
]
},
"metadata": {},
"output_type": "display_data"
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cb84d9a0-5212-43ff-bac2-0ede72dd35c6.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-f63d0ca4-c810-45a4-96ec-c0e0f4084706.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i24ebfbbe-1f29-43f4-87f4-9944e65d74ed.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i0e7493c6-5270-41b9-895b-224e0913d596.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i2ab4f58e-c5a8-4f02-ae58-0a72867170f5.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i388adb36-b6e7-4629-b6a6-6cd554df5bbb.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-e9b803ab-b052-4fcc-829b-c0ee3df3ec7c.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i5b6fff43-466d-4427-ada8-26ae4c654a0a.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a0116d50-25ae-43e7-a05a-11ca7ebe590e.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i47391d09-db79-4c56-9d6d-88577d5beb3b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cdb41602-9bd7-4479-8116-898314069dbf.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-f68637d5-7d47-440c-8677-783cdd82807a.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-b86db51f-943b-4e95-a9ca-2600c5f33571.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-c984c83d-ec4c-4999-acc4-d62d78b2c1a0.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i7628fa79-e6dc-4733-9a51-42f7c41f5a4d.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i66c18c70-3642-484f-9b5e-dfc4b15f6b4b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-f06d4cfc-7b65-460f-9743-2181d5f5401e.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i5d2e4b46-a1c7-4e04-b69d-b069f3aa18b7.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a21cf243-f64c-47b2-bc25-f4e831f88cf1.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i88dc2742-243e-44c5-bc35-27cbdd3e28eb.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i960f99d6-1433-4639-9580-faeb02c618fd.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a7630fad-0635-4313-86f1-943da03f2e68.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i1ca42afd-ce11-463d-8abf-304cb26d46cb.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-ec26fe3a-1a02-423d-8d33-bb3f70fa3b7b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i118e8f74-2b29-471f-b718-fe8d2b296dea.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i0740bf17-14b0-44a8-95e5-633632ee1221.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i3d30e962-c261-41db-9768-ee953e86138d.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cdb41602-9bd7-4479-8116-898314069dbf.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a4cf9098-e6c6-4515-b423-1235170f4007.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i82384eae-0c6b-4553-8187-00541bd4ab68.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i86475264-f3b6-46ff-8da8-8320845f3132.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-de5c4745-2d3f-4871-ac53-3ad0d539126b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9d08aac1-f029-489f-8aaf-cfc65168666c.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i627d0769-11e1-41aa-8692-2a99313cfdbc.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-f1b0b533-f481-4d46-b802-764a7f3ec812.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-c9d28e99-1723-41a0-b412-a55bcfff7969.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-d2c6e696-84d7-4e66-840e-75fd4515cc71.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-ca4cd50a-de0d-4c7e-8521-ea535017aac4.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cf904aaf-29de-4082-af34-bcba42d6f214.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-fafc940f-86b0-4d7f-bd59-24ea3ce34955.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i20ecad13-a438-48da-8514-2c7acc8c3185.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9ddbea05-0ce7-4d40-84f0-2fc1eb751a76.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i2f0bd373-ed2a-4518-a307-b0d81d8faa07.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i84964b1f-3457-4e90-93b2-fad25d3617d0.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i24bb6122-b17c-473d-9046-649870e8100c.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i67c1ac4c-f4c2-4945-8106-dec4f219f1ff.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i118e8f74-2b29-471f-b718-fe8d2b296dea.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9f6b65f7-f81b-4a92-ab87-28e048eba279.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i71bc773b-1d7c-47bc-b767-7b4ee4dbb6ba.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i3f0504d9-deb1-4e54-b1bd-c077cbe02a7b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i7caca876-f015-4740-bbeb-8956da76fe38.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i61910a52-aa3a-4afe-bac5-c9f47647e6f3.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i2158ee58-6974-4172-9db9-040ee28b791f.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i84964b1f-3457-4e90-93b2-fad25d3617d0.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-b5a6ce88-575a-47ff-93e8-6937d976d820.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-de62d0c9-f250-4e87-b94b-c5af54a61dab.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i0740bf17-14b0-44a8-95e5-633632ee1221.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i990db079-9380-4a98-82d0-d475e508eee7.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i81592bb8-89c4-4831-80ab-fc20a1a6b0c9.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i61ef7c65-78d2-434e-b381-c9a07cec8f22.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-b5a6ce88-575a-47ff-93e8-6937d976d820.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i996ba5f7-da3c-454f-960e-3e2e25d0947f.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i6b97833d-fd7d-4bde-a21c-cf842a316a5d.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i7caca876-f015-4740-bbeb-8956da76fe38.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i788396e8-f7fc-4bdb-8521-6f0574b7cb9c.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i72cae8f4-cf18-4a31-8ae9-5c4dcd7ad553.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i7f363e42-caee-420f-8280-7f0a448a8297.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i15b8d06b-0b1f-4b07-9a74-a2fea3eafe6e.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-e8abc0ee-6e67-4954-8183-f47009a17065.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i304e022a-1a50-4e28-bc1f-4db5ef1491fc.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-b86db51f-943b-4e95-a9ca-2600c5f33571.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9860426b-fdf7-4bc6-8e62-2944dac3a87b.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i3789caa7-1f03-4abb-a8ef-884d2025a441.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i44706917-2f4c-46d7-95b5-82ab99549552.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cae2cad0-4f37-4711-ad2b-ee1465f9e97b.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i2f9c5b7b-6533-47c6-802b-4fb9a845b9fd.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i996ba5f7-da3c-454f-960e-3e2e25d0947f.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i3968e429-8460-4d96-8d71-0359ab10f12e.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i672bd3aa-9303-433d-bfc9-416c9e859838.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i89a0bf33-e8bf-4e79-9f70-dc672d8b5746.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i497e2f3b-51b4-4b37-a3fc-24053d755707.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i5d6d1a01-c522-434a-8591-9bb3479f6183.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cb5669ac-d537-4f75-b27b-30b8c38031ea.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i5d2e4b46-a1c7-4e04-b69d-b069f3aa18b7.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a01a8329-55cc-43f2-9f31-b1d88a3a17ed.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a0116d50-25ae-43e7-a05a-11ca7ebe590e.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i29e327ae-fd2f-474e-8d7b-b24e6eaac7cd.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9d270ab7-9416-40b9-8b11-991fd169b94e.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-fc9aab86-fe11-46c6-9148-1d91357bd238.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i00c970d6-388d-4195-af16-a019a2177686.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a7630fad-0635-4313-86f1-943da03f2e68.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-fcc09d40-56ba-4ada-9dcb-6e1f8c9cb3ff.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i79d705fd-796d-47b5-ba78-4b82c7bd57a9.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i0cf4886a-c491-444f-8a79-4a0581ea5f3c.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i138571f4-bc8f-4378-acb3-708bbdabee22.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i2ab4f58e-c5a8-4f02-ae58-0a72867170f5.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-e1a7fd07-eee3-42ff-91a1-91a3d2fb6294.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-f4f66771-37e3-44fa-93c6-749604aeb4d1.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i388adb36-b6e7-4629-b6a6-6cd554df5bbb.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i0cf4886a-c491-444f-8a79-4a0581ea5f3c.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-a56b1a3f-f410-4813-ad8e-484aad226182.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i90f7c3c2-84b8-4c5b-a323-c01f4d854260.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-e2583a7e-a10e-48ea-a575-02c1b7476e07.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i3b21c151-87fb-4e97-8ccf-517986ddb1c4.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-cd03999e-af74-4ddc-8f88-be75cfafff28.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i162bdf3c-4447-4fbc-8660-cf0ba184102b.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i19d9ef42-b73c-4313-ab7d-24d885fc10ae.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i19d9ef42-b73c-4313-ab7d-24d885fc10ae.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3 failed to download. Data may be missing.\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:315: UserWarning: \n",
"To support 'mp3' decoding with `torchaudio>=0.12.0`, please install `ffmpeg4` system package. On Google Colab you can run:\n",
"\n",
"\t!add-apt-repository -y ppa:jonathonf/ffmpeg-4 && apt update && apt install -y ffmpeg\n",
"\n",
"and restart your runtime. Alternatively, you can downgrade `torchaudio`:\n",
"\n",
"\tpip install \"torchaudio<0.12\"`.\n",
"\n",
"Otherwise 'mp3' files will be decoded with `librosa`.\n",
" warnings.warn(\n",
"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:336: UserWarning: Decoding mp3 with `librosa` instead of `torchaudio`, decoding might be slow.\n",
" warnings.warn(\"Decoding mp3 with `librosa` instead of `torchaudio`, decoding might be slow.\")\n",
"/home/ubuntu/hf_env/lib/python3.8/site-packages/librosa/util/decorators.py:88: UserWarning: PySoundFile failed. Trying audioread instead.\n",
" return f(*args, **kwargs)\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n",
"Exception ignored in: \n",
"Traceback (most recent call last):\n",
" File \"/home/ubuntu/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py\", line 137, in __iter__\n",
" yield from self.generate_examples_fn(**kwargs_with_shuffled_shards)\n",
"RuntimeError: generator ignored GeneratorExit\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i67d6d1a6-7974-46b3-9f28-b0df66f9dfab.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i470a80d8-f952-4abd-9ca3-78979ed837a6.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3 failed to download. Data may be missing.\n",
"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i9860426b-fdf7-4bc6-8e62-2944dac3a87b.mp3 failed to download. Data may be missing.\n"
]
},
{
"ename": "FileNotFoundError",
"evalue": "[Errno 2] No such file or directory: 'https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3'",
"output_type": "error",
"traceback": [
"\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[0;31mRuntimeError\u001b[0m Traceback (most recent call last)",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:311\u001b[0m, in \u001b[0;36mAudio._decode_mp3\u001b[0;34m(self, path_or_file)\u001b[0m\n\u001b[1;32m 310\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m: \u001b[38;5;66;03m# try torchaudio anyway because sometimes it works (depending on the os and os packages installed)\u001b[39;00m\n\u001b[0;32m--> 311\u001b[0m array, sampling_rate \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_decode_mp3_torchaudio\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath_or_file\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 312\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mRuntimeError\u001b[39;00m:\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:352\u001b[0m, in \u001b[0;36mAudio._decode_mp3_torchaudio\u001b[0;34m(self, path_or_file)\u001b[0m\n\u001b[1;32m 350\u001b[0m \u001b[38;5;28;01mimport\u001b[39;00m \u001b[38;5;21;01mtorchaudio\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mtransforms\u001b[39;00m \u001b[38;5;28;01mas\u001b[39;00m \u001b[38;5;21;01mT\u001b[39;00m\n\u001b[0;32m--> 352\u001b[0m array, sampling_rate \u001b[38;5;241m=\u001b[39m \u001b[43mtorchaudio\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mload\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath_or_file\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43mformat\u001b[39;49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43mmp3\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m)\u001b[49m\n\u001b[1;32m 353\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39msampling_rate \u001b[38;5;129;01mand\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39msampling_rate \u001b[38;5;241m!=\u001b[39m sampling_rate:\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/torchaudio/backend/sox_io_backend.py:246\u001b[0m, in \u001b[0;36mload\u001b[0;34m(filepath, frame_offset, num_frames, normalize, channels_first, format)\u001b[0m\n\u001b[1;32m 245\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m ret\n\u001b[0;32m--> 246\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43m_fallback_load\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfilepath\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mframe_offset\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mnum_frames\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mnormalize\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mchannels_first\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43mformat\u001b[39;49m\u001b[43m)\u001b[49m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/torchaudio/io/_compat.py:103\u001b[0m, in \u001b[0;36mload_audio\u001b[0;34m(src, frame_offset, num_frames, convert, channels_first, format)\u001b[0m\n\u001b[1;32m 95\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mload_audio\u001b[39m(\n\u001b[1;32m 96\u001b[0m src: \u001b[38;5;28mstr\u001b[39m,\n\u001b[1;32m 97\u001b[0m frame_offset: \u001b[38;5;28mint\u001b[39m \u001b[38;5;241m=\u001b[39m \u001b[38;5;241m0\u001b[39m,\n\u001b[0;32m (...)\u001b[0m\n\u001b[1;32m 101\u001b[0m \u001b[38;5;28mformat\u001b[39m: Optional[\u001b[38;5;28mstr\u001b[39m] \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m,\n\u001b[1;32m 102\u001b[0m ) \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m>\u001b[39m Tuple[torch\u001b[38;5;241m.\u001b[39mTensor, \u001b[38;5;28mint\u001b[39m]:\n\u001b[0;32m--> 103\u001b[0m s \u001b[38;5;241m=\u001b[39m \u001b[43mtorch\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mclasses\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mtorchaudio\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mffmpeg_StreamReader\u001b[49m\u001b[43m(\u001b[49m\u001b[43msrc\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43mformat\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43;01mNone\u001b[39;49;00m\u001b[43m)\u001b[49m\n\u001b[1;32m 104\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m _load_audio(s, frame_offset, num_frames, convert, channels_first)\n",
"\u001b[0;31mRuntimeError\u001b[0m: Failed to open the input \"https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3\" (Input/output error).",
"\nDuring handling of the above exception, another exception occurred:\n",
"\u001b[0;31mLibsndfileError\u001b[0m Traceback (most recent call last)",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/librosa/core/audio.py:164\u001b[0m, in \u001b[0;36mload\u001b[0;34m(path, sr, mono, offset, duration, dtype, res_type)\u001b[0m\n\u001b[1;32m 163\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m--> 164\u001b[0m y, sr_native \u001b[38;5;241m=\u001b[39m \u001b[43m__soundfile_load\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43moffset\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mduration\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mdtype\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 166\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mRuntimeError\u001b[39;00m \u001b[38;5;28;01mas\u001b[39;00m exc:\n\u001b[1;32m 167\u001b[0m \u001b[38;5;66;03m# If soundfile failed, try audioread instead\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/librosa/core/audio.py:195\u001b[0m, in \u001b[0;36m__soundfile_load\u001b[0;34m(path, offset, duration, dtype)\u001b[0m\n\u001b[1;32m 193\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m 194\u001b[0m \u001b[38;5;66;03m# Otherwise, create the soundfile object\u001b[39;00m\n\u001b[0;32m--> 195\u001b[0m context \u001b[38;5;241m=\u001b[39m \u001b[43msf\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mSoundFile\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 197\u001b[0m \u001b[38;5;28;01mwith\u001b[39;00m context \u001b[38;5;28;01mas\u001b[39;00m sf_desc:\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/soundfile.py:655\u001b[0m, in \u001b[0;36mSoundFile.__init__\u001b[0;34m(self, file, mode, samplerate, channels, subtype, endian, format, closefd)\u001b[0m\n\u001b[1;32m 653\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_info \u001b[38;5;241m=\u001b[39m _create_info_struct(file, mode, samplerate, channels,\n\u001b[1;32m 654\u001b[0m \u001b[38;5;28mformat\u001b[39m, subtype, endian)\n\u001b[0;32m--> 655\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_file \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_open\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfile\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mmode_int\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mclosefd\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 656\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mset\u001b[39m(mode)\u001b[38;5;241m.\u001b[39missuperset(\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mr+\u001b[39m\u001b[38;5;124m'\u001b[39m) \u001b[38;5;129;01mand\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mseekable():\n\u001b[1;32m 657\u001b[0m \u001b[38;5;66;03m# Move write position to 0 (like in Python file objects)\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/soundfile.py:1213\u001b[0m, in \u001b[0;36mSoundFile._open\u001b[0;34m(self, file, mode_int, closefd)\u001b[0m\n\u001b[1;32m 1212\u001b[0m err \u001b[38;5;241m=\u001b[39m _snd\u001b[38;5;241m.\u001b[39msf_error(file_ptr)\n\u001b[0;32m-> 1213\u001b[0m \u001b[38;5;28;01mraise\u001b[39;00m LibsndfileError(err, prefix\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mError opening \u001b[39m\u001b[38;5;132;01m{0!r}\u001b[39;00m\u001b[38;5;124m: \u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;241m.\u001b[39mformat(\u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mname))\n\u001b[1;32m 1214\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m mode_int \u001b[38;5;241m==\u001b[39m _snd\u001b[38;5;241m.\u001b[39mSFM_WRITE:\n\u001b[1;32m 1215\u001b[0m \u001b[38;5;66;03m# Due to a bug in libsndfile version <= 1.0.25, frames != 0\u001b[39;00m\n\u001b[1;32m 1216\u001b[0m \u001b[38;5;66;03m# when opening a named pipe in SFM_WRITE mode.\u001b[39;00m\n\u001b[1;32m 1217\u001b[0m \u001b[38;5;66;03m# See http://github.com/erikd/libsndfile/issues/77.\u001b[39;00m\n",
"\u001b[0;31mLibsndfileError\u001b[0m: Error opening 'https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3': System error.",
"\nDuring handling of the above exception, another exception occurred:\n",
"\u001b[0;31mFileNotFoundError\u001b[0m Traceback (most recent call last)",
"Cell \u001b[0;32mIn[47], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m \u001b[43mtrainer\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mtrain\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/transformers/trainer.py:1534\u001b[0m, in \u001b[0;36mTrainer.train\u001b[0;34m(self, resume_from_checkpoint, trial, ignore_keys_for_eval, **kwargs)\u001b[0m\n\u001b[1;32m 1529\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mmodel_wrapped \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mmodel\n\u001b[1;32m 1531\u001b[0m inner_training_loop \u001b[38;5;241m=\u001b[39m find_executable_batch_size(\n\u001b[1;32m 1532\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_inner_training_loop, \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_train_batch_size, args\u001b[38;5;241m.\u001b[39mauto_find_batch_size\n\u001b[1;32m 1533\u001b[0m )\n\u001b[0;32m-> 1534\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43minner_training_loop\u001b[49m\u001b[43m(\u001b[49m\n\u001b[1;32m 1535\u001b[0m \u001b[43m \u001b[49m\u001b[43margs\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43margs\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m 1536\u001b[0m \u001b[43m \u001b[49m\u001b[43mresume_from_checkpoint\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mresume_from_checkpoint\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m 1537\u001b[0m \u001b[43m \u001b[49m\u001b[43mtrial\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mtrial\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m 1538\u001b[0m \u001b[43m \u001b[49m\u001b[43mignore_keys_for_eval\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mignore_keys_for_eval\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m 1539\u001b[0m \u001b[43m\u001b[49m\u001b[43m)\u001b[49m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/transformers/trainer.py:1756\u001b[0m, in \u001b[0;36mTrainer._inner_training_loop\u001b[0;34m(self, batch_size, args, resume_from_checkpoint, trial, ignore_keys_for_eval)\u001b[0m\n\u001b[1;32m 1753\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_load_rng_state(resume_from_checkpoint)\n\u001b[1;32m 1755\u001b[0m step \u001b[38;5;241m=\u001b[39m \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m1\u001b[39m\n\u001b[0;32m-> 1756\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m step, inputs \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28menumerate\u001b[39m(epoch_iterator):\n\u001b[1;32m 1757\u001b[0m \n\u001b[1;32m 1758\u001b[0m \u001b[38;5;66;03m# Skip past any already trained steps if resuming training\u001b[39;00m\n\u001b[1;32m 1759\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m steps_trained_in_current_epoch \u001b[38;5;241m>\u001b[39m \u001b[38;5;241m0\u001b[39m:\n\u001b[1;32m 1760\u001b[0m steps_trained_in_current_epoch \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m=\u001b[39m \u001b[38;5;241m1\u001b[39m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/torch/utils/data/dataloader.py:628\u001b[0m, in \u001b[0;36m_BaseDataLoaderIter.__next__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 625\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_sampler_iter \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[1;32m 626\u001b[0m \u001b[38;5;66;03m# TODO(https://github.com/pytorch/pytorch/issues/76750)\u001b[39;00m\n\u001b[1;32m 627\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_reset() \u001b[38;5;66;03m# type: ignore[call-arg]\u001b[39;00m\n\u001b[0;32m--> 628\u001b[0m data \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_next_data\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 629\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_num_yielded \u001b[38;5;241m+\u001b[39m\u001b[38;5;241m=\u001b[39m \u001b[38;5;241m1\u001b[39m\n\u001b[1;32m 630\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_dataset_kind \u001b[38;5;241m==\u001b[39m _DatasetKind\u001b[38;5;241m.\u001b[39mIterable \u001b[38;5;129;01mand\u001b[39;00m \\\n\u001b[1;32m 631\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_IterableDataset_len_called \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m \\\n\u001b[1;32m 632\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_num_yielded \u001b[38;5;241m>\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_IterableDataset_len_called:\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/torch/utils/data/dataloader.py:671\u001b[0m, in \u001b[0;36m_SingleProcessDataLoaderIter._next_data\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 669\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m_next_data\u001b[39m(\u001b[38;5;28mself\u001b[39m):\n\u001b[1;32m 670\u001b[0m index \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_next_index() \u001b[38;5;66;03m# may raise StopIteration\u001b[39;00m\n\u001b[0;32m--> 671\u001b[0m data \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_dataset_fetcher\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfetch\u001b[49m\u001b[43m(\u001b[49m\u001b[43mindex\u001b[49m\u001b[43m)\u001b[49m \u001b[38;5;66;03m# may raise StopIteration\u001b[39;00m\n\u001b[1;32m 672\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_pin_memory:\n\u001b[1;32m 673\u001b[0m data \u001b[38;5;241m=\u001b[39m _utils\u001b[38;5;241m.\u001b[39mpin_memory\u001b[38;5;241m.\u001b[39mpin_memory(data, \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_pin_memory_device)\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/torch/utils/data/_utils/fetch.py:34\u001b[0m, in \u001b[0;36m_IterableDatasetFetcher.fetch\u001b[0;34m(self, possibly_batched_index)\u001b[0m\n\u001b[1;32m 32\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m _ \u001b[38;5;129;01min\u001b[39;00m possibly_batched_index:\n\u001b[1;32m 33\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m---> 34\u001b[0m data\u001b[38;5;241m.\u001b[39mappend(\u001b[38;5;28;43mnext\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdataset_iter\u001b[49m\u001b[43m)\u001b[49m)\n\u001b[1;32m 35\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mStopIteration\u001b[39;00m:\n\u001b[1;32m 36\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mended \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mTrue\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:849\u001b[0m, in \u001b[0;36mIterableDataset.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 846\u001b[0m \u001b[38;5;28;01myield from\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_iter_pytorch(worker_info)\n\u001b[1;32m 847\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m\n\u001b[0;32m--> 849\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_iter():\n\u001b[1;32m 850\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mfeatures:\n\u001b[1;32m 851\u001b[0m \u001b[38;5;66;03m# `IterableDataset` automatically fills missing columns with None.\u001b[39;00m\n\u001b[1;32m 852\u001b[0m \u001b[38;5;66;03m# This is done with `_apply_feature_types_on_example`.\u001b[39;00m\n\u001b[1;32m 853\u001b[0m \u001b[38;5;28;01myield\u001b[39;00m _apply_feature_types_on_example(\n\u001b[1;32m 854\u001b[0m example, \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mfeatures, token_per_repo_id\u001b[38;5;241m=\u001b[39m\u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_token_per_repo_id\n\u001b[1;32m 855\u001b[0m )\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:791\u001b[0m, in \u001b[0;36mIterableDataset._iter\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 789\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m 790\u001b[0m ex_iterable \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_ex_iterable\n\u001b[0;32m--> 791\u001b[0m \u001b[38;5;28;01myield from\u001b[39;00m ex_iterable\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:522\u001b[0m, in \u001b[0;36mFilteredExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 520\u001b[0m current_idx \u001b[38;5;241m+\u001b[39m\u001b[38;5;241m=\u001b[39m batch_idx \u001b[38;5;241m+\u001b[39m \u001b[38;5;241m1\u001b[39m\n\u001b[1;32m 521\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[0;32m--> 522\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m iterator:\n\u001b[1;32m 523\u001b[0m \u001b[38;5;66;03m# If not batched, we can apply the filtering function direcly\u001b[39;00m\n\u001b[1;32m 524\u001b[0m inputs \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mdict\u001b[39m(example)\n\u001b[1;32m 525\u001b[0m function_args \u001b[38;5;241m=\u001b[39m [inputs] \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39minput_columns \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;28;01melse\u001b[39;00m [inputs[col] \u001b[38;5;28;01mfor\u001b[39;00m col \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39minput_columns]\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:577\u001b[0m, in \u001b[0;36mBufferShuffledExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 575\u001b[0m \u001b[38;5;66;03m# this is the shuffle buffer that we keep in memory\u001b[39;00m\n\u001b[1;32m 576\u001b[0m mem_buffer \u001b[38;5;241m=\u001b[39m []\n\u001b[0;32m--> 577\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m x \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mex_iterable:\n\u001b[1;32m 578\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mlen\u001b[39m(mem_buffer) \u001b[38;5;241m==\u001b[39m buffer_size: \u001b[38;5;66;03m# if the buffer is full, pick and example from it\u001b[39;00m\n\u001b[1;32m 579\u001b[0m i \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mnext\u001b[39m(indices_iterator)\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:430\u001b[0m, in \u001b[0;36mMappedExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 428\u001b[0m current_idx \u001b[38;5;241m+\u001b[39m\u001b[38;5;241m=\u001b[39m batch_idx \u001b[38;5;241m+\u001b[39m \u001b[38;5;241m1\u001b[39m\n\u001b[1;32m 429\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[0;32m--> 430\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m iterator:\n\u001b[1;32m 431\u001b[0m \u001b[38;5;66;03m# If not batched, we can apply the transform and yield the example directly\u001b[39;00m\n\u001b[1;32m 432\u001b[0m \u001b[38;5;66;03m# first copy the example, since we might drop some keys\u001b[39;00m\n\u001b[1;32m 433\u001b[0m example \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mdict\u001b[39m(example)\n\u001b[1;32m 434\u001b[0m \u001b[38;5;66;03m# then apply the transform\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:687\u001b[0m, in \u001b[0;36mTypedExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 684\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__iter__\u001b[39m(\u001b[38;5;28mself\u001b[39m):\n\u001b[1;32m 685\u001b[0m \u001b[38;5;66;03m# Then for each example, `TypedExamplesIterable` automatically fills missing columns with None.\u001b[39;00m\n\u001b[1;32m 686\u001b[0m \u001b[38;5;66;03m# This is done with `_apply_feature_types_on_example`.\u001b[39;00m\n\u001b[0;32m--> 687\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mex_iterable:\n\u001b[1;32m 688\u001b[0m \u001b[38;5;28;01myield\u001b[39;00m key, _apply_feature_types_on_example(\n\u001b[1;32m 689\u001b[0m example, \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mfeatures, token_per_repo_id\u001b[38;5;241m=\u001b[39m\u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mtoken_per_repo_id\n\u001b[1;32m 690\u001b[0m )\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:430\u001b[0m, in \u001b[0;36mMappedExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 428\u001b[0m current_idx \u001b[38;5;241m+\u001b[39m\u001b[38;5;241m=\u001b[39m batch_idx \u001b[38;5;241m+\u001b[39m \u001b[38;5;241m1\u001b[39m\n\u001b[1;32m 429\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[0;32m--> 430\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m iterator:\n\u001b[1;32m 431\u001b[0m \u001b[38;5;66;03m# If not batched, we can apply the transform and yield the example directly\u001b[39;00m\n\u001b[1;32m 432\u001b[0m \u001b[38;5;66;03m# first copy the example, since we might drop some keys\u001b[39;00m\n\u001b[1;32m 433\u001b[0m example \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mdict\u001b[39m(example)\n\u001b[1;32m 434\u001b[0m \u001b[38;5;66;03m# then apply the transform\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:688\u001b[0m, in \u001b[0;36mTypedExamplesIterable.__iter__\u001b[0;34m(self)\u001b[0m\n\u001b[1;32m 684\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__iter__\u001b[39m(\u001b[38;5;28mself\u001b[39m):\n\u001b[1;32m 685\u001b[0m \u001b[38;5;66;03m# Then for each example, `TypedExamplesIterable` automatically fills missing columns with None.\u001b[39;00m\n\u001b[1;32m 686\u001b[0m \u001b[38;5;66;03m# This is done with `_apply_feature_types_on_example`.\u001b[39;00m\n\u001b[1;32m 687\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m key, example \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mex_iterable:\n\u001b[0;32m--> 688\u001b[0m \u001b[38;5;28;01myield\u001b[39;00m key, \u001b[43m_apply_feature_types_on_example\u001b[49m\u001b[43m(\u001b[49m\n\u001b[1;32m 689\u001b[0m \u001b[43m \u001b[49m\u001b[43mexample\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfeatures\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mtoken_per_repo_id\u001b[49m\n\u001b[1;32m 690\u001b[0m \u001b[43m \u001b[49m\u001b[43m)\u001b[49m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/iterable_dataset.py:653\u001b[0m, in \u001b[0;36m_apply_feature_types_on_example\u001b[0;34m(example, features, token_per_repo_id)\u001b[0m\n\u001b[1;32m 651\u001b[0m encoded_example \u001b[38;5;241m=\u001b[39m features\u001b[38;5;241m.\u001b[39mencode_example(example)\n\u001b[1;32m 652\u001b[0m \u001b[38;5;66;03m# Decode example for Audio feature, e.g.\u001b[39;00m\n\u001b[0;32m--> 653\u001b[0m decoded_example \u001b[38;5;241m=\u001b[39m \u001b[43mfeatures\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdecode_example\u001b[49m\u001b[43m(\u001b[49m\u001b[43mencoded_example\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 654\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m decoded_example\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/features.py:1835\u001b[0m, in \u001b[0;36mFeatures.decode_example\u001b[0;34m(self, example, token_per_repo_id)\u001b[0m\n\u001b[1;32m 1821\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mdecode_example\u001b[39m(\u001b[38;5;28mself\u001b[39m, example: \u001b[38;5;28mdict\u001b[39m, token_per_repo_id: Optional[Dict[\u001b[38;5;28mstr\u001b[39m, Union[\u001b[38;5;28mstr\u001b[39m, \u001b[38;5;28mbool\u001b[39m, \u001b[38;5;28;01mNone\u001b[39;00m]]] \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m):\n\u001b[1;32m 1822\u001b[0m \u001b[38;5;124;03m\"\"\"Decode example with custom feature decoding.\u001b[39;00m\n\u001b[1;32m 1823\u001b[0m \n\u001b[1;32m 1824\u001b[0m \u001b[38;5;124;03m Args:\u001b[39;00m\n\u001b[0;32m (...)\u001b[0m\n\u001b[1;32m 1832\u001b[0m \u001b[38;5;124;03m `dict[str, Any]`\u001b[39;00m\n\u001b[1;32m 1833\u001b[0m \u001b[38;5;124;03m \"\"\"\u001b[39;00m\n\u001b[0;32m-> 1835\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m {\n\u001b[1;32m 1836\u001b[0m column_name: decode_nested_example(feature, value, token_per_repo_id\u001b[38;5;241m=\u001b[39mtoken_per_repo_id)\n\u001b[1;32m 1837\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_column_requires_decoding[column_name]\n\u001b[1;32m 1838\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m value\n\u001b[1;32m 1839\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m column_name, (feature, value) \u001b[38;5;129;01min\u001b[39;00m zip_dict(\n\u001b[1;32m 1840\u001b[0m {key: value \u001b[38;5;28;01mfor\u001b[39;00m key, value \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mitems() \u001b[38;5;28;01mif\u001b[39;00m key \u001b[38;5;129;01min\u001b[39;00m example}, example\n\u001b[1;32m 1841\u001b[0m )\n\u001b[1;32m 1842\u001b[0m }\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/features.py:1836\u001b[0m, in \u001b[0;36m\u001b[0;34m(.0)\u001b[0m\n\u001b[1;32m 1821\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mdecode_example\u001b[39m(\u001b[38;5;28mself\u001b[39m, example: \u001b[38;5;28mdict\u001b[39m, token_per_repo_id: Optional[Dict[\u001b[38;5;28mstr\u001b[39m, Union[\u001b[38;5;28mstr\u001b[39m, \u001b[38;5;28mbool\u001b[39m, \u001b[38;5;28;01mNone\u001b[39;00m]]] \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m):\n\u001b[1;32m 1822\u001b[0m \u001b[38;5;124;03m\"\"\"Decode example with custom feature decoding.\u001b[39;00m\n\u001b[1;32m 1823\u001b[0m \n\u001b[1;32m 1824\u001b[0m \u001b[38;5;124;03m Args:\u001b[39;00m\n\u001b[0;32m (...)\u001b[0m\n\u001b[1;32m 1832\u001b[0m \u001b[38;5;124;03m `dict[str, Any]`\u001b[39;00m\n\u001b[1;32m 1833\u001b[0m \u001b[38;5;124;03m \"\"\"\u001b[39;00m\n\u001b[1;32m 1835\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m {\n\u001b[0;32m-> 1836\u001b[0m column_name: \u001b[43mdecode_nested_example\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfeature\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mvalue\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 1837\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_column_requires_decoding[column_name]\n\u001b[1;32m 1838\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m value\n\u001b[1;32m 1839\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m column_name, (feature, value) \u001b[38;5;129;01min\u001b[39;00m zip_dict(\n\u001b[1;32m 1840\u001b[0m {key: value \u001b[38;5;28;01mfor\u001b[39;00m key, value \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mitems() \u001b[38;5;28;01mif\u001b[39;00m key \u001b[38;5;129;01min\u001b[39;00m example}, example\n\u001b[1;32m 1841\u001b[0m )\n\u001b[1;32m 1842\u001b[0m }\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/features.py:1298\u001b[0m, in \u001b[0;36mdecode_nested_example\u001b[0;34m(schema, obj, token_per_repo_id)\u001b[0m\n\u001b[1;32m 1295\u001b[0m \u001b[38;5;28;01melif\u001b[39;00m \u001b[38;5;28misinstance\u001b[39m(schema, (Audio, Image)):\n\u001b[1;32m 1296\u001b[0m \u001b[38;5;66;03m# we pass the token to read and decode files from private repositories in streaming mode\u001b[39;00m\n\u001b[1;32m 1297\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m obj \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m schema\u001b[38;5;241m.\u001b[39mdecode:\n\u001b[0;32m-> 1298\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mschema\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdecode_example\u001b[49m\u001b[43m(\u001b[49m\u001b[43mobj\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mtoken_per_repo_id\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 1299\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m obj\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:154\u001b[0m, in \u001b[0;36mAudio.decode_example\u001b[0;34m(self, value, token_per_repo_id)\u001b[0m\n\u001b[1;32m 152\u001b[0m \u001b[38;5;28;01mraise\u001b[39;00m \u001b[38;5;167;01mValueError\u001b[39;00m(\u001b[38;5;124mf\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mAn audio sample should have one of \u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mpath\u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124m or \u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mbytes\u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124m but both are None in \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mvalue\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m.\u001b[39m\u001b[38;5;124m\"\u001b[39m)\n\u001b[1;32m 153\u001b[0m \u001b[38;5;28;01melif\u001b[39;00m path \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m path\u001b[38;5;241m.\u001b[39mendswith(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mmp3\u001b[39m\u001b[38;5;124m\"\u001b[39m):\n\u001b[0;32m--> 154\u001b[0m array, sampling_rate \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_decode_mp3\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfile\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43;01mif\u001b[39;49;00m\u001b[43m \u001b[49m\u001b[43mfile\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;28;43;01melse\u001b[39;49;00m\u001b[43m \u001b[49m\u001b[43mpath\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 155\u001b[0m \u001b[38;5;28;01melif\u001b[39;00m path \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m path\u001b[38;5;241m.\u001b[39mendswith(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mopus\u001b[39m\u001b[38;5;124m\"\u001b[39m):\n\u001b[1;32m 156\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m file:\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:339\u001b[0m, in \u001b[0;36mAudio._decode_mp3\u001b[0;34m(self, path_or_file)\u001b[0m\n\u001b[1;32m 337\u001b[0m _librosa_warned \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mTrue\u001b[39;00m\n\u001b[1;32m 338\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m--> 339\u001b[0m array, sampling_rate \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_decode_mp3_librosa\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath_or_file\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 340\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mRuntimeError\u001b[39;00m \u001b[38;5;28;01mas\u001b[39;00m err:\n\u001b[1;32m 341\u001b[0m \u001b[38;5;28;01mraise\u001b[39;00m \u001b[38;5;167;01mRuntimeError\u001b[39;00m(\n\u001b[1;32m 342\u001b[0m \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mDecoding of \u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mmp3\u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124m failed, probably because of streaming mode \u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m 343\u001b[0m \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124m(`librosa` cannot decode \u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mmp3\u001b[39m\u001b[38;5;124m'\u001b[39m\u001b[38;5;124m file-like objects, only path-like).\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m 344\u001b[0m ) \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01merr\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/datasets/features/audio.py:373\u001b[0m, in \u001b[0;36mAudio._decode_mp3_librosa\u001b[0;34m(self, path_or_file)\u001b[0m\n\u001b[1;32m 371\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m 372\u001b[0m _audioread_warned \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;01mTrue\u001b[39;00m\n\u001b[0;32m--> 373\u001b[0m array, sampling_rate \u001b[38;5;241m=\u001b[39m \u001b[43mlibrosa\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mload\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath_or_file\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mmono\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mmono\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43msr\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43msampling_rate\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 375\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m array, sampling_rate\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/librosa/util/decorators.py:88\u001b[0m, in \u001b[0;36mdeprecate_positional_args.._inner_deprecate_positional_args..inner_f\u001b[0;34m(*args, **kwargs)\u001b[0m\n\u001b[1;32m 86\u001b[0m extra_args \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mlen\u001b[39m(args) \u001b[38;5;241m-\u001b[39m \u001b[38;5;28mlen\u001b[39m(all_args)\n\u001b[1;32m 87\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m extra_args \u001b[38;5;241m<\u001b[39m\u001b[38;5;241m=\u001b[39m \u001b[38;5;241m0\u001b[39m:\n\u001b[0;32m---> 88\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mf\u001b[49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43margs\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 90\u001b[0m \u001b[38;5;66;03m# extra_args > 0\u001b[39;00m\n\u001b[1;32m 91\u001b[0m args_msg \u001b[38;5;241m=\u001b[39m [\n\u001b[1;32m 92\u001b[0m \u001b[38;5;124m\"\u001b[39m\u001b[38;5;132;01m{}\u001b[39;00m\u001b[38;5;124m=\u001b[39m\u001b[38;5;132;01m{}\u001b[39;00m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;241m.\u001b[39mformat(name, arg)\n\u001b[1;32m 93\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m name, arg \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mzip\u001b[39m(kwonly_args[:extra_args], args[\u001b[38;5;241m-\u001b[39mextra_args:])\n\u001b[1;32m 94\u001b[0m ]\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/librosa/core/audio.py:170\u001b[0m, in \u001b[0;36mload\u001b[0;34m(path, sr, mono, offset, duration, dtype, res_type)\u001b[0m\n\u001b[1;32m 168\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28misinstance\u001b[39m(path, (\u001b[38;5;28mstr\u001b[39m, pathlib\u001b[38;5;241m.\u001b[39mPurePath)):\n\u001b[1;32m 169\u001b[0m warnings\u001b[38;5;241m.\u001b[39mwarn(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mPySoundFile failed. Trying audioread instead.\u001b[39m\u001b[38;5;124m\"\u001b[39m, stacklevel\u001b[38;5;241m=\u001b[39m\u001b[38;5;241m2\u001b[39m)\n\u001b[0;32m--> 170\u001b[0m y, sr_native \u001b[38;5;241m=\u001b[39m \u001b[43m__audioread_load\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43moffset\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mduration\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mdtype\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 171\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m 172\u001b[0m \u001b[38;5;28;01mraise\u001b[39;00m exc\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/librosa/core/audio.py:226\u001b[0m, in \u001b[0;36m__audioread_load\u001b[0;34m(path, offset, duration, dtype)\u001b[0m\n\u001b[1;32m 223\u001b[0m reader \u001b[38;5;241m=\u001b[39m path\n\u001b[1;32m 224\u001b[0m \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m 225\u001b[0m \u001b[38;5;66;03m# If the input was not an audioread object, try to open it\u001b[39;00m\n\u001b[0;32m--> 226\u001b[0m reader \u001b[38;5;241m=\u001b[39m \u001b[43maudioread\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43maudio_open\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 228\u001b[0m \u001b[38;5;28;01mwith\u001b[39;00m reader \u001b[38;5;28;01mas\u001b[39;00m input_file:\n\u001b[1;32m 229\u001b[0m sr_native \u001b[38;5;241m=\u001b[39m input_file\u001b[38;5;241m.\u001b[39msamplerate\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/audioread/__init__.py:127\u001b[0m, in \u001b[0;36maudio_open\u001b[0;34m(path, backends)\u001b[0m\n\u001b[1;32m 125\u001b[0m \u001b[38;5;28;01mfor\u001b[39;00m BackendClass \u001b[38;5;129;01min\u001b[39;00m backends:\n\u001b[1;32m 126\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m--> 127\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mBackendClass\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpath\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 128\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m DecodeError:\n\u001b[1;32m 129\u001b[0m \u001b[38;5;28;01mpass\u001b[39;00m\n",
"File \u001b[0;32m~/hf_env/lib/python3.8/site-packages/audioread/rawread.py:59\u001b[0m, in \u001b[0;36mRawAudioFile.__init__\u001b[0;34m(self, filename)\u001b[0m\n\u001b[1;32m 58\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__init__\u001b[39m(\u001b[38;5;28mself\u001b[39m, filename):\n\u001b[0;32m---> 59\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_fh \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mopen\u001b[39;49m\u001b[43m(\u001b[49m\u001b[43mfilename\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[38;5;124;43mrb\u001b[39;49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[43m)\u001b[49m\n\u001b[1;32m 61\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[1;32m 62\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_file \u001b[38;5;241m=\u001b[39m aifc\u001b[38;5;241m.\u001b[39mopen(\u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_fh)\n",
"\u001b[0;31mFileNotFoundError\u001b[0m: [Errno 2] No such file or directory: 'https://s3.amazonaws.com/bloom-speech/audio/tgl-0000-i98c3fa6a-8838-4840-8b82-b199b93007ab.mp3'"
]
}
],
"source": [
"trainer.train()"
]
},
{
"cell_type": "markdown",
"id": "747c6a6e",
"metadata": {
"pycharm": {
"name": "#%% md\n"
}
},
"source": [
"(note that training may take some time to commence as we load the first training data samples with streaming mode)"
]
},
{
"cell_type": "markdown",
"id": "810ced54-7187-4a06-b2fe-ba6dcca94dc3",
"metadata": {},
"source": [
"We can label our checkpoint with the `whisper-event` tag on push by setting the appropriate key-word arguments (kwargs):"
]
},
{
"cell_type": "code",
"execution_count": 48,
"id": "6dd0e310-9b07-4133-ac14-2ed2d7524e22",
"metadata": {},
"outputs": [],
"source": [
"kwargs = {\n",
" \"dataset_tags\": \"sil-ai/bloom-speech\",\n",
" \"dataset\": \"Bloom Speech Tagalog\", # a 'pretty' name for the training dataset\n",
" \"language\": \"tgl\",\n",
" \"model_name\": \"Whisper Tiny Tgl - Marc-Anthony\", # a 'pretty' name for your model\n",
" \"finetuned_from\": \"openai/whisper-tiny\",\n",
" \"tasks\": \"automatic-speech-recognition\",\n",
" \"tags\": \"whisper-event\",\n",
"}"
]
},
{
"cell_type": "markdown",
"id": "090d676a-f944-4297-a938-a40eda0b2b68",
"metadata": {},
"source": [
"The training results can now be uploaded to the Hub. To do so, execute the `push_to_hub` command:"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "95737cda-c5dd-4887-a4d0-dfcb0d61d977",
"metadata": {},
"outputs": [],
"source": [
"trainer.push_to_hub(**kwargs)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "hf_env",
"language": "python",
"name": "hf_env"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.10"
}
},
"nbformat": 4,
"nbformat_minor": 5
}