Spaces:
Runtime error
Runtime error
# Preparing datasets for LAVILA | |
Please download the (selected) datasets from the official websites and place or sim-link them under `$LAVILA_ROOT/datasets/`. | |
```bash | |
$LAVILA_ROOT/datasets/ | |
CharadesEgo/ | |
EGTEA/ | |
EK100/ | |
Ego4D/ | |
``` | |
## Ego4D | |
1. Download [Ego4D videos](https://ego4d-data.org/docs/start-here/#download-data) (license is required). | |
2. Preprocess(TBA) | |
3. Download annotations | |
a. Download [egomcq.json](https://drive.google.com/file/d/1-5iRYf4BCHmj4MYQYFRMY4bhsWJUN3rW/view) to `$LAVILA_ROOT/datasets/Ego4D` (if you want to evaluate EgoMCQ). | |
b. Download [metadata for train split](https://dl.fbaipublicfiles.com/lavila/metadata/ego4d/ego4d_train.pkl) and [val split](https://dl.fbaipublicfiles.com/lavila/metadata/ego4d/ego4d_val.pkl) to `$LAVILA_ROOT/datasets/Ego4D` ((if you want to train LAVILA from scratch). | |
The fold should look like this: | |
```bash | |
$LAVILA_ROOT/datasets/ | |
Ego4D/ | |
ego4d_train.pkl | |
ego4d_val.pkl | |
egomcq.json | |
video_288px/ | |
000786a7-3f9d-4fe6-bfb3-045b368f7d44.mp4/ | |
0.mp4 | |
300.mp4 | |
000a3525-6c98-4650-aaab-be7d2c7b9402.mp4/ | |
0.mp4 | |
... | |
``` | |
## EPIC-Kitchens-100 (EK-100) | |
1. Download annotations | |
```bash | |
# Assume that you are under `datasets/EK100/` | |
git clone https://github.com/epic-kitchens/epic-kitchens-100-annotations | |
``` | |
2. Download videos. | |
a. For raw videos, please download them from [https://epic-kitchens.github.io/](https://epic-kitchens.github.io/). | |
b. (Recommended) The raw videos are huge (~1 TB). As an alternative, please check out a [resized version](). | |
3. (For EK-100 MIR) | |
a. Generate the relevancy matrix of train/val splits using [the official code](https://github.com/mwray/Joint-Part-of-Speech-Embeddings). | |
b. (Recommended) The generated result has some randomness. Therefore, we also provide the [replica of train split](https://dl.fbaipublicfiles.com/lavila/metadata/EK100/caption_relevancy_EPIC_100_retrieval_train.pkl) and [val split](https://dl.fbaipublicfiles.com/lavila/metadata/EK100/caption_relevancy_EPIC_100_retrieval_test.pkl). Please put them to the folder `$LAVILA_ROOT/datasets/EK100/epic-kitchens-100-annotations/retrieval_annotations/relevancy/`. | |
The folder should look like this: | |
```bash | |
$LAVILA_ROOT/datasets/ | |
EK100/ | |
epic-kitchens-100-annotations/ | |
EPIC_100_train.csv | |
EPIC_100_validation.csv | |
... | |
retrieval_annotations/relevancy/ # this appears if you do 3. | |
caption_relevancy_EPIC_100_retrieval_train.pkl | |
caption_relevancy_EPIC_100_retrieval_test.pkl | |
video_ht256px/ | |
P01/ | |
P01_01.MP4 | |
P01_02.MP4 | |
... | |
P01_19.MP4 | |
P02/ | |
P02_01.MP4 | |
P02_02.MP4 | |
... | |
P02_15.MP4 | |
... | |
``` | |
## CharadesEgo | |
1. Download annotations at [https://prior.allenai.org/projects/charades-ego](https://prior.allenai.org/projects/charades-ego). | |
```bash | |
### Annotations | |
# Assume that you are under `datasets/CharadesEgo/` | |
wget https://ai2-public-datasets.s3-us-west-2.amazonaws.com/charades/CharadesEgo.zip | |
unzip CharadesEgo.zip && rm CharadesEgo.zip | |
``` | |
2. Download data (~11GB) at [https://prior.allenai.org/projects/charades-ego](https://prior.allenai.org/projects/charades-ego). | |
```bash | |
### Data | |
wget https://ai2-public-datasets.s3-us-west-2.amazonaws.com/charades/CharadesEgo_v1_480.tar | |
tar -xvf CharadesEgo_v1_480.tar # Or specify an external path using `-C` and sim-link it to here | |
rm CharadesEgo_v1_480.tar | |
``` | |
3. (For fine-tuning CharadesEgo) Download two additional metadata files: [clip-level metadata (train)](https://dl.fbaipublicfiles.com/lavila/metadata/CharadesEgo/metadata_filtered_train.pkl) and [clip-level metadata (val)](https://dl.fbaipublicfiles.com/lavila/metadata/CharadesEgo/metadata_filtered_val.pkl). Put them to the folder `$LAVILA_ROOT/datasets/CharadesEgo/CharadesEgo/`. | |
The folder should look like this: | |
```bash | |
$LAVILA_ROOT/datasets/ | |
CharadesEgo/ | |
CharadesEgo/ | |
CharadesEgo_v1_train_only1st.csv | |
CharadesEgo_v1_test_only1st.csv | |
... | |
metadata_filtered_train.pkl # this appears if you do 3. | |
metadata_filtered_val.pkl # this appears if you do 3. | |
CharadesEgo_v1_480/ | |
005BU.mp4 | |
005BUEGO.mp4 | |
... | |
``` | |
## EGTEA | |
1. Visit [https://cbs.ic.gatech.edu/fpv/](https://cbs.ic.gatech.edu/fpv/). | |
2. Download `TRIMMED_ACTION_CLIPS` (~20GB) and `ACTION_ANNOTATIONS` and untar to the current folder `$LAVILA_ROOT/datasets/EGTEA`. | |
```bash | |
unzip action_annotation.zip -d EGTEA/ && rm action_annotation.zip | |
``` | |
The folder should look like this: | |
```bash | |
$LAVILA_ROOT/datasets/ | |
EGTEA/ | |
train_split1.txt | |
test_split1.txt | |
cropped_clips/ | |
OP01-R01-PastaSalad/ | |
OP01-R01-PastaSalad-1002316-1004005-F024051-F024101.mp4 | |
OP01-R01-PastaSalad-1004110-1021110-F024057-F024548.mp4 | |
OP01-R01-PastaSalad-1022590-1024050-F024539-F024581.mp4 | |
... | |
OP01-R02-TurkeySandwich/ | |
OP01-R02-TurkeySandwich-102320-105110-F002449-F002529.mp4 | |
OP01-R02-TurkeySandwich-105440-106460-F002528-F002558.mp4 | |
OP01-R02-TurkeySandwich-107332-133184-F002513-F003259.mp4 | |
... | |
... | |
``` | |