|
# 7Scenes dataset |
|
|
|
## Installation |
|
|
|
Download the images from the [7Scenes project page](https://www.microsoft.com/en-us/research/project/rgb-d-dataset-7-scenes/): |
|
```bash |
|
export dataset=datasets/7scenes |
|
for scene in chess fire heads office pumpkin redkitchen stairs; \ |
|
do wget http://download.microsoft.com/download/2/8/5/28564B23-0828-408F-8631-23B1EFF1DAC8/$scene.zip -P $dataset \ |
|
&& unzip $dataset/$scene.zip -d $dataset && unzip $dataset/$scene/'*.zip' -d $dataset/$scene; done |
|
``` |
|
|
|
Download the SIFT SfM models and DenseVLAD image pairs, courtesy of Torsten Sattler: |
|
```bash |
|
function download { |
|
wget --load-cookies /tmp/cookies.txt "https://docs.google.com/uc?export=download&confirm=$(wget --quiet --save-cookies /tmp/cookies.txt --keep-session-cookies --no-check-certificate "https://docs.google.com/uc?export=download&id=$1" -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/\1\n/p')&id=$1" -O $2 && rm -rf /tmp/cookies.txt |
|
unzip $2 -d $dataset && rm $2; |
|
} |
|
download 1cu6KUR7WHO7G4EO49Qi3HEKU6n_yYDjb $dataset/7scenes_sfm_triangulated.zip |
|
download 1IbS2vLmxr1N0f3CEnd_wsYlgclwTyvB1 $dataset/7scenes_densevlad_retrieval_top_10.zip |
|
``` |
|
|
|
Download the rendered depth maps, courtesy of Eric Brachmann for [DSAC\*](https://github.com/vislearn/dsacstar): |
|
```bash |
|
wget https://heidata.uni-heidelberg.de/api/access/datafile/4037 -O $dataset/7scenes_rendered_depth.tar.gz |
|
mkdir $dataset/depth/ |
|
tar xzf $dataset/7scenes_rendered_depth.tar.gz -C $dataset/depth/ && rm $dataset/7scenes_rendered_depth.tar.gz |
|
``` |
|
|
|
## Pipeline |
|
|
|
```bash |
|
python3 -m hloc.pipelines.7Scenes.pipeline [--use_dense_depth] |
|
``` |
|
By default, hloc triangulates a sparse point cloud that can be noisy in indoor environements due to image noise and lack of texture. With the flag `--use_dense_depth`, the pipeline improves the accuracy of the sparse point cloud using dense depth maps provided by the dataset. The original depth maps captured by the RGBD sensor are miscalibrated, so we use depth maps rendered from the mesh obtained by fusing the RGBD data. |
|
|
|
## Results |
|
We report the median error in translation/rotation in cm/deg over all scenes: |
|
| Method \ Scene | Chess | Fire | Heads | Office | Pumpkin | Kitchen | Stairs | |
|
| ------------------------------- | -------------- | -------------- | -------------- | -------------- | -------------- | -------------- | ---------- | |
|
| Active Search | 3/0.87 | **2**/1.01 | **1**/0.82 | 4/1.15 | 7/1.69 | 5/1.72 | 4/**1.01** | |
|
| DSAC* | **2**/1.10 | **2**/1.24 | **1**/1.82 | **3**/1.15 | **4**/1.34 | 4/1.68 | **3**/1.16 | |
|
| **SuperPoint+SuperGlue** (sfm) | **2**/0.84 | **2**/0.93 | **1**/**0.74** | **3**/0.92 | 5/1.27 | 4/1.40 | 5/1.47 | |
|
| **SuperPoint+SuperGlue** (RGBD) | **2**/**0.80** | **2**/**0.77** | **1**/0.79 | **3**/**0.80** | **4**/**1.07** | **3**/**1.13** | 4/1.15 | |
|
|
|
## Citation |
|
Please cite the following paper if you use the 7Scenes dataset: |
|
``` |
|
@inproceedings{shotton2013scene, |
|
title={Scene coordinate regression forests for camera relocalization in {RGB-D} images}, |
|
author={Shotton, Jamie and Glocker, Ben and Zach, Christopher and Izadi, Shahram and Criminisi, Antonio and Fitzgibbon, Andrew}, |
|
booktitle={CVPR}, |
|
year={2013} |
|
} |
|
``` |
|
|
|
Also cite DSAC* if you use dense depth maps with the flag `--use_dense_depth`: |
|
``` |
|
@article{brachmann2020dsacstar, |
|
title={Visual Camera Re-Localization from {RGB} and {RGB-D} Images Using {DSAC}}, |
|
author={Brachmann, Eric and Rother, Carsten}, |
|
journal={TPAMI}, |
|
year={2021} |
|
} |
|
``` |
|
|