--- license: openrail language: - en pipeline_tag: depth-estimation tags: - monocular depth estimation - video depth estimation - in-the-wild - zero-shot - depth --- # 🛹 RllingDepth: Video Depth without Video Models [![Website](doc/badges/badge-website.svg)](https://rollingdepth.github.io) [![Hugging Face Model](https://img.shields.io/badge/GitHub-Code-blue)](https://github.com/prs-eth/RollingDepth) This repository represents the official implementation of the paper titled "Video Depth without Video Models". [Bingxin Ke](http://www.kebingxin.com/)1, [Dominik Narnhofer](https://scholar.google.com/citations?user=tFx8AhkAAAAJ&hl=en)1, [Shengyu Huang](https://shengyuh.github.io/)1, [Lei Ke](https://www.kelei.site/)2, [Torben Peters](https://scholar.google.com/citations?user=F2C3I9EAAAAJ&hl=de)1, [Katerina Fragkiadaki](https://www.cs.cmu.edu/~katef/)2, [Anton Obukhov](https://www.obukhov.ai/)1, [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ&hl=en)1 1ETH Zurich, 2Carnegie Mellon University These components are copied from [Stable-Diffusion-2](https://huggingface.co/stabilityai/stable-diffusion-2): `text_encoder`, `tokenizer`, and `vae`. ## 🎫 License This code of this work is licensed under the Apache License, Version 2.0 (as defined in the [LICENSE-CODE](LICENSE-CODE.txt)). The model is licensed under RAIL++-M License (as defined in the [LICENSE-MODEL](LICENSE-MODEL.txt)) By downloading and using the code and model you agree to the terms in [LICENSE-CODE](LICENSE-CODE.txt) and [LICENSE-MODEL](LICENSE-MODEL.txt) respectively.