--- license: openrail language: - en pipeline_tag: depth-estimation tags: - monocular depth estimation - video depth estimation - in-the-wild - zero-shot - depth --- # 🛹 RollingDepth: Video Depth without Video Models [![Website](doc/badges/badge-website.svg)](https://rollingdepth.github.io) [![Hugging Face Model](https://img.shields.io/badge/GitHub-Code-blue)](https://github.com/prs-eth/RollingDepth) This repository represents the official implementation of the paper titled ["Video Depth without Video Models"](https://hf.co/papers/2411.19189). [Bingxin Ke](http://www.kebingxin.com/)¹, [Dominik Narnhofer](https://scholar.google.com/citations?user=tFx8AhkAAAAJ&hl=en)¹, [Shengyu Huang](https://shengyuh.github.io/)¹, [Lei Ke](https://www.kelei.site/)², [Torben Peters](https://scholar.google.com/citations?user=F2C3I9EAAAAJ&hl=de)¹, [Katerina Fragkiadaki](https://www.cs.cmu.edu/~katef/)², [Anton Obukhov](https://www.obukhov.ai/)¹, [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ&hl=en)¹ ¹ETH Zurich, ²Carnegie Mellon University These components are copied from [Stable-Diffusion-2](https://huggingface.co/stabilityai/stable-diffusion-2): `text_encoder`, `tokenizer`, and `vae`. ## 🎫 License This code of this work is licensed under the Apache License, Version 2.0 (as defined in the [LICENSE-CODE](LICENSE-CODE.txt)). The model is licensed under RAIL++-M License (as defined in the [LICENSE-MODEL](LICENSE-MODEL.txt)) By downloading and using the code and model you agree to the terms in [LICENSE-CODE](LICENSE-CODE.txt) and [LICENSE-MODEL](LICENSE-MODEL.txt) respectively.