---
license: openrail
language:
- en
pipeline_tag: depth-estimation
tags:
- monocular depth estimation
- video depth estimation
- in-the-wild
- zero-shot
- depth
---
# 🛹 RollingDepth: Video Depth without Video Models
[![Website](doc/badges/badge-website.svg)](https://rollingdepth.github.io)
[![Hugging Face Model](https://img.shields.io/badge/GitHub-Code-blue)](https://github.com/prs-eth/RollingDepth)
This repository represents the official implementation of the paper titled ["Video Depth without Video Models"](https://hf.co/papers/2411.19189).
[Bingxin Ke](http://www.kebingxin.com/)1,
[Dominik Narnhofer](https://scholar.google.com/citations?user=tFx8AhkAAAAJ&hl=en)1,
[Shengyu Huang](https://shengyuh.github.io/)1,
[Lei Ke](https://www.kelei.site/)2,
[Torben Peters](https://scholar.google.com/citations?user=F2C3I9EAAAAJ&hl=de)1,
[Katerina Fragkiadaki](https://www.cs.cmu.edu/~katef/)2,
[Anton Obukhov](https://www.obukhov.ai/)1,
[Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ&hl=en)1
1ETH Zurich,
2Carnegie Mellon University
These components are copied from [Stable-Diffusion-2](https://huggingface.co/stabilityai/stable-diffusion-2): `text_encoder`, `tokenizer`, and `vae`.
## 🎫 License
This code of this work is licensed under the Apache License, Version 2.0 (as defined in the [LICENSE-CODE](LICENSE-CODE.txt)).
The model is licensed under RAIL++-M License (as defined in the [LICENSE-MODEL](LICENSE-MODEL.txt))
By downloading and using the code and model you agree to the terms in [LICENSE-CODE](LICENSE-CODE.txt) and [LICENSE-MODEL](LICENSE-MODEL.txt) respectively.