This project is an unofficial ComfyUI implementation of Video Depth Anything, which estimates depth on long videos without compromising quality, consistency, or generalization ability.
Navigate to the ComfyUI /custom_nodes directory
git clone https://github.com/yuvraj108c/ComfyUI-Video-Depth-Anything.git
cd ./ComfyUI-Video-Depth-Anything
pip install -r requirements.txt
Xformers (optional)
pip install xformers
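To confirm whether the optional xformers package is visible to the Python environment that ComfyUI runs in, a minimal check could look like the following. The helper name `has_package` is illustrative and not part of this project; it uses only the standard library and does not import xformers itself.

```python
import importlib.util


def has_package(name: str) -> bool:
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # xformers is optional; this only reports whether it is installed.
    status = "installed" if has_package("xformers") else "not installed"
    print(f"xformers: {status}")
```

Run it with the same Python interpreter that launches ComfyUI, since a package installed into a different environment will not be found.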
- Load the example workflow
- Models will download automatically to ComfyUI/models/videodepthanything
- The large model doesn't work with 24GB VRAM
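To see which model files have been downloaded so far, the model directory named above can be listed with a few lines of Python. This is only a sketch: the function name `list_model_files` and its `comfyui_root` argument are hypothetical, and the path assumes the default layout ComfyUI/models/videodepthanything.

```python
from pathlib import Path


def list_model_files(comfyui_root: str) -> list[tuple[str, float]]:
    """Return (file name, size in MiB) for each file in the model directory."""
    model_dir = Path(comfyui_root) / "models" / "videodepthanything"
    if not model_dir.is_dir():
        return []  # nothing downloaded yet, or a different root was given
    return sorted(
        (p.name, p.stat().st_size / 2**20)
        for p in model_dir.iterdir()
        if p.is_file()
    )


if __name__ == "__main__":
    # "ComfyUI" is an assumed relative path; adjust to the actual install location.
    for name, size_mib in list_model_files("ComfyUI"):
        print(f"{name}: {size_mib:.1f} MiB")
```

An empty result means either that no model has been downloaded yet or that the root path does not point at the ComfyUI installation.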
9 February 2025
- Support autocast inference, minor cleanups
@article{video_depth_anything,
title={Video Depth Anything: Consistent Depth Estimation for Super-Long Videos},
author={Chen, Sili and Guo, Hengkai and Zhu, Shengnan and Zhang, Feihu and Huang, Zilong and Feng, Jiashi and Kang, Bingyi},
journal={arXiv:2501.12375},
year={2025}
}
- Video-Depth-Anything-Small model is under the Apache-2.0 license.
- Video-Depth-Anything-Large model is under the CC-BY-NC-4.0 license.