(Demo video: example.mp4)
- Support 2D-GS
- Long sequence cross window alignment
- Support Mip-Splatting
To make your life a lot easier, follow along with the tutorial video I created.
Watch it here
- Clone InstantSplat and download the pre-trained model.
git clone --recursive https://github.com/NVlabs/InstantSplat.git
cd InstantSplat
if not exist "mast3r\checkpoints" mkdir "mast3r\checkpoints"
curl -o mast3r\checkpoints\MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth ^
https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth
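If curl is unavailable on your system, a minimal Python sketch like the following downloads the same checkpoint to the same location; the URL and destination path simply mirror the curl command above:
# fallback_download.py -- alternative to the curl command above
import os
import urllib.request

URL = ("https://download.europe.naverlabs.com/ComputerVision/MASt3R/"
       "MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth")
DEST = os.path.join("mast3r", "checkpoints",
                    "MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth")

os.makedirs(os.path.dirname(DEST), exist_ok=True)
if not os.path.exists(DEST):
    print(f"Downloading {URL} ...")
    urllib.request.urlretrieve(URL, DEST)
print(f"Checkpoint present: {DEST} ({os.path.getsize(DEST) / 1e6:.1f} MB)")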
- Create the environment (or use the pre-built Docker image); here we show an example using conda.
conda create -n instantsplat python=3.10.13 cmake=3.14.0 -y
conda activate instantsplat
conda install pytorch torchvision pytorch-cuda=12.1 -c pytorch -c nvidia # use the correct version of cuda for your system
pip install -r requirements.txt
pip install submodules/simple-knn
pip install submodules/diff-gaussian-rasterization
pip install submodules/fused-ssim
pip install plyfile
pip install open3d
pip install "imageio[ffmpeg]"
- Optional but highly recommended: compile the CUDA kernels for RoPE (as in CroCo v2).
# DUSt3R relies on RoPE positional embeddings, for which you can compile CUDA kernels for faster runtime.
cd croco/models/curope/
python setup.py build_ext --inplace
cd ../../..
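To confirm the kernels actually built, you can try importing the compiled extension from the repo root. Treat the module name and path below as my reading of the curope setup.py, not a documented API:
# verify the curope build; module name `curope` is my reading of setup.py (verify locally)
import sys
sys.path.insert(0, "croco/models/curope")
import curope  # import fails if the CUDA kernels did not compile
print("curope CUDA kernels OK")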
- Download run_infer.py and instantsplat_gradio.py and place them in the root folder:
C:\Users\<username>\InstantSplat
Note: with CUDA Toolkit 12.6 installed, I ran into issues running:
conda install pytorch torchvision pytorch-cuda=12.6 -c pytorch -c nvidia
As a workaround, I downloaded and installed CUDA Toolkit 11.8, then pointed the command session at CUDA 11.8 using:
set CUDA_HOME=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.8
set PATH=%CUDA_HOME%\bin;%PATH%
set LD_LIBRARY_PATH=%CUDA_HOME%\lib64;%LD_LIBRARY_PATH%
Then run:
conda install pytorch torchvision pytorch-cuda=11.8 -c pytorch -c nvidia
You can check which version of the CUDA Toolkit you are running with nvcc --version.
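If you suspect a mismatch, a small sketch like this compares the toolkit version nvcc reports against the CUDA version PyTorch was built with (purely illustrative):
# cuda_match.py -- compare the system nvcc against PyTorch's CUDA build
import re
import subprocess
import torch

out = subprocess.run(["nvcc", "--version"], capture_output=True, text=True).stdout
match = re.search(r"release (\d+\.\d+)", out)
print("nvcc toolkit:", match.group(1) if match else "not found")
print("torch CUDA build:", torch.version.cuda)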
The original project provides a few examples to try; you can also download their pre-processed data: link
Place 3, 6, or 12 photos in an images folder nested inside a project folder. Here is an example of what it should look like (a small validation sketch follows the tree):
Projects/
└── Scene/
    └── images/
        ├── image1.jpg
        ├── image2.jpg
        └── image3.jpg
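Before running, you can sanity-check a scene folder with a minimal sketch like this one. The script and its name are hypothetical (not part of the repo); it only assumes the layout shown above:
# check_scene.py -- hypothetical helper to validate a scene folder, not part of the repo
import sys
from pathlib import Path

scene = Path(sys.argv[1])            # e.g. Projects/Scene
images = scene / "images"
if not images.is_dir():
    sys.exit(f"missing images/ folder under {scene}")
files = [p for p in images.iterdir() if p.suffix.lower() in {".jpg", ".jpeg", ".png"}]
print(f"{len(files)} images found in {images}")
if len(files) not in (3, 6, 12):
    print("warning: the counts quoted above are 3, 6, or 12")
Usage: python check_scene.py Projects/Scene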
InstantSplat comes with example data to use as a test, located at:
assets/
└── sora/
    ├── Santorini/
    │   └── images/
    │       ├── image1.jpg
    │       ├── image2.jpg
    │       └── image3.jpg
    └── Art/
        └── images/
            ├── image1.jpg
            ├── image2.jpg
            └── image3.jpg
The Windows implementation currently supports inference only. If you are looking to run evaluation, refer to the original project page.
Run python instantsplat_gradio.py
Once launched, navigate to http://127.0.0.1:7860/ in your browser.
Run python run_infer.py /path/to/input/images /path/to/output --n_views 3 --iterations 1000
Command-line arguments:
--n_views      Number of input views. Must be 3, 6, or 9.
--iterations   Number of training iterations; can be set from 1000 to 30000. Increasing in increments of 1000 is suggested.
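If you want to batch several scenes rather than typing the command each time, a wrapper sketch along these lines works; the scene and output paths are just examples based on the assets folder above:
# batch_infer.py -- illustrative wrapper around the run_infer.py command above
import subprocess
import sys

SCENES = [
    ("assets/sora/Santorini/images", "output/Santorini"),
    ("assets/sora/Art/images", "output/Art"),
]

for images, out in SCENES:
    cmd = [sys.executable, "run_infer.py", images, out,
           "--n_views", "3", "--iterations", "1000"]
    print("running:", " ".join(cmd))
    subprocess.run(cmd, check=True)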
This work is built on many amazing research works and open-source projects, thanks a lot to all the authors for sharing!
If you find our work useful in your research, please consider giving a star ⭐ and citing the following paper 📝.
@misc{fan2024instantsplat,
title={InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds},
author={Zhiwen Fan and Wenyan Cong and Kairun Wen and Kevin Wang and Jian Zhang and Xinghao Ding and Danfei Xu and Boris Ivanovic and Marco Pavone and Georgios Pavlakos and Zhangyang Wang and Yue Wang},
year={2024},
eprint={2403.20309},
archivePrefix={arXiv},
primaryClass={cs.CV}
}