spirulae-splat

My custom method for Nerfstudio. Based on the splatfacto method in official Nerfstudio implementation.

Major changes / Novelties:

Use 2D splats instead of 3D, add depth and normal regularization, use ray-splat intersection, as per SIGGRAPH 2024 paper 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
- Use an interpolation function for median depth to reduce discontinuities and encourage denser gradient
- For depth regularization, allow blending between two modes: pairwise L2 loss, and L1 loss to rendered depth
Use polynomial spline kernel splats instead of Gaussians, as inspired by kernels used in computational physics
- This increases rendering speed by zeroing out opacities beyond a certain radius
- A splat with higher overall opacity produces fewer Gaussians, especially when combined with culling splats with absgrad below a threshold
- $\alpha=\max(1-r^2,0)$ empirically produces 1.5x-2x faster training and inference compared to Gaussians, with 0.2-0.5 drop in PSNR compared to Gaussian kernel for a similar number of splats
Use cylindrical harmonics to model uv-dependent color
- Base+SH color multiplied by the sigmoid of 2D "polar" harmonics
- Able to better capture fine color details and potentially reduce export file size on scenes with fine detail
- Optionally use absolute gradient of CH coefficients as a criteria to split and duplicate splats
- Slows down training by about a half, due to resource spent on Bessel function evaluation and loading CH coefficients from global memory

Introduce splat anisotropy by multiplying opacity by a smoothstep of UV directional dot product $1-\mathrm{smoothstep}(\mathbf{a}\cdot(u,v))$, intended to better represent sharp edges
Handle exposure change by fitting output image to ground-truth image with a linear model before evaluating loss
- Supported models: gt ~ k * pred with scalar or per-channel k, log(gt) ~ k * log(pred) + b with scalar or per-channel k and b, gt ~ A * pred with 3x3 matrix A, log(gt) ~ A * log(pred) + b with matrix A and vector b
- Closed-form solution exists using linear least squares
- To ensure predicted images look good during inference, the (weighted) mean and covariance of splat colors is fitted to mean and covariance of training image colors. This is alternatively achieved by adding a small loss component from pre-adjustment image when evaluating photometric loss with ground-truth image.

Use spherical harmonics for direction-dependent background color
Initialize splat scale and orientation based on SVD of neighbor SfM points

Ideas / To-do

Fast per-pixel depth sorting
- Has potential to stop "pops" and significantly reduce the number of splats, as per this paper
- Has potential to improve depth regularization: So far L2 depth regularization on intersected depth performs worse than L1 regularization based on center of splats
- Alternatively try ray tracing? Might not be fast for training, but seems to be an inference solution that's friendly with existing graphics pipelines
Better model for view-dependent colors for glossy surfaces, e.g. floor where most camera views have low elevation angle, result doesn't generalize well to bird's eye views
- Idea: low-degree SH with Fresnel reflectance?
Depth supervision
- Fit rendered depth to depth predicted using MVS or a foundational depth model
- Special attention needed for biased depth, possibly using an approach similar to how exposure is handled
Speed up uv-dependent color
- Idea: use an interpolated circular grid that allow both low-latency access and accomodation for circular splat shape
- Also apply texture to opacity?
Anti-aliasing
Try non-planar splats, like quadratic patches
Support camera distortion, either in Gaussian projection or using ray tracing

Installation

Install Nerfstudio (see instructions). Clone this repository and run the commands:

cd spirulae-splat/
git submodule update --init
conda activate nerfstudio
pip install -e .
ns-install-cli

I mainly tested on nerfstudio 1.1.5, Python 3.8.10, Ubuntu 20.04, CUDA 11.8 and 12.5. It may also work with other versions and platforms.

Running the new method

This repository creates a new Nerfstudio method named "spirulae". To train with it, run the command:

ns-train spirulae --data [DATASET_PATH]

By default, spirulae-splat uses all available images for training. To support ns-eval, for nerfstudio dataset, use the following command for training:

ns-train spirulae --data [DATASET_PATH] nerfstudio-data --train-split-fraction 0.9

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
spirulae_splat		spirulae_splat
tests		tests
webgl		webgl
.gitignore		.gitignore
.gitmodules		.gitmodules
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spirulae-splat

Major changes / Novelties:

Ideas / To-do

Installation

Running the new method

About

Releases

Packages

Languages

License

harry7557558/spirulae-splat

Folders and files

Latest commit

History

Repository files navigation

spirulae-splat

Major changes / Novelties:

Ideas / To-do

Installation

Running the new method

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages