Keunhong Park, Philipp Henzler, Ben Mildenhall, Jonathan T. Barron, and Ricardo Martin-Brualla, “CamP: Camera Preconditioning for Neural Radiance Fields,” ACM Trans. Graph., 2023.
@article{park_camp_2023,
author = {Park, Keunhong and Henzler, Philipp and Mildenhall, Ben and Barron, Jonathan T. and Martin-Brualla, Ricardo},
title = {CamP: Camera Preconditioning for Neural Radiance Fields},
journal = {ACM Trans. Graph.},
publisher = {ACM},
year = {2023},
issue_date = {December 2023},
freepdf = {https://arxiv.org/pdf/2308.10902},
website = {https://camp-nerf.github.io/},
arxiv = {arXiv:2308.10902 [cs.CV]},
arxivdoi = {https://doi.org/10.48550/arXiv.2308.10902},
tldr = {NeRF joint optimization of camera parameters and scene reconstruction. Uses a left preconditioner for each camera's parameters (Zero Component Analysis (ZCA) whitening transform (Kessy et al. 2018)), derived from a projection function; apply this at the initial iteration of the optimization. The new method is implemented on top of Zip-NeRF (Barron et al. 2023).}
}
Neural Radiance Fields (NeRF) can be optimized to obtain high-fidelity 3D scene reconstructions of objects and large-scale scenes. However, NeRFs require accurate camera parameters as input – inaccurate camera parameters result in blurry renderings. Extrinsic and intrinsic camera parameters are usually estimated using Structure-from-Motion (SfM) methods as a pre-processing step to NeRF, but these techniques rarely yield perfect estimates. Thus, prior works have proposed jointly optimizing camera parameters alongside a NeRF, but these methods are prone to local minima in challenging settings. In this work, we analyze how different camera parameterizations affect this joint optimization problem, and observe that standard parameterizations exhibit large differences in magnitude with respect to small perturbations, which can lead to an ill-conditioned optimization problem. We propose using a proxy problem to compute a whitening transform that eliminates the correlation between camera parameters and normalizes their effects, and we propose to use this transform as a preconditioner for the camera parameters during joint optimization. Our preconditioned camera optimization significantly improves reconstruction quality on scenes from the Mip-NeRF 360 dataset: we reduce error rates (RMSE) by 67% compared to state-of-the-art NeRF approaches that do not optimize for cameras like Zip-NeRF, and by 29% relative to state-of-the-art joint optimization approaches using the camera parameterization of SCNeRF. Our approach is easy to implement, does not significantly increase runtime, can be applied to a wide variety of camera parameterizations, and can straightforwardly be incorporated into other NeRF-like models.
tl;dr: NeRF joint optimization of camera parameters and scene reconstruction. Uses a left preconditioner for each camera’s parameters (Zero Component Analysis (ZCA) whitening transform (Kessy et al. 2018)), derived from a projection function; apply this at the initial iteration of the optimization. The new method is implemented on top of Zip-NeRF (Barron et al. 2023).
CamP: Camera Preconditioning for Neural Radiance Fields.
Keunhong Park, Philipp Henzler, Ben Mildenhall, Jonathan T. Barron, and Ricardo Martin-Brualla.
arXiv: http://doi.org/https://doi.org/10.48550/arXiv.2308.10902
h/t Amy Tabbpdf: https://arxiv.org/pdf/2308.10902
web: https://camp-nerf.github.io/