[ICCV21] Self-Calibrating Neural Radiance Fields

Last update: Dec 30, 2022

Related tags

Deep Learning computer-vision deep-learning pytorch calibration nerf self-calibration implicit-representions

Overview

Self-Calibrating Neural Radiance Fields, ICCV, 2021

Project Page | Paper | Video

Author Information

News

2021-09-02: The first version of Self-Calibrating Neural Radiance Fields is published

Overview

In this work, we propose a camera self-calibration algorithm for generic cameras with arbitrary non-linear distortions. We jointly learn the geometry of the scene and the accurate camera parameters without any calibration objects. Our camera model consists a pinhole model, radial distortion, and a generic noise model that can learn arbitrary non-linear camera distortions. While traditional self-calibration algorithms mostly rely on geometric constraints, we additionally incorporate photometric consistency. This requires learning the geometry of the scene and we use Neural Radiance Fields (NeRF). We also propose a new geometric loss function, viz., projected ray distance loss, to incorporate geometric consistency for complex non-linear camera models. We validate our approach on standard real image datasets and demonstrate our model can learn the camera intrinsics and extrinsics (pose) from scratch without COLMAP initialization. Also, we show that learning accurate camera models in differentiable manner allows us to improves PSNR over NeRF. We experimentally demonstrate that our proposed method is applicable to variants of NeRF. In addition, we use a set of images captured with a fish-eye lens to demonstrate that learning camera model jointly improves the performance significantly over the COLMAP initialization.

Method

Generic Camera Model

We provide the definition of our differentiable camera model that combines the pinhole camera model, radial distortion, and a generic non-linear camera distortion for self-calibration. Our differentiable generic camera model consists of four components: intrinsic, extrinsic, radial distortion, and non-linear distortion parameters. We show that modeling the rays more accurately (camera model) results in better neural rendering. The following figure shows the computational steps to generate rays of our proposed learnable generic camera model.

Projected Ray Distance

The generic camera model poses a new challenge defining a geometric loss. In most traditional work, the geometric loss is defined as an epipolar constraint that measures the distance between an epipolar line and the corresponding point, or reprojection error where a 3D point for a correspondence is defined first which is then projected to an image plane to measure the distance between the projection and the correspondence. In this work, rather than requiring a 3D reconstruction to compute an indirect loss like the reprojection error, we propose the projected ray distance loss that directly measures the discrepancy between rays using our generic camera model.

Curriculum Learning

The camera parameters determine the positions and directions of the rays for NeRF learning, and unstable values often result in divergence or sub-optimal results. Thus, we incrementally add a subset of learning parameters to the optimization process to reduce the complexity of learning cameras and geometry jointly. First, we learn the NeRF network while initializing the camera focal lengths and camera centers to half the image width and height. Learning coarse geometry first is crucial since it initializes the network parameters suitable for learning better camera parameters. Next, we sequentially add camera parameters from the linear camera model, radial distortion, to nonlinear noise of ray direction, ray origin to the learning. We progressively make the camera model more complex to prevent the camera parameters from overfitting and also allows faster training.

Installation

Requirements

Ubuntu 16.04 or higher
CUDA 11.1 or higher
Python v3.7 or higher
Pytorch v1.7 or higher
Hardware Spec
- GPUs 11GB (2080ti) or larger capacity
- For NeRF++, 2GPUs(2080ti) are required to reproduce the result
- For FishEyeNeRF experiments, we have used 4GPUs(V100).

Environment Setup

We recommend to conda for installation. All the requirements for two codes, NeRF and NeRF++, are included in the requirements.txt
```
conda create -n icn python=3.8
conda activate icn
pip install -r requirements.txt
git submodule update --init --recursive
```

Pretrained Weights & Qualitative Results

Here, we provide pretrained weights for users to easily reproduce results in the paper. You can download the pretrained weight in the following link. In the link, we provide all the weights of experiments, reported in our paper. To load the pretrained weight, add the following argument at the end of argument in each script. In the zip file, we have also included qualitative results that are used in our paper.

Link to download the pretrained weight: [link]

Datasets

We use three datasets for evaluation: LLFF dataset, tanks and temples dataset, and FishEyeNeRF dataset (Images captured with a fish-eye lens).

LLFF dataset: [link]
Tanks and Temples dataset: [link]
FishEyeNeRF: [link]

Put the data in the directory "data/" then add soft link with one of the following:

ln -s data/nerf_llff_data NeRF/data
ln -s data/tanks_and_temples nerfplusplus/data
ln -s data/FishEyeNeRF nerfplusplus/data/fisheyenerf

Demo Code

The demo code is available at "demo.sh" file. This code runs curriculum learning in NeRF architecture. Please install the aforementioned requirements before running the code. To run the demo code, run:

sh demo.sh

If you want to reproduce the results that are reported in our main paper, run the scripts in the "scripts" directory.

Main Table 1: Self-Calibration Experiment (LLFF)
Main Table 2: Improvement over NeRF (LLFF)
Main Table 3: Improvement over NeRF++ (Tanks and Temples)
Main Table 4: Improvement over NeRF++ (Images with a fish-eye lens)

Code Example:

sh scripts/main_table_1/fern/main1_fern_ours.sh
sh scripts/main_table_2/fern/main2_fern_ours.sh
sh scripts/main_table_3/main_3_m60.sh
sh scripts/main_table_4/globe_ours.sh

Citing Self-Calibrating Neural Radiance Fields

@inproceedings{SCNeRF2021,
    author = {Yoonwoo Jeong, Seokjun Ahn, Christopehr Choy, Animashree Anandkumar, 
    Minsu Cho, and Jaesik Park},
    title = {Self-Calibrating Neural Radiance Fields},
    booktitle = {ICCV},
    year = {2021},
}

Concurrent Work

We list a few recent concurrent projects that tackle camera extrinsics (pose) optimization in NeRF. Note that our Self-Calibrating NeRF optimizes an extensive set of camera parameters for intrinsics, extrinsics, radial distortion, and non-linear distortion.

Acknowledgements

We appreciate all ICCV reviewers for valuable comments. Their valuable suggestions have helped us to improve our paper. We also acknowledge amazing implementations of NeRF++(https://github.com/Kai-46/nerfplusplus) and NeRF-pytorch(https://github.com/yenchenlin/nerf-pytorch).

Comments

How to run experiment with only photos?

Hi Mr,

I would like to run an experiment with your model using a list of pictures from an object to get the estimated camera poses of each picture. How can I mount that experiment?

Thanks in advance,
enhancement

opened by franciscoWizz 16
Does `--dataset_type custom` work for 360 scenes with photos only ?
Two questions, if you don't mind:

Does the code in "custom" branch with --dataset_type custom work for 360 degree scenes like the "tanks_and_temples" images ? Or is it only for forward facing scenes like LLFF fern dataset?

If it does work for 360 scenes, can you confirm that it doesn't need any COLMAP camera parameters, initialization, etc. ?

help wanted question suggestion
opened by vishnukool 12
How to get more detail about the camera pose?

Hello author,

Thank you for sharing your code. I want to capture the corresponding camera pose by taking images of a circle. For example, I take a picture every 10 degrees. After I train the network, I find the results in logs are some images and some .tar file. Can I get some information about the camera pose like a 4x4 matrix?

Looking forward to your reply.
help wanted

opened by xufengfan96 10
Problems running colmap_utils script
Hi,

Thanks for the repo. I was trying to run SCNeRF with only images, but after looking at the code and issues related, it seems like I need to run colmap_utils script nontheless. But there are several errors trying to run the script:

File "/home/SCNeRF/colmap_utils/read_sparse_model.py", line 378, in main depth_ext = os.listdir(os.path.join(args.working_dir, "depth"))[0][-4:]

and

File "/home/SCNeRF/colmap_utils/post_colmap.py", line 33, in load_colmap_data with open(os.path.join(realdir, "train.txt"), "r") as f: FileNotFoundError: [Errno 2] No such file or directory: '/data/TUM_desk_rgb/train.txt'

I checked the code, I think the error happens because there is no depth output from colmap directly, and no idea what is train.txt. Could you double check the provided script work for a pure rgb dataset all the way through?

And if possible, it would be very helpful if you could provide a more detailed guide on how to run with only image inputs.

Thanks in advance!
question
opened by kudo1026 6
How dose "multiplicative_noise" infrence results?

Thank for your great works! I want to know how "multiplicative_noise" infrences results. Will results be better if I use add it for training? I find that you set it to be "True" in all experiments. Thanks in advance! Reference codes are here: https://github.com/POSTECH-CVLab/SCNeRF/blob/master/model/camera_model.py#L166
question

opened by DRosemei 6
Parameter setting different with paper
In scripts/main_table_2/fern/main2_fern_ours.sh, last line is:

--ft_path logs/main1_fern_nerf/200000.tar

which means using main1_fern_nerf to init model. but this 200000 iter in table1 nerf setting is trained with --run_without_colmap both, and in paper the Table2 result is initialized by COLMAP camera information. So the first 200000 iter should be trained with --run_without_colmap none, instead of --run_without_colmap both,

According to the description above, there will be conflicts. And I think maybe it should be

--ft_path logs/main2_fern_nerf/200000.tar

?
suggestion
opened by LOOKCC 4
Implementing SCNeRF on custom dataset

Hi @jeongyw12382 ,

I have a set of images. However, I am aware of the FOV and θ, Φ 3D angles for each image. Would it be possible for me train the NeRF model without COLMAP? Unfortunately, colmap doesn't work well on my dataset. I get an error saying: ERROR: the correct camera poses for current points cannot be accessed
question

opened by shreyask3107 3
Question about the equations in the paper

Q1 Is it true that each element of n is divided by c ? not f?

Also, what is the meaning of p' value? undistorted pixel?

Q2 In this equation, I'm not sure why $z_d$ is multiplied twice.

Q3 In this equation, why divide it by $r_{A,d} \cdot r_{A,d}$ instead of $||r_{A,d}||$ ?

Q4 In this equation, round L in the last term should be round r?
question

opened by emjay73 3
ERROR Error while calling W&B API: project not found ()

Loaded SuperPoint model Loaded SuperGlue model ("outdoor" weights) wandb: (1) Create a W&B account wandb: (2) Use an existing W&B account wandb: (3) Don't visualize my results wandb: Enter your choice: 2 wandb: You chose 'Use an existing W&B account' wandb: You can find your API key in your browser here: https://wandb.ai/authorize wandb: Paste an API key from your profile and hit enter: wandb: Appending key for api.wandb.ai to your netrc file: /root/.netrc wandb: ERROR Error while calling W&B API: project not found (<Response [404]>) Thread SenderThread: Traceback (most recent call last): File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/sdk/lib/retry.py", line 102, in call result = self._call_fn(*args, **kwargs) File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/sdk/internal/internal_api.py", line 138, in execute six.reraise(*sys.exc_info()) File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/six.py", line 719, in reraise raise value File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/sdk/internal/internal_api.py", line 132, in execute return self.client.execute(*args, **kwargs) File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/vendor/gql-0.2.0/gql/client.py", line 52, in execute result = self._get_result(document, *args, **kwargs) File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/vendor/gql-0.2.0/gql/client.py", line 60, in _get_result return self.transport.execute(document, *args, **kwargs) File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/wandb/vendor/gql-0.2.0/gql/transport/requests.py", line 39, in execute request.raise_for_status() File "/root/anaconda3/envs/icn/lib/python3.8/site-packages/requests/models.py", line 953, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://api.wandb.ai/graphql
bug

opened by NewSignal 3
Ablation Study on Tank and Temple datasets

Hi， thanks for your great work! i have some questions about applying this work on some large scale datasets. 1.In your paper you did ablation study on LLFF datasets about IE OD PRD, did you do the ablation study on Tank and Temple dataset ? 2.Is it feasible to apply this work on some large scale datasets without initial poses by colmap? 3.In BARF: Bundle-Adjusting Neural Radiance Fields paper, they mention that it's hard to optimize nerf and poses because of the position encoding, in your experienments, do you think it is necessary to change the position encoding function as BARF said ? @chrischoy @minsucho @soskek @joonahn Looking forward to your reply!
question

opened by cv-lab-x 2
proj_ray_dist_threshold

Hello. Thank you for great paper and code.

I have one small question.

The threshold value of projected ray distance loss is set to 5, is there a reason why you chose this value?

Also, is this threshold value used even when the camera parameters are initialized with identity matrix and zero vector? (Self-Calibration experiments) When the camera parameter is coarse or has a bad value, I think the proj_ray_dist loss will be much larger than 5, but wasn't it? I wonder if this threshold works.

Thank you!
question

opened by nogoing 2
susperious data leakage ?

Hi, I found that you used SuperGlue and SuperPoint as feature extractor and matcher, as far as I know, these two algorithms are trained supervisedly, Is there any suspicion of data leakage here? This approach may affect the fairness of your experiment, because the Colmap-based pose information is not data-driven, and your method somehow references external extra data unless your experimental data is entirely based on SIFT and Bfmatcher.

opened by blacksino 0
about main table 1

Thank you for sharing your code. I'm trying to reproduce the results in the main 1 table. Now I fully trained NeRF results (not 'ours' results) and all of the values are showing slightly worse than the values in the table. Following is the Test Set Results / Train Set Result / Result in the paper.

test | | psnr | ssim | lpips | prd -- | -- | -- | -- | -- | -- nerf | flower | 13.628 | 0.2909 | 0.7835 | nan nerf | fortress | 15.618 | 0.4311 | 0.6794 | nan nerf | leaves | 12.734 | 0.1451 | 0.7938 | nan nerf | trex | 12.419 | 0.3743 | 0.6729 | nan

train | | psnr | ssim | lpips | prd -- | -- | -- | -- | -- | -- nerf | flower | 13.062 | 0.2887 | 0.8028 | nan nerf | fortress | 13.539 | 0.3868 | 0.7249 | nan nerf | leaves | 12.38599 | 0.143 | 0.819662 | nan nerf | trex | 12.58406 | 0.425573 | 0.692024 | nan

paper | | psnr | ssim | lpips | prd -- | -- | -- | -- | -- | -- nerf | flower | 13.8 | 0.302 | 0.716 | nan nerf | fortress | 16.3 | 0.524 | 0.445 | nan nerf | leaves | 13.01 | 0.18 | 0.687 | nan nerf | trex | 15.7 | 0.409 | 0.575 | nan

Can I get a clue? Also, I wonder which dataset is used for the table among train/val/test set
help wanted

opened by emjay73 5

Owner

GitHub

Open source repository for the code accompanying the paper 'Non-Rigid Neural Radiance Fields Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video'.

Non-Rigid Neural Radiance Fields This is the official repository for the project "Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synt

296 Dec 29, 2022

(Arxiv 2021) NeRF--: Neural Radiance Fields Without Known Camera Parameters

NeRF--: Neural Radiance Fields Without Known Camera Parameters Project Page | Arxiv | Colab Notebook | Data Zirui Wang¹, Shangzhe Wu², Weidi Xie², Min

411 Dec 26, 2022

Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

[Unofficial code-base] NeRF--: Neural Radiance Fields Without Known Camera Parameters [ Project | Paper | Official code base ] ⬅️ Thanks the original

239 Dec 22, 2022

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields.

This repository contains the code release for Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields. This implementation is written in JAX, and is a fork of Google's JaxNeRF implementation. Contact Jon Barron if you encounter any issues.

625 Dec 30, 2022

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs Check out the paper on arXiv: https://arxiv.org/abs/2103.13744 This repo cont

373 Dec 20, 2022

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis | Project Page | Paper | PyTorch implementation for the paper "AD-NeRF: Audio

551 Dec 29, 2022

Code release for DS-NeRF (Depth-supervised Neural Radiance Fields)

Depth-supervised NeRF: Fewer Views and Faster Training for Free Project | Paper | YouTube Pytorch implementation of our method for learning neural rad

524 Jan 8, 2023

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

MINE: Continuous-Depth MPI with Neural Radiance Fields Project Page | Video PyTorch implementation for our ICCV 2021 paper. MINE: Towards Continuous D

325 Dec 29, 2022

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks Project Page | Video | Presentation | Paper | Data L

281 Dec 22, 2022

[ICCV21] Self-Calibrating Neural Radiance Fields

Related tags

Overview

Self-Calibrating Neural Radiance Fields, ICCV, 2021

Author Information

News

Overview

Method

Generic Camera Model

Projected Ray Distance

Curriculum Learning

Installation

Requirements

Environment Setup

Pretrained Weights & Qualitative Results

Datasets

Demo Code

Citing Self-Calibrating Neural Radiance Fields

Concurrent Work

Acknowledgements

Comments

Owner

Open source repository for the code accompanying the paper 'Non-Rigid Neural Radiance Fields Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video'.

(Arxiv 2021) NeRF--: Neural Radiance Fields Without Known Camera Parameters

Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields.

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Code release for DS-NeRF (Depth-supervised Neural Radiance Fields)

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

pixelNeRF: Neural Radiance Fields from One or Few Images

D-NeRF: Neural Radiance Fields for Dynamic Scenes

Code release for NeRF (Neural Radiance Fields)

A PyTorch re-implementation of Neural Radiance Fields

[ICCV'21] UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction

This is a JAX implementation of Neural Radiance Fields for learning purposes.