Official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"

Vincent Sitzmann

Last update: Dec 29, 2022

Overview

Light Field Networks

Project Page | Paper | Data | Pretrained Models

Vincent Sitzmann*, Semon Rezchikov*, William Freeman, Joshua Tenenbaum, Frédo Durand
MIT, *denotes equal contribution

This is the official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering".

Get started

You can set up a conda environment with all dependencies like so:

conda env create -f environment.yml
conda activate siren

High-Level structure

The code is organized as follows:

multiclass_dataio.py and dataio.py contain the data loading code for the multiclass and single-class experiments, respectively.
models.py contains the code for light field networks.
training.py contains a generic training routine.
./experiment_scripts/ contains scripts to reproduce experiments in the paper.

Reproducing experiments

The directory experiment_scripts contains one script per experiment in the paper.

train_single_class.py trains a model on classes in the Scene Representation Networks format, such as cars or chairs. Note that since these datasets have a resolution of 128, this model starts with a lower resolution (64) and then increases the resolution to 128 (see line 43 in the script).

train_nmr.py trains a model on the NMR dataset. Example calls for both scripts are:

python experiment_scripts/train_nmr.py --data_root=path_to_nmr_dataset
python experiment_scripts/train_single_class.py --data_root=path_to_single_class
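
The coarse-to-fine resolution schedule mentioned for train_single_class.py (64 first, then 128) can be pictured with the following minimal sketch. It is not the code from the script: the name sidelength_at and the step at which the resolution switches are assumptions made purely for illustration.

```python
from bisect import bisect_right

# Hypothetical schedule: (first training step, image sidelength).
# The step counts are illustrative and are NOT taken from train_single_class.py.
RESOLUTION_SCHEDULE = [(0, 64), (100_000, 128)]


def sidelength_at(step, schedule=RESOLUTION_SCHEDULE):
    """Return the image sidelength to train at for a given global step."""
    starts = [start for start, _ in schedule]
    # Index of the last stage whose start step is <= step.
    stage = bisect_right(starts, step) - 1
    if stage < 0:
        raise ValueError("step precedes the first stage of the schedule")
    return schedule[stage][1]


if __name__ == "__main__":
    for step in (0, 99_999, 100_000):
        print(step, sidelength_at(step))
```

Training at the lower resolution first keeps early iterations cheap, and the schedule only decides which resolution the data loader should serve at each step.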

To reconstruct test objects, use the scripts rec_single_class.py and rec_nmr.py. In addition to the data root, you have to point these scripts to the checkpoint from the training run. Note that rec_nmr.py uses the viewlist under ./experiment_scripts/viewlists/src_dvr.txt to pick which views to reconstruct the objects from, while rec_single_class.py by default reconstructs from the view with index 64.

python experiment_scripts/rec_nmr.py --data_root=path_to_nmr_dataset --checkpoint=path_to_training_checkpoint
python experiment_scripts/rec_single_class.py --data_root=path_to_single_class_TEST_SET --checkpoint=path_to_training_checkpoint

Finally, you can evaluate the models on the test set with the test.py script, which is used for testing all models. You have to pass it the dataset you are reconstructing as a parameter ("NMR" or not; the calls below use "single" for the single-class case). For the NMR dataset, you need to pass the "viewlist" again to make sure that the model is not evaluated on the context view.

python experiment_scripts/test.py --data_root=path_to_nmr_dataset --dataset=NMR --checkpoint=path_to_rec_checkpoint
python experiment_scripts/test.py --data_root=path_to_single_class_TEST_SET --dataset=single --checkpoint=path_to_rec_checkpoint

To monitor progress, both the training and reconstruction scripts write tensorboard summaries into a "summaries" subdirectory in the logging_root.

Bells & whistles

This code has a bunch of options that were not discussed in the paper.

Switch between a ReLU network and a SIREN to better fit high-frequency content with the flag --network (see the init of models.py for options).
Switch between a hypernetwork, conditioning via concatenation, and low-rank conditioning with the flag --conditioning.
There is an implementation of encoder-based inference in models.py (LFEncoder), which uses a ResNet18 with global conditioning to generate the latent codes z.

Data

We use two types of datasets: class-specific ones and multi-class ones.

The class-specific ones are loaded via dataio.py, and we provide them in a fast-to-load hdf5 format in our Google Drive.
For the multiclass experiment, we use the ShapeNet 64x64 dataset from NMR https://s3.eu-central-1.amazonaws.com/avg-projects/differentiable_volumetric_rendering/data/NMR_Dataset.zip (hosted by the DVR authors). Additionally, you will need to place the intrinsics.txt file, which you can find in the Google Drive, in the NMR root directory.

Coordinate and camera parameter conventions

This code uses an "OpenCV" style camera coordinate system, where the Y-axis points downwards (the up-vector points in the negative Y-direction), the X-axis points right, and the Z-axis points into the image plane. Camera poses are assumed to be in a "camera2world" format, i.e., they denote the matrix transform that transforms camera coordinates to world coordinates.
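
As a minimal sketch of this convention, the snippet below applies a camera2world matrix to the camera origin and to the camera axes, and inverts it to obtain the corresponding world2camera transform. The pose values and the helper names mat_vec and invert_rigid are hypothetical and do not come from this repository; plain Python lists are used so that the example has no dependencies.

```python
def mat_vec(m, v):
    """Multiply a matrix (list of rows) with a vector."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def invert_rigid(cam2world):
    """Invert a 4x4 rigid transform [R|t], giving [R^T | -R^T t]."""
    r = [row[:3] for row in cam2world[:3]]
    t = [row[3] for row in cam2world[:3]]
    r_t = [[r[j][i] for j in range(3)] for i in range(3)]
    t_inv = [-sum(r_t[i][j] * t[j] for j in range(3)) for i in range(3)]
    return [r_t[i] + [t_inv[i]] for i in range(3)] + [[0.0, 0.0, 0.0, 1.0]]


# Hypothetical pose: camera placed at world position (0, 0, -2),
# with its axes aligned to the world axes.
cam2world = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, -2.0],
    [0.0, 0.0, 0.0, 1.0],
]

# OpenCV-style camera axes: +X right, +Y down, +Z into the image plane.
origin_cam = [0.0, 0.0, 0.0, 1.0]   # a point (w = 1)
forward_cam = [0.0, 0.0, 1.0, 0.0]  # a direction (w = 0, ignores translation)
up_cam = [0.0, -1.0, 0.0, 0.0]      # the up-vector is the negative Y-direction

camera_position_world = mat_vec(cam2world, origin_cam)[:3]
view_direction_world = mat_vec(cam2world, forward_cam)[:3]
up_world = mat_vec(cam2world, up_cam)[:3]

# The inverse maps world coordinates back into the camera frame.
world2cam = invert_rigid(cam2world)
world_origin_in_cam = mat_vec(world2cam, [0.0, 0.0, 0.0, 1.0])[:3]

if __name__ == "__main__":
    print(camera_position_world, view_direction_world, up_world)
    print(world_origin_in_cam)
```

With this pose, the world origin ends up at positive Z in camera coordinates, i.e. in front of the camera. If your poses are stored in world2camera form instead, they would need to be inverted before being passed to code that expects camera2world.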

Misc

Citation

If you find our work useful in your research, please cite:

@inproceedings{sitzmann2021lfns,
               author = {Sitzmann, Vincent
                         and Rezchikov, Semon
                         and Freeman, William T.
                         and Tenenbaum, Joshua B.
                         and Durand, Fredo},
               title = {Light Field Networks: Neural Scene Representations
                        with Single-Evaluation Rendering},
               booktitle = {Proc. NeurIPS},
               year={2021}
            }

Contact

If you have any questions, please email Vincent Sitzmann at [email protected].

Comments

class-specific custom datasets

👋 Any pointers on how class-specific datasets are structured, or how they were created... or to the urls for those? atm I can't find them in the shared drive folder. Thank you!

opened by xyzalanix 2

CVE-2007-4559 Patch

Patching CVE-2007-4559

Hi, we are security researchers from the Advanced Research Center at Trellix. We have begun a campaign to patch a widespread bug named CVE-2007-4559. CVE-2007-4559 is a 15-year-old bug in the Python tarfile package. By using extract() or extractall() on a tarfile object without sanitizing input, a maliciously crafted .tar file could perform a directory path traversal attack. We found at least one unsanitized extractall() in your codebase and are providing a patch for you via pull request. The patch essentially checks to see if all tarfile members will be extracted safely and throws an exception otherwise. We encourage you to use this patch or your own solution to secure against CVE-2007-4559. Further technical information about the vulnerability can be found in this blog.

If you have further questions, you may contact us through this project's lead researcher, Kasimir Schulz.
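
The kind of check described in this comment can be sketched as follows. This is an illustration only, not the patch from the pull request: the helper names is_within_directory and safe_extractall are assumptions, and the sketch only rejects members whose names resolve outside the target directory (it does not inspect symlinks).

```python
import os
import tarfile


def is_within_directory(directory, target):
    """Return True if target resolves to a path inside directory."""
    abs_directory = os.path.abspath(directory)
    abs_target = os.path.abspath(target)
    return os.path.commonpath([abs_directory, abs_target]) == abs_directory


def safe_extractall(tar, path="."):
    """Extract all members, refusing archives that escape the target path."""
    for member in tar.getmembers():
        member_path = os.path.join(path, member.name)
        if not is_within_directory(path, member_path):
            raise ValueError(
                f"Blocked path traversal in tar member: {member.name}"
            )
    # Every member name was checked above, so extraction stays inside path.
    tar.extractall(path)
```

A call such as safe_extractall(tarfile.open("archive.tar"), "output_dir") (with hypothetical file names) would then replace a bare extractall(). On Python versions that support extraction filters, tar.extractall(path, filter="data") is a built-in alternative.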

opened by TrellixVulnTeam 0

Equation (5) seems wrong?

E = [R|t] is the transformation matrix from world coordinates to camera coordinates, so the direction (d) of the light should be: and the camera position (t) should be: so the Plücker coordinates are (d, d x t)?

opened by caiyongqi 1

Pretrained model and datasets

In the "Pretrained Model" Google Drive folder, the cars_rec.pth files actually contain models trained on chairs.

The cars test set also differs slightly from the car sets that other methods use from ShapeNet-V2. For example, a car with the same object name sometimes has a different color than in the data provided by other methods such as PixelNeRF. I am not sure whether the wrong version of the hdf5 data was provided. Please check, thanks.

opened by violet257 0

Pretrained Model

Hello, I think that in the shared Google Drive folder "Pretrained Model", the four rec.pth files (two cars_rec.pth and two chairs_rec.pth) all refer to the rec.pth of chairs, not cars. There is also no NMR_rec.pth file. Could you check it? Thanks.

opened by HaoRiziyou 0

training time and parameters for single class example

Hello @vsitzmann and co-authors, very nice work and code!

I am trying to replicate the training results for the single_class example on a 32GB Tesla GPU server with cars_train.hdf5. However, the training time seems to be huge with the default options (more than 5000h, which is far greater than the 3 days of usage).

I have looked at the implementation details you provided in the supplement, but I am a bit confused about the epochs and max iterations for this case (first 200K). Currently it shows more than 19.2M iterations per epoch. Could you please provide the Python call with parameters, or tell me whether the default parameters inside the script are correct? Thanks!

opened by renatojmsdh 0
