The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)

xuxudong

Last update: Nov 10, 2022

Overview

Generative Occupancy Fields for 3D Surface-Aware Image Synthesis (NeurIPS 2021)

Project Page | Paper

Xudong Xu, Xingang Pan, Dahua Lin and Bo Dai

GOF can synthesize high-quality images with high 3D consistency and simultaneously learn compact and smooth object surfaces.

Requirements

Python 3.8 is used. Basic requirements are listed in requirements.txt:

pip install -r requirements.txt

Training

We provide several bash files for the BFM, CelebA, and Cats datasets in auto_bash for reference. The hyperparameters adopted in our paper are listed in the curriculums.py file.

If you want to train with your own dataset, set the hyperparameters carefully, especially those related to the camera pose distribution. As with the settings in the curriculums.py file, you can use a camera pose predictor to obtain rough values for 'h_stddev' and 'v_stddev', then tune them according to the resulting performance. You should also add a dataset class to dataset.py and modify the reference bash file to fit your own dataset.
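As a rough illustration of the pose-statistics step, the sketch below estimates the two spreads from a list of predicted angles. It assumes the pose predictor returns one yaw and one pitch per image, in radians, and that 'h_stddev' and 'v_stddev' are expressed in the same unit; the function name and the sample values are hypothetical and not part of this codebase.

```python
import statistics


def estimate_pose_stddev(yaws, pitches):
    """Return rough horizontal/vertical pose spreads from predicted angles."""
    if len(yaws) < 2 or len(pitches) < 2:
        raise ValueError("need at least two pose estimates per axis")
    return {
        # Population standard deviation of the horizontal (yaw) angles.
        "h_stddev": statistics.pstdev(yaws),
        # Population standard deviation of the vertical (pitch) angles.
        "v_stddev": statistics.pstdev(pitches),
    }


# Hypothetical predictor outputs (assumed to be radians) for a few images.
yaws = [-0.3, -0.1, 0.0, 0.1, 0.3]
pitches = [-0.1, 0.0, 0.0, 0.0, 0.1]

pose_stats = estimate_pose_stddev(yaws, pitches)
print(pose_stats)
```

The resulting numbers are only a starting point; they would still need to be tuned against the training results as described above.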

Evaluation

Evaluation Metrics

To calculate FID/IS/KID scores, please run

python eval_metrics.py path/to/generator.pth --real_image_dir path/to/real_images --curriculum CURRICULUM

To calculate the weighted variance proposed in the paper, please run

python cal_weighted_var.py path/to/generator.pth --curriculum CURRICULUM

Render Multi-view Images

python render_multiview_images.py path/to/generator.pth --curriculum CURRICULUM --seeds_start 0 --seeds_end 100

Render Videos

python render_video.py path/to/generator.pth --curriculum CURRICULUM --seed 0

After running, you will obtain a series of images in a specific folder. You can then convert them into a video with ffmpeg:

ffmpeg -r 15 -f image2 -i xxx.png -c:v libx264 -crf 25 -pix_fmt yuv420p xxx.mp4
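If you prefer to drive the conversion from Python, the ffmpeg call above can be wrapped as in the sketch below. The flags are the ones from the command above; the helper names, the frames/%04d.png pattern, and the output path are hypothetical, so substitute the folder and file naming that your run actually produced.

```python
import shlex
import subprocess


def build_ffmpeg_command(frame_pattern, output_path, fps=15, crf=25):
    """Build the ffmpeg argument list used to encode rendered frames."""
    return [
        "ffmpeg",
        "-r", str(fps),         # input frame rate
        "-f", "image2",         # read a sequence of image files
        "-i", frame_pattern,    # input frames, e.g. a printf-style pattern
        "-c:v", "libx264",      # H.264 encoder
        "-crf", str(crf),       # constant rate factor (lower = higher quality)
        "-pix_fmt", "yuv420p",  # widely compatible pixel format
        output_path,
    ]


def encode_video(frame_pattern, output_path, fps=15, crf=25):
    """Run ffmpeg; raises CalledProcessError if ffmpeg exits with an error."""
    command = build_ffmpeg_command(frame_pattern, output_path, fps, crf)
    subprocess.run(command, check=True)


# Hypothetical paths, shown only to print the resulting command line.
command = build_ffmpeg_command("frames/%04d.png", "out.mp4")
print(shlex.join(command))
```

Calling encode_video with the same arguments runs the command; it requires ffmpeg to be installed and on the PATH.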

Similarly, you can render videos that interpolate between given latent codes/seeds:

python render_video_interpolation.py path/to/generator.pth --curriculum CURRICULUM --seeds 0 1 2 3

Extract 3D Shapes

You should first generate a voxel npy file by running:

python extract_shapes.py path/to/generator.pth --curriculum CURRICULUM --seed 0

Then render the voxel file to the corresponding multi-view images with the render_meshimg.py script.

Pretrained Models

We provide pretrained models for BFM, CelebA, and Cats. Please refer to this link.

As mentioned in the supplementary material, the training of all models starts from an early (about 2K iterations) pretrained model with the correct outward-facing faces. We also provide the early pretrained models for the three datasets in this link. If you want to start from the early pretrained models, replace the 'load_dir' value in the bash files in auto_bash with the path to the corresponding pretrained model. Since the optimizer parameters are not provided here, you may need to comment out L138~139.

Citation

If you find this codebase useful for your research, please cite:

@inproceedings{xu2021generative,
  title={Generative Occupancy Fields for 3D Surface-Aware Image Synthesis},
  author={Xu, Xudong and Pan, Xingang and Lin, Dahua and Dai, Bo},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

Acknowledgement

The structure of this codebase is borrowed from pi-GAN.

Comments

Additional Results on CARLA

Congrats on this great work, and thanks a lot for open-sourcing the code! I am having some trouble with the Carla dataset. Where did you get the Carla dataset? I can't find pi-GAN's Carla dataset, and it seems different from GRAF's. Can you provide me with a link to download the Carla dataset used in your paper? Thanks a lot in advance.

opened by wasdzzzzzz 6

AssertionError: No inf checks were recorded for this optimizer.

Hello, when trying to train the model by myself, I met the following error:

Traceback (most recent call last):
  File ".../site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap
    fn(i, *args)
  File ".../GOF_NeurIPS2021/train.py", line 340, in train
    scaler.step(optimizer_G)
  File ".../site-packages/torch/cuda/amp/grad_scaler.py", line 337, in step
    assert len(optimizer_state["found_inf_per_device"]) > 0, "No inf checks were recorded for this optimizer."

The environment is the same as in requirements.txt (incidentally, should the package name mcubes be PyMCubes?). I tried commenting out that line in grad_scaler.py. Training now runs, but the results do not seem to converge (the output is still random noise after around 30000 steps). Any help would be appreciated!

opened by ashawkey 6

Scripts of rendering mesh

Hi, thanks for releasing the code. I notice that the mesh rendering script only supports rotation in yaw. Could you please release scripts that also support rotation in pitch, as in the GIF on your project page?

opened by MrTornado24 2

Hi, I have some questions about the preprocessing of the CelebA dataset

Congrats on this great work, and thanks a lot for open-sourcing the code! In the paper, you mention that “we crop all images in CelebA from the top of the hair to the bottom of the chin as a pre-processing step.” Can you provide the specific steps or scripts for this pre-processing? Thanks a lot in advance.

opened by wasdzzzzzz 2

Possible error in the Opacity Regularization formula in the paper.

First of all, thank you for sharing this brilliant work. I discovered a possible error in the Opacity Regularization formula in the paper while trying to implement this algorithm. The problem is that the opacity term described in the paper approaches negative infinity. According to the definition of entropy, I think the formula should be something like
$-\sum \left[ \alpha \log_2(\alpha) + (1 - \alpha) \log_2(1-\alpha) \right]$

opened by yzhq97 2
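The entropy form suggested in the comment above can be checked numerically. The sketch below implements the standard binary entropy in bits for a single opacity value alpha in [0, 1] and sums it over a few sample values. It illustrates the commenter's proposal only and is not taken from the paper or from this codebase; the function names are hypothetical.

```python
import math


def binary_entropy(alpha):
    """Binary entropy in bits of an opacity value alpha in [0, 1]."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    # By convention 0 * log2(0) = 0, so the end points have zero entropy.
    if alpha == 0.0 or alpha == 1.0:
        return 0.0
    return -(alpha * math.log2(alpha) + (1.0 - alpha) * math.log2(1.0 - alpha))


def opacity_entropy(alphas):
    """Sum of per-sample binary entropies; small when opacities are near 0 or 1."""
    return sum(binary_entropy(a) for a in alphas)


# Hypothetical opacity samples: nearly binary vs. maximally uncertain.
sharp = [0.0, 0.01, 0.99, 1.0]
blurry = [0.5, 0.5, 0.5, 0.5]
print(opacity_entropy(sharp), opacity_entropy(blurry))
```

Each term is bounded between 0 and 1 bit, so the sum cannot diverge to negative infinity, and it is smallest when every opacity is close to 0 or 1.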

Could you please add the hyperparameters for the Carla dataset to curriculums.py?

I ran experiments with hyperparameters guessed from the settings of the other three datasets, but could not reproduce the results in the paper.

opened by abcddcbaabcddcba 0
