Invertible conditional GANs for image editing

Guim

Last update: Dec 12, 2022

Related tags

Deep Learning IcGAN

Overview

Invertible Conditional GANs

This is the implementation of the IcGAN model proposed in our paper:

Invertible Conditional GANs for image editing. November 2016.

This paper is a summarized and updated version of my master thesis, which you can find here:

Master thesis: Invertible Conditional Generative Adversarial Networks. September 2016.

The baseline used is the Torch implementation of the DCGAN by Radford et al.

Training the model
1. Face dataset: CelebA
2. Digit dataset: MNIST
Visualize the results

Requisites

Please refer to DCGAN torch repository to know the requirements and dependencies to run the code. Additionally, you will need to install the threads and optnet package:

luarocks install threads

luarocks install optnet

In order to interactively display the results, follow these steps.

1. Training the model

The IcGAN is trained in four steps.

Train the generator.
Create a dataset of generated images with the generator.
Train the encoder Z to map an image x to a latent representation z with the dataset generated images.
Train the encoder Y to map an image x to a conditional information vector y with the dataset of real images.

All the parameters of the training phase are located in cfg/mainConfig.lua.

There is already a pre-trained model for CelebA available in case you want to skip the training part. Here you can find instructions on how to use it.

1.1 Train with a face dataset: CelebA

Note: for speed purposes, the whole dataset will be loaded into RAM during training time, which requires about 10 GB of RAM. Therefore, 12 GB of RAM is a minimum requirement. Also, the dataset will be stored as a tensor to load it faster, make sure that you have around 25 GB of free space.

Preprocess

mkdir celebA; cd celebA

Download img_align_celeba.zip here under the link "Align&Cropped Images". Also, you will need to download list_attr_celeba.txt from the same link, which is found under Anno folder.

unzip img_align_celeba.zip; cd ..
DATA_ROOT=celebA th data/preprocess_celebA.lua

Now move list_attr_celeba.txt to celebA folder.

mv list_attr_celeba.txt celebA

Training

Conditional GAN: parameters are already configured to run CelebA (dataset=celebA, dataRoot=celebA).
```
 th trainGAN.lua
```

Generate encoder dataset:

 net=[GENERATOR_PATH] outputFolder=celebA/genDataset/ samples=182638 th data/generateEncoderDataset.lua

(GENERATOR_PATH example: checkpoints/celebA_25_net_G.t7)

Train encoder Z:

 datasetPath=celebA/genDataset/ type=Z th trainEncoder.lua

Train encoder Y:

 datasetPath=celebA/ type=Y th trainEncoder.lua

1.2 Train with a digit dataset: MNIST

Preprocess

Download MNIST as a luarocks package: luarocks install mnist

Training

Conditional GAN:

 name=mnist dataset=mnist dataRoot=mnist th trainGAN.lua

Generate encoder dataset:

 net=[GENERATOR_PATH] outputFolder=mnist/genDataset/ samples=60000 th data/generateEncoderDataset.lua

(GENERATOR_PATH example: checkpoints/mnist_25_net_G.t7)

Train encoder Z:

 datasetPath=mnist/genDataset/ type=Z th trainEncoder.lua

Train encoder Y:

 datasetPath=mnist type=Y th trainEncoder.lua

2 Pre-trained CelebA model:

CelebA model is available for download here. The file includes the generator and both encoders (encoder Z and encoder Y).

3. Visualize the results

For visualizing the results you will need an already trained IcGAN (i.e. a generator and two encoders). The parameters for generating results are in cfg/generateConfig.lua.

3.1 Reconstruct and modify real images

decNet=celeba_24_G.t7 encZnet=celeba_encZ_7.t7 encYnet=celeba_encY_5.t7 loadPath=[PATH_TO_REAL_IMAGES] th generation/reconstructWithVariations.lua

3.2 Swap attributes

Swap the attribute information between two pairs of faces.

decNet=celeba_24_G.t7 encZnet=celeba_encZ_7.t7 encYnet=celeba_encY_5.t7 im1Path=[IM1] im2Path=[IM2] th generation/attributeTransfer.lua

3.3 Interpolate between faces

decNet=celeba_24_G.t7 encZnet=celeba_encZ_7.t7 encYnet=celeba_encY_5.t7 im1Path=[IM1] im2Path=[IM2] th generation/interpolate.lua

Do you like or use our work? Please cite us as

@inproceedings{Perarnau2016,
  author    = {Guim Perarnau and
               Joost van de Weijer and
               Bogdan Raducanu and
               Jose M. \'Alvarez},
  title     = {{Invertible Conditional GANs for image editing}},
  booktitle   = {NIPS Workshop on Adversarial Training},
  year      = {2016},
}

Comments

I am trying to re-implement mnist generation

Hi Guim, thank you for this enlightening model. I am trying to re-implement the mnist generation image in your paper, which fixed latent code Z and modify label Y to generate different hand-written numbers.

I can see that you haven't provided instructions in README, nor could I find any pre-trained models. So right now I am using 'reconstructionWithVariations.lua' with slight modifications to support mnist. Also, I am using the generator of epoch 25 and encoder of epoch 15 for both Z and Y, trained exactly as instructed by README.

So could your please tell me if you used the code from other files? And if not, which pre-trained models did you use? Thanks a lot!

opened by chengdazhi 8
meet some errors in 'trainGAN.lua'

I get the error below on running th trainGAN.lua

/IcGAN/data/data.lua:32: attempt to compare number with nil

and the error of data.lua is in 9: function data.new(n, dataset_name, opt_)
32: if n > 0 then

I find that the n in the 'data.lua' equals nil, which is defined in trainGAN.lua by opt.nThreads 32: local data = DataLoader.new(opt.nThreads, opt.dataset, opt)

but i can't find the definition of opt.nThreads in code,then n equals nil. Please suggest what could possibly be wrong.

opened by rogerbao 3
GAN error display with different axis.

Discriminator and generator error have different error scales, so discriminator error can't be appreciated. Having two independent y-axis will make both error displays visible.
enhancement

opened by Guim3 0
smile Generation output

Can you tell where i have to change if i only want smiling face of any input image for example in attributeTransfer.lua file we pass two image and it swaps the attribute of that but what i input an single image and want only the 17th column smiling face of that. Can you suggest me how to do that,

opened by xyzdcgan 2
Changing the attributes

I love your work and I want to replicate it for my research. Which file should I modify if I want different attributes from the one in the paper? For example, changing the skin complexion and also eye sizes

opened by Zingcekhusta 1

Invertible conditional GANs for image editing

Related tags

Overview

Invertible Conditional GANs

Requisites

1. Training the model

1.1 Train with a face dataset: CelebA

Preprocess

Training

1.2 Train with a digit dataset: MNIST

Preprocess

Training

2 Pre-trained CelebA model:

3. Visualize the results

3.1 Reconstruct and modify real images

3.2 Swap attributes

3.3 Interpolate between faces

Comments

I am trying to re-implement mnist generation

meet some errors in 'trainGAN.lua'

GAN error display with different axis.

smile Generation output

Changing the attributes

Owner

Guim

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

Synthesizing and manipulating 2048x1024 images with conditional GANs

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

Collapse by Conditioning: Training Class-conditional GANs with Limited Data

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing

[CVPR2021] Invertible Image Signal Processing

[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

Editing a Conditional Radiance Field

Code for "Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks", CVPR 2021

This is the repository for paper NEEDLE: Towards Non-invertible Backdoor Attack to Deep Learning Models.

InvTorch: memory-efficient models with invertible functions

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

Image-to-Image Translation with Conditional Adversarial Networks (Pix2pix) implementation in keras

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

PyTorch implementation of "Image-to-Image Translation Using Conditional Adversarial Networks".

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.