PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

Orobix

Last update: Aug 17, 2022

Related tags

Deep Learning python cnn pytorch gan anomaly-detection vagan

Overview

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

This code aims to reproduce results obtained in the paper "Visual Feature Attribution using Wasserstein GANs" (official repo, TensorFlow code)

Description

This repository contains the code to reproduce results for the paper cited above, where the authors presents a novel feature attribution technique based on Wasserstein Generative Adversarial Networks (WGAN). The code works for both synthetic (2D) and real 3D neuroimaging data, you can check below for a brief description of the two datasets.

anomaly maps examples

Here is an example of what the generator/mapper network should produce: ctrl-click on the below image to open the gifv in a new tab (one frame every 50 iterations, left: input, right: anomaly map for synthetic data at iteration 50 * (its + 1)).

Synthetic Dataset

"Data: In order to quantitatively evaluate the performance of the examined visual attribution methods, we generated a synthetic dataset of 10000 112x112 images with two classes, which model a healthy control group (label 0) and a patient group (label 1). The images were split evenly across the two categories. We closely followed the synthetic data generation process described in [31][SubCMap: Subject and Condition Specific Effect Maps] where disease effects were studied in smaller cohorts of registered images. The control group (label 0) contained images with ran- dom iid Gaussian noise convolved with a Gaussian blurring filter. Examples are shown in Fig. 3. The patient images (label 1) also contained the noise, but additionally exhib- ited one of two disease effects which was generated from a ground-truth effect map: a square in the centre and a square in the lower right (subtype A), or a square in the centre and a square in the upper left (subtype B). Importantly, both dis- ease subtypes shared the same label. The location of the off-centre squares was randomly offset in each direction by a maximum of 5 pixels. This moving effect was added to make the problem harder, but had no notable effect on the outcome."

ADNI Dataset

Currently we only implemented training on synthetic dataset, we will work on implement training on ADNI dataset asap (but pull requests are welcome as always), we put below ADNI dataset details for sake of completeness.

"We selected 5778 3D T1-weighted MR images from 1288 subjects with either an MCI (label 0) or AD (label 1) diagnosis from the ADNI cohort. 2839 of the images were acquired using a 1.5T magnet, the remainder using a 3T magnet. The subjects are scanned at regular intervals as part of the ADNI study and a number of subjects converted from MCI to AD over the years. We did not use these cor- respondences for training, however, we took advantage of it for evaluation as will be described later. All images were processed using standard operations available in the FSL toolbox [52][Advances in functional and structural MR image analysis and implementation as FSL.] in order to reorient and rigidly register the images to MNI space, crop them and correct for field inhomogeneities. We then skull-stripped the images using the ROBEX algorithm [24][Robust brain extraction across datasets and comparison with publicly available methods]. Lastly, we resampled all images to a resolution of 1.3 mm 3 and nor- malised them to a range from -1 to 1. The final volumes had a size of 128x160x112 voxels."

"Data used in preparation of this article were obtained from the Alzheimers disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf"

Usage

Training

To train the WGAN on this task, cd into this repo's src root folder and execute:

$ python train.py

This script takes the following command line options:

dataset_root: the root directory where tha dataset is stored, default to '../dataset'
experiment: directory in where samples and models will be saved, default to '../samples'
batch_size: input batch size, default to 32
image_size: the height / width of the input image to network, default to 112
channels_number: input image channels, default to 1
num_filters_g: number of filters for the first layer of the generator, default to 16
num_filters_d: number of filters for the first layer of the discriminator, default to 16
nepochs: number of epochs to train for, default to 1000
d_iters: number of discriminator iterations per each generator iter, default to 5
learning_rate_g: learning rate for generator, default to 1e-3
learning_rate_d: learning rate for discriminator, default to 1e-3
beta1: beta1 for adam. default to 0.0
cuda: enables cuda (store True)
manual_seed: input for the manual seeds initializations, default to 7

Running the command without arguments will train the models with the default hyperparamters values (producing results shown above).

Models

We ported all models found in the original repository in PyTorch, you can find all implemented models here: https://github.com/orobix/Visual-Feature-Attribution-Using-Wasserstein-GANs-Pytorch/tree/master/src/models

Useful repositories and code

vagan-code: Reposiory for the reference paper from its authors
ganhacks: Starter from "How to Train a GAN?" at NIPS2016
WassersteinGAN: Code accompanying the paper "Wasserstein GAN"
wgan-gp: Pytorch implementation of Paper "Improved Training of Wasserstein GANs".
c3d-pytorch: Model used as discriminator in the reference paper
Pytorch-UNet: Model used as genertator in this repository
dcgan: Model used as discriminator in this repository

.bib citation

cite the paper as follows (copied-pasted it from arxiv for you):

@article{DBLP:journals/corr/abs-1711-08998,
  author    = {Christian F. Baumgartner and
               Lisa M. Koch and
               Kerem Can Tezcan and
               Jia Xi Ang and
               Ender Konukoglu},
  title     = {Visual Feature Attribution using Wasserstein GANs},
  journal   = {CoRR},
  volume    = {abs/1711.08998},
  year      = {2017},
  url       = {http://arxiv.org/abs/1711.08998},
  archivePrefix = {arXiv},
  eprint    = {1711.08998},
  timestamp = {Sun, 03 Dec 2017 12:38:15 +0100},
  biburl    = {http://dblp.org/rec/bib/journals/corr/abs-1711-08998},
  bibsource = {dblp computer science bibliography, http://dblp.org}
}

License

This project is licensed under the MIT License

You might also like...

Distributional Sliced-Wasserstein distance code

GAN encoders in PyTorch that could match PGGAN, StyleGAN v1/v2, and BigGAN. Code also integrates the implementation of these GANs.

MTV-TSA: Adaptable GAN Encoders for Image Reconstruction via Multi-type Latent Vectors with Two-scale Attentions. This is the official code release fo

37 Dec 24, 2022

Comments

Questions about the results

Hello!

I got the following picture from the last epoch.

epoch 1000 (fake image)

epoch 1000 (real Image + fake Image)

I understand that only two square boxes of images in one of the 4 x 8 images add a generated image, but I do not like the image.

The sum_samples are like images that ignore the square and add a fake image to the real image.

red : The part where the fake image is added to the real image without the area in the two squares boxes. blue : two squares boxes.

I thought only the blue box added the fake image, but I think the red box added the fake image.

I want to know how to add images.

Additionally, I would like to know how to train with ADNI data.

Thank you very much. I look forward to hearing from you.

opened by KangSH9776 2
Tanh applied only to mask rather than to final result

We found another deviation to the original implementation. The tanh non-linearity is only applied to the mask output, rather than to the generated image, i.e. tanh(m(x)) rather than tanh(m(x)+x). The mistake happens on this line: https://github.com/orobix/Visual-Feature-Attribution-Using-Wasserstein-GANs-Pytorch/blob/master/src/train.py#L63

I know you don't have the capacities to fix this at the moment, and I also don't know if I will have the time. So I am mainly posting this issue to document the problem.

Best wishes, Christian

opened by baumgach 0
Fixing bugs in data loading and cpu/gpu managment
This PR consists of three commits which roughly correspond to three bugs I found:

Remove requires_grad for network input: This was not required for anything, so I removed it, but it didn't affect the results of the model

Sample new batch for generator loss: The generator loss was calculated on the last critic batch rather than with a new batch. I changed the code to sample a new batch. But it turns out the length of the data loader happened to be exactly 100 which is equal the number of critic updates. Now that we need to sample 101 batches in total, this code was running out of samples. The fact that this is possible can be considered another bug. So I added some safe guards against this happening, and changed the dataset so that it generates data loaders of 101 batches instead.

Move images to cpu before saving: There was also an issue in the validation stage that is run every 50 epochs that was preventing any images from being written, because they were still on GPU. Not sure if this issue was introduced by a later PyTorch version (I am using 1.10.2+cu102), but the code was in fact not functional out of the box. Luckily that was a simple fix.
opened by baumgach 1
Questions about the code
Dear Orobix team,

I am the first author of the VAGAN paper. I would like to thank you one more time for porting our code to PyTorch this has been immensely useful to the community working with these types of approaches! In fact, since TensorFlow 1 has become more or less obsolete people from my own team have started working with your code. This also motivated me to have a closer look at it. I have one potential issue and a question that I would like to bring to your attention:

If I read the code correctly, you are re-using the last batch from the discriminator update for the generator update. While I don't think this would be a real issue in practice, I think to be completely correct and avoid unwanted dependencies between the two optimisation steps, a new batch should be used for the generator update. (https://github.com/orobix/Visual-Feature-Attribution-Using-Wasserstein-GANs-Pytorch/blob/master/src/train.py#L204)

I do not understand why you set the requires_grad=True for the input images. This may be down to my lack of experience with PyTorch, though. What is the motivation for this?

Happy, to make a PR if you agree with my points.

Best wishes, Christian
opened by baumgach 3

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

Related tags

Overview

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

Description

anomaly maps examples

Synthetic Dataset

ADNI Dataset

Usage

Training

Models

Useful repositories and code

.bib citation

License

You might also like...

Distributional Sliced-Wasserstein distance code

(NeurIPS 2020) Wasserstein Distances for Stereo Disparity Estimation

Code accompanying the paper "Wasserstein GAN"

Dynamical Wasserstein Barycenters for Time Series Modeling

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

[CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

A PyTorch implementation of ViTGAN based on paper ViTGAN: Training GANs with Vision Transformers.

PyTorch implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation.

GAN encoders in PyTorch that could match PGGAN, StyleGAN v1/v2, and BigGAN. Code also integrates the implementation of these GANs.

Comments

Questions about the results

Tanh applied only to mask rather than to final result

Fixing bugs in data loading and cpu/gpu managment

Questions about the code

Owner

Orobix

A pytorch implementation of Paper "Improved Training of Wasserstein GANs"

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.

This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is accepted to ICCV2021.

Official PyTorch implementation of the paper "Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN", accepted to ACM MM 2021 BNI Track.

Text mining project; Using distilBERT to predict authors in the classification task authorship attribution.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Implementation of Wasserstein adversarial attacks.

An implementation of the [Hierarchical (Sig-Wasserstein) GAN] algorithm for large dimensional Time Series Generation

pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination"