Official PyTorch implementation of C3-GAN
Paper: Contrastive Fine-grained Class Clustering via Generative Adversarial Networks
Authors: Yunji Kim, Jung-Woo Ha
Abstract
Unsupervised fine-grained class clustering is a practical yet challenging task due to the difficulty of learning feature representations of subtle object details. We introduce C3-GAN, a method that leverages the categorical inference power of InfoGAN by applying contrastive learning. We aim to learn feature representations that encourage the data to form distinct cluster boundaries in the embedding space, while also maximizing the mutual information between the latent code and its observation. Our approach is to train the discriminator, which is used for inferring clusters, to optimize the contrastive loss, where the image-latent pairs that maximize the mutual information are considered positive pairs and the rest negative pairs. Specifically, we map the input of the generator, which was sampled from the categorical distribution, to the embedding space of the discriminator and let it act as a cluster centroid. In this way, C3-GAN learns a clustering-friendly embedding space where each cluster is distinctively separable. Experimental results show that C3-GAN achieves state-of-the-art clustering performance on four fine-grained benchmark datasets, while also alleviating the mode collapse phenomenon.
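For intuition, below is a minimal conceptual sketch of the contrastive objective described above, not the repository's actual implementation: the embedded categorical codes act as cluster centroids, the centroid matching each image's latent code is the positive, and all other centroids are negatives. All names and dimensions are illustrative.

```python
import torch.nn.functional as F

def contrastive_cluster_loss(img_emb, centroid_emb, code_idx, tau=0.1):
    """img_emb:      (B, D) discriminator embeddings of generated images
    centroid_emb: (K, D) embedded categorical codes, acting as cluster centroids
    code_idx:     (B,)   index of the latent code each image was generated from
    """
    img_emb = F.normalize(img_emb, dim=1)
    centroid_emb = F.normalize(centroid_emb, dim=1)
    # Cosine similarity of each image to every centroid; the centroid of the
    # code the image was generated from is the positive, the rest are negatives.
    logits = img_emb @ centroid_emb.t() / tau  # (B, K)
    return F.cross_entropy(logits, code_idx)
```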
I. To-do list before you run the code
The initial code is optimized for the CUB dataset.
※ Hyperparameter settings
You can adjust the values of various hyperparameters, such as the number of clusters and the degree of perturbation, in the config.py file.
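For illustration, if config.py follows the common easydict pattern, overriding values might look like the snippet below; the attribute names here are assumptions, not the repository's actual keys.

```python
from config import cfg  # assumes config.py exposes an easydict-style cfg

cfg.GAN.K = 200          # hypothetical key: number of clusters (e.g., 200 for CUB)
cfg.TRAIN.PERTURB = 0.2  # hypothetical key: degree of perturbation
```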
※ Annotate data for evaluation
Each image must be annotated with its ground-truth class label in order to evaluate Accuracy (ACC) and Normalized Mutual Information (NMI) scores. The class information should be given as an integer. Please check out the sample files in data/cub. You may also have to adjust the datasets.py file depending on where you saved the image files and how you made the annotation files.
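For reference, these scores can be computed from such integer labels in the standard way: ACC via Hungarian matching and NMI via scikit-learn. This is the conventional evaluation recipe, not necessarily the exact code used by this repository.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from sklearn.metrics import normalized_mutual_info_score

def clustering_acc(y_true, y_pred):
    """ACC: best one-to-one matching between cluster ids and class labels."""
    k = max(y_true.max(), y_pred.max()) + 1
    cost = np.zeros((k, k), dtype=np.int64)
    for t, p in zip(y_true, y_pred):
        cost[p, t] += 1  # count co-occurrences of (cluster, class)
    # Hungarian matching maximizes the number of correctly assigned samples.
    row, col = linear_sum_assignment(cost.max() - cost)
    return cost[row, col].sum() / y_pred.size

y_true = np.array([0, 0, 1, 1, 2, 2])  # toy ground-truth labels
y_pred = np.array([1, 1, 0, 0, 2, 2])  # toy cluster predictions
print(clustering_acc(y_true, y_pred))                # 1.0
print(normalized_mutual_info_score(y_true, y_pred))  # 1.0
```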
II. Train
Once you have set all the arguments in the config.py file, the training code can be run with the simple command below.
python train.py
※ Pre-trained model for CUB
To load the parameters of the pre-trained model, set the value of cfg.OVER to '2' and set cfg.MODEL_PATH to wherever you saved the file.
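A minimal sketch of how this could look in practice; the checkpoint path and key names below are assumptions, so check the released file for its actual structure.

```python
import torch
from config import cfg

cfg.OVER = '2'
cfg.MODEL_PATH = './models/c3gan_cub.pth'  # hypothetical path: wherever you saved the file

checkpoint = torch.load(cfg.MODEL_PATH, map_location='cpu')
# netG.load_state_dict(checkpoint['netG'])  # the 'netG' key is an assumption
```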
III. Results
※ Fine-grained Class Clustering Results
| Method | Acc (Bird) | Acc (Car) | Acc (Dog) | Acc (Flower) | NMI (Bird) | NMI (Car) | NMI (Dog) | NMI (Flower) |
|---|---|---|---|---|---|---|---|---|
| IIC | 7.4 | 4.9 | 5.0 | 8.7 | 0.36 | 0.27 | 0.18 | 0.24 |
| SimCLR + k-Means | 8.4 | 6.7 | 6.8 | 12.5 | 0.40 | 0.33 | 0.19 | 0.29 |
| InfoGAN | 8.6 | 6.5 | 6.4 | 23.2 | 0.39 | 0.31 | 0.21 | 0.44 |
| FineGAN | 6.9 | 6.8 | 6.0 | 8.1 | 0.37 | 0.33 | 0.22 | 0.24 |
| MixNMatch | 10.2 | 7.3 | 10.3 | 39.0 | 0.41 | 0.34 | 0.30 | 0.57 |
| SCAN | 11.9 | 8.8 | 12.3 | 56.5 | 0.45 | 0.38 | 0.35 | 0.77 |
| C3-GAN | 27.6 | 14.1 | 17.9 | 67.8 | 0.53 | 0.41 | 0.36 | 0.67 |
※ Image Generation Results
Conditional Generation
Images synthesized with the predicted cluster indices of given real images.
Random Generation
Images synthesized by randomly sampling the latent code c and the noise variable z.
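For illustration, sampling these inputs typically looks like the sketch below under an InfoGAN-style interface; the generator call and dimensions are assumptions, not the repository's exact API.

```python
import torch

K, Z_DIM, BATCH = 200, 128, 16                # assumed dimensions
c = torch.eye(K)[torch.randint(K, (BATCH,))]  # latent code c: one-hot samples from Cat(K)
z = torch.randn(BATCH, Z_DIM)                 # noise variable z ~ N(0, I)
# fake_images = netG(z, c)                    # hypothetical generator call
```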
※※ bibtex
@article{kim2021c3gan,
  title={Contrastive Fine-grained Class Clustering via Generative Adversarial Networks},
  author={Kim, Yunji and Ha, Jung-Woo},
  journal={arXiv},
  year={2021}
}
※※ Acknowledgement
This code was developed from the released source code of FineGAN: Unsupervised Hierarchical Disentanglement for Fine-grained Object Generation and Discovery.
License
Copyright 2022-present NAVER Corp.
Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:
* Redistributions of source code must retain the above copyright notice, this
list of conditions and the following disclaimer.
* Redistributions in binary form must reproduce the above copyright notice,
this list of conditions and the following disclaimer in the documentation
and/or other materials provided with the distribution.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.