[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

GenForce: May Generative Force Be with You

Last update: Dec 25, 2022

Related tags

Deep Learning insgen

Overview

InsGen - Data-Efficient Instance Generation from Instance Discrimination

Data-Efficient Instance Generation from Instance Discrimination
Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou
arXiv preprint arXiv: 2106.04566

[Paper] [Project Page]

In this work, we develop a novel data-efficient Instance Generation (InsGen) method for training GANs with limited data. With the instance discrimination as an auxiliary task, our method makes the best use of both real and fake images to train the discriminator. The discriminator in turn guides the generator to synthesize as many diverse images as possible. Experiments under different data regimes show that InsGen brings a substantial improvement over the baseline in terms of both image quality and image diversity, and outperforms previous data augmentation algorithms by a large margin.

Qualitative results

Here we provide some synthesized samples with different numbers of training images and correspoding FID. Full codebase and weights are coming soon.

Inference

Here, all pretrained models can be downloaded from Google Drive:

Model	FID	Link
AFHQ512-CAT	2.60	link
AFHQ512-DOG	5.44	link
AFHQ512-WILD	1.77	link

Model	FID	Link
FFHQ256-2K	11.92	link
FFHQ256-10K	4.90	link
FFHQ256-140K	3.31	link

You can download one of them and put it under MODEL_ZOO directory, then synthesize images via

# Generate AFHQ512-CAT with truncation.
python generate.py --network=${MODEL_ZOO}/afhqcat.pkl \
                   --outdir=${TARGET_DIR} \
                   --trunc=0.7 \
                   --seeds=0-10

Training

This repository is built based on styleGAN2-ada-pytorch. Therefore, please prepare datasets first use dataset_tool.py. On top of Generative Adversarial Networks (GANs), we introduce contrastive loss into the training of discriminator, following MoCo. Concretely, the discriminator is used to extract features from images (either real or synthesized) and then trained with an auxiliary task by distinguishing every individual image.

As described in training/contrastive_head.py, we add two addition heads on top of the original discriminator. These two heads are used to project features extracted from real and fake data onto a unit ball respectively. More details can be found in paper. Note that InsGen can be easily applied to any GAN model by merely introducing two contrastive heads. According to MoCo, the feature extractor should be updated in a momentum manner. Here, in InsGen, the contrastive heads are updated in the forward() function, while the discriminator is updated in training/training_loop.py (see D_ema).

Please use the following command to start your own training:

python train.py --gpus=8 \
                --data=${DATA_PATH} \
                --cfg=paper256 \
                --outdir=training_example

In this example, the results are saved to a created director training_example. --cfg specifies the training configuration, which can be further customized with additional options:

--no_insgen disables InsGen, back to original StyleGAN2-ADA.
--rqs overrides the number of real image queue size. (default: 5% of the total number of training samples)
--fqs overrides the number of fake image queue size. More samples are beneficial, especially when the training samples are limited. (default: 5% of the total number of training samples)
--gamma overrides the R1 gamma (i.e., gradient penalty). As described in styleGAN2-ada-pytorch, training can be sensitive to this hyper-parameter. It would be better to try some different values. Here, we recommend using a smaller one than that in original StyleGAN2-ADA.

More functions would be supported after this projest is merged into our genforce. Please stay tuned!

License

This work is made available under the Nvidia Source Code License.

Acknowledgements

We thank Janne Hellsten and Tero Karras for the pytorch version codebase of their styleGAN2-ada-pytorch.

BibTeX

@article{yang2021insgen,
  title   = {Data-Efficient Instance Generation from Instance Discrimination},
  author  = {Yang, Ceyuan and Shen, Yujun and Xu, Yinghao and Zhou, Bolei},
  journal = {arXiv preprint arXiv:2106.04566},
  year    = {2021}
}

Comments

Error while resuming training！

Thank you for your excellent job！ When I was used " --resume" to resume training, I received annoying error: Constructing networks... Resuming from "/content/drive/MyDrive/FakeCLR-main/training-runs-7k/00002-pv-lianpu7k-paper256-gamma10-kimg5000-batch8/network-snapshot-000200.pkl" Traceback (most recent call last): File "train.py", line 583, in <module> main() # pylint: disable=no-value-for-parameter File "/usr/local/lib/python3.8/site-packages/click/core.py", line 1130, in __call__ return self.main(*args, **kwargs) File "/usr/local/lib/python3.8/site-packages/click/core.py", line 1055, in main rv = self.invoke(ctx) File "/usr/local/lib/python3.8/site-packages/click/core.py", line 1404, in invoke return ctx.invoke(self.callback, **ctx.params) File "/usr/local/lib/python3.8/site-packages/click/core.py", line 760, in invoke return __callback(*args, **kwargs) File "/usr/local/lib/python3.8/site-packages/click/decorators.py", line 26, in new_func return f(get_current_context(), *args, **kwargs) File "train.py", line 576, in main subprocess_fn(rank=0, args=args, temp_dir=temp_dir) File "train.py", line 421, in subprocess_fn training_loop.training_loop(rank=rank, **args) File "/content/drive/MyDrive/FakeCLR-main/training/training_loop.py", line 172, in training_loop misc.copy_params_and_buffers(resume_data[name], module, require_all=False) File "/content/drive/MyDrive/FakeCLR-main/torch_utils/misc.py", line 160, in copy_params_and_buffers tensor.copy_(src_tensors[name].detach()).requires_grad_(tensor.requires_grad) RuntimeError: a leaf Variable that requires grad is being used in an in-place operation.

Sincerely hope to get your help!

opened by 49xxy 0
KeyError : 'D_ema' : error when training with --resume options

Hi, thanks for your wonderful work.

I have some issues when I try to use my custom dataset.

First. the command that you mentioned on README works for my custom dataset. python train.py --gpus=8
--data=${DATA_PATH}
--cfg=paper256
--outdir=training_example

However, when I try to use pretrained model(ffhq256) and retrain with my custom dataset it gave me some error message. The command that I tired is CUDA_VISIBLE_DEVICES=0,1 python3 train.py --gpus=2 --data=${DATA_PATH} --cfg=paper256 --resume=ffhq256 --outdir=training_example

And the error message that I got is this

I hope that I can get any solution!

Thanks!

opened by LeoAlethio 2
How to train insgen with only 1 GPU

thanks for your sharing, but I only have 1 GPU ， it can not be trained

I see the reason why need multi-GPU is for ‘effect of disabling shuffle BN to MoCo’

but I can not understand why must shuffle batch data among all gpus not only in GPU？

would you provide a way to shuffle batch date on 1 GPU， it can be not ‘effect’ ？

opened by Johnson-yue 5
Training set selection

Hello, Thanks for this awesome work, it really pushes the envelope for generation with few data. Would you mind providing the random indices used for training each subset for the ffhq models for example?

opened by ryanwebster90 1

Training does not work with 1 GPU

There seems to be a problem with the contrastive loss when using 1 GPU to train, training only works when setting no_insgen=true.

The output is:

Setting up augmentation...
Distributing across 1 GPUs...
Distributing Contrastive Heads across 1 GPUS...
Setting up training phases...
Setting up contrastive training phases...
Exporting sample images...
Initializing logs...
2021-09-18 04:23:26.767334: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
Training for 25000 kimg...

Traceback (most recent call last):
  File "train.py", line 583, in <module>
    main() # pylint: disable=no-value-for-parameter
  File "/usr/lib/python3/dist-packages/click/core.py", line 764, in __call__
    return self.main(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/click/core.py", line 717, in main
    rv = self.invoke(ctx)
  File "/usr/lib/python3/dist-packages/click/core.py", line 956, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/lib/python3/dist-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/click/decorators.py", line 17, in new_func
    return f(get_current_context(), *args, **kwargs)
  File "train.py", line 576, in main
    subprocess_fn(rank=0, args=args, temp_dir=temp_dir)
  File "train.py", line 421, in subprocess_fn
    training_loop.training_loop(rank=rank, **args)
  File "/home/katarina/ML/insgen/training/training_loop.py", line 326, in training_loop
    loss.accumulate_gradients(phase=phase.name, real_img=real_img, real_c=real_c, gen_z=gen_z, gen_c=gen_c, sync=sync, gain=gain, cl_phases=cl_phases, D_ema=D_ema, g_fake_cl=not no_cl_on_g, **cl_loss_weight)
  File "/home/katarina/ML/insgen/training/contrastive_loss.py", line 156, in accumulate_gradients
    loss_Dreal = loss_Dreal + lw_real_cl * self.run_cl(real_img_tmp, real_c, sync, Dphase.module, D_ema, loss_name='D_cl')
  File "/home/katarina/ML/insgen/training/contrastive_loss.py", line 71, in run_cl
    loss = contrastive_head(logits0, logits1, loss_only=loss_only, update_q=update_q)
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1051, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/katarina/ML/insgen/training/contrastive_head.py", line 183, in forward
    self._dequeue_and_enqueue(k)
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 28, in decorate_context
    return func(*args, **kwargs)
  File "/home/katarina/ML/insgen/training/contrastive_head.py", line 51, in _dequeue_and_enqueue
    keys = concat_all_gather(keys)
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 28, in decorate_context
    return func(*args, **kwargs)
  File "/home/katarina/ML/insgen/training/contrastive_head.py", line 197, in concat_all_gather
    for _ in range(torch.distributed.get_world_size())]
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 748, in get_world_size
    return _get_group_size(group)
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 274, in _get_group_size
    default_pg = _get_default_group()
  File "/home/katarina/.local/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 358, in _get_default_group
    raise RuntimeError("Default process group has not been initialized, "
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.

opened by kata44 6

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

Related tags

Overview

InsGen - Data-Efficient Instance Generation from Instance Discrimination

Qualitative results

Inference

Training

License

Acknowledgements

BibTeX

Comments

Error while resuming training！

KeyError : 'D_ema' : error when training with --resume options

How to train insgen with only 1 GPU

Training set selection

Training does not work with 1 GPU

Owner

GenForce: May Generative Force Be with You

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

Listing arxiv - Personalized list of today's articles from ArXiv

Arxiv harvester - Poor man's simple harvester for arXiv resources

ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)

Source Code for DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances (https://arxiv.org/pdf/2012.01775.pdf)

Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

Implementation of Memory-Efficient Neural Networks with Multi-Level Generation, ICCV 2021

The official implementation of NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation [ICLR-2021]. https://arxiv.org/pdf/2101.12378.pdf

This is an official implementation of our CVPR 2021 paper "Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression" (https://arxiv.org/abs/2104.02300)

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

(Arxiv 2021) NeRF--: Neural Radiance Fields Without Known Camera Parameters

Investigating Attention Mechanism in 3D Point Cloud Object Detection (arXiv 2021)

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Efficient-GlobalPointer - Pytorch Efficient GlobalPointer

Efficient 6-DoF Grasp Generation in Cluttered Scenes

ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"