GAN example for Keras. Cuz MNIST is too small and there should be something more realistic.

Last update: Sep 20, 2022

Keras-GAN-Animeface-Character

GAN example for Keras. Cuz MNIST is too small and there should be an example on something more realistic.

Some results

Training for 22 epochs

Loss graph for 5000 mini-batches

1 mini-batch = 64 images. Dataset = 14490, hence 5000 mini-batches is approximately 22 epochs.
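
The epoch count follows directly from the numbers above. A quick check, with variable names that are illustrative only and not taken from the scripts:

```python
# Numbers quoted in the text above.
batch_size = 64          # images per mini-batch
dataset_size = 14490     # images in the dataset
mini_batches = 5000      # mini-batches trained

images_seen = batch_size * mini_batches   # total images processed
epochs = images_seen / dataset_size       # passes over the whole dataset

print(f"{images_seen} images seen = {epochs:.2f} epochs")
```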

Some outputs of the 5000th mini-batch

Some training images

Useful resources, before you go on

There are great examples on MNIST already. Be sure to check them out.
"How to Train a GAN? Tips and tricks to make GANs work" is a must read! (GAN Hacks)
- The advice was extremely helpful in making this example.
- https://github.com/soumith/ganhacks
Projects doing the same thing:
- https://github.com/jayleicn/animeGAN
- https://github.com/tdrussell/IllustrationGAN
I used a slow implementation for the sake of simplicity. However, the correct way is:
- https://ctmakro.github.io/site/on_learning/fast_gan_in_keras.html
- https://github.com/shekkizh/neuralnetworks.thought-experiments/blob/master/Generative%20Models/GAN/Readme.md

How to run this example

Setup

My environment: Python 3.6 + Keras 2.0.4 + TensorFlow 1.x
- If you are on Keras 2.0.0, you need to update it; otherwise BatchNormalization() will fail with an error from the TensorFlow backend saying "you need to pass float to input" or something like that.
Use virtualenv to initialize a similar environment (python and dependencies):

pip install virtualenv
virtualenv -p <PATH_TO_BIN_DIR>/python3.6 venv
source venv/bin/activate
pip install -r requirements.txt

I HATE making a program that has so many command line parameters to pass. Many of the parameters are set in the scripts themselves. Adjust the scripts as you need. The "main()" function is at the bottom of each script, as people do in C/C++.
Most global parameters are defined in args.py.
- They are defined as class variables, not instance variables, so you may have trouble running/training multiple instances of the GAN with different parameters in one process (which is very unlikely to happen).
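
To see why class variables get in the way of running two differently configured GANs side by side, here is a minimal sketch. The class names and values are hypothetical (only dataset_sz is a name mentioned for args.py); it is not the actual content of args.py.

```python
class Args:
    # Class variables: a single copy shared by everything that reads Args.
    # (Hypothetical values; the real args.py may differ.)
    dataset_sz = 1000
    batch_sz = 64


def describe_run():
    # Code that reads Args.<name> always sees the one shared value.
    return Args.dataset_sz, Args.batch_sz


run_a = describe_run()
Args.batch_sz = 32        # "reconfigure" for a second GAN ...
run_b = describe_run()    # ... and every reader of Args now sees the change

print(run_a, run_b)


class InstanceArgs:
    # Instance variables: each object carries its own values.
    def __init__(self, dataset_sz=1000, batch_sz=64):
        self.dataset_sz = dataset_sz
        self.batch_sz = batch_sz


first = InstanceArgs(batch_sz=64)
second = InstanceArgs(batch_sz=32)
print(first.batch_sz, second.batch_sz)
```

With the class-variable style, two GANs in the same process cannot hold different settings at once; with the instance style each one gets its own object.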
Download dataset from http://www.nurs.or.jp/~nagadomi/animeface-character-dataset/
- Extract it to this directory so that the script can find ./animeface-character-dataset/thumb/
- Any dataset should work in principle, but GANs are sensitive to hyperparameters and may not work on yours. I tuned the parameters for animeface-character-dataset.

Preprocessing

Run the preprocessing script. Resizing and scaling the input ahead of time saves training time compared with doing those tasks on the fly in the training loop.
- ./data.py
- When the images are loaded from PNG files, the RGB values are in [0, 255] (uint8 type). data.py will collect the images, resize them to 64x64, and scale the RGB values so that they are in the [-1.0, 1.0] range.
- data.py will only sample a subset of the dataset if configured to do so. The size of the subset is determined by dataset_sz, defined in args.py.
- The images will be written to data.hdf5.
  - Made it small to verify the training is working.
  - You can increase it but you need to adjust the network sizes accordingly.
- Again, which files to read is defined in the script at the bottom, not by sys.argv.
You need a large enough dataset. Otherwise the discriminator will sort of "memorize" the true data and reject all that's generated.
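
The scaling and subset sampling described above can be sketched with NumPy as follows. This is an illustration under assumptions, not the code in data.py: the function names are made up, random arrays stand in for the already-resized 64x64 images, and the HDF5 writing step is left out.

```python
import numpy as np


def to_unit_range(img_u8):
    """Map uint8 RGB values in [0, 255] to floats in [-1.0, 1.0]."""
    return img_u8.astype(np.float32) / 127.5 - 1.0


def to_uint8(img_f):
    """Inverse mapping, e.g. for writing generated images back to PNG."""
    return np.clip((img_f + 1.0) * 127.5, 0, 255).round().astype(np.uint8)


def sample_subset(images, dataset_sz, seed=0):
    """Pick dataset_sz images at random, without replacement."""
    rng = np.random.default_rng(seed)
    count = min(dataset_sz, len(images))
    idx = rng.choice(len(images), size=count, replace=False)
    return images[idx]


# Stand-in for resized 64x64 RGB images (random data, not the real dataset).
fake_images = np.random.default_rng(1).integers(
    0, 256, size=(10, 64, 64, 3), dtype=np.uint8
)

subset = sample_subset(fake_images, dataset_sz=4)
scaled = to_unit_range(subset)
print(scaled.shape, scaled.min(), scaled.max())
```

The [-1.0, 1.0] range is the natural match for a generator whose last layer is tanh, which is a common choice; whether gan.py does this is not stated here.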

Training

Open gan.py and, at the bottom, uncomment train_autoenc() if you wish.
- This is useful for seeing the generator network's capability to reproduce the input.
- The auto-encoder will be trained on input images.
- The output will be blurry, because the auto-encoder uses a mean-squared-error loss. (This is why GAN got invented in the first place!)
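
A toy illustration of why a mean-squared-error loss leads to blur: when several sharp outputs are equally plausible, the prediction that minimizes MSE is their average. The numbers below are invented for the example and have nothing to do with the actual network.

```python
# Two equally plausible "sharp" targets for the same input, e.g. a pixel
# that is dark in one image and bright in another (values in [-1, 1]).
targets = [-1.0, 1.0]


def mse(prediction):
    """Mean squared error of one prediction against all plausible targets."""
    return sum((prediction - t) ** 2 for t in targets) / len(targets)


# Try candidate predictions from -1.0 to 1.0 in steps of 0.1.
candidates = [i / 10 for i in range(-10, 11)]
best = min(candidates, key=mse)

print(best, mse(best), mse(1.0))
```

The best prediction is the in-between gray value (0.0), not either of the sharp values. Averaged over a whole image, that is what shows up as blur.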
To run training, modify main() so that train_gan() is uncommented.
The script will dump reals.png and fakes.png every 10 epochs so that you can see how the training is going.
The training takes a while. For this example on the Anime Face dataset, it took about 10000 mini-batches to get good results.
- If you still see only uniform color or "modern art" after 2000 mini-batches, the training is not working!
The script also dumps weights every 10 batches. Use them to save training time. Weights from before the training diverged are preferred :) To load them, uncomment load_weights() in train_gan().

Training tips

What I experienced while training the GAN.

As described in GAN Hacks, the discriminator should be ahead of the generator so that the generator can be "guided" by the discriminator.
If you look at the loss graph at https://github.com/osh/KerasGAN, they had a generator loss in the range of 2 to 4. Their training worked well. The discriminator loss is low, around 0.1.
You'll need trial and error to get the hyperparameters right so that the training stays in the stable, balanced zone. That includes the learning rates of D and G, the momentum values, etc.
The convergence is quite sensitive to LR, beware!
If things go well, the discriminator losses for detecting real/fake (dloss0/dloss1) should be less than or around 0.1, which means the discriminator is good at telling whether the input is real or fake.
If the learning rate is too high, the discriminator will diverge: one of the losses will get high and will not fall. Training fails in this case.
If you make LR too small, it will only slow the learning and will not prevent other issues such as oscillation. It only needs to be lower than a certain threshold that is data dependent.
If adjusting LR doesn't work, it could be a lack of complexity in the discriminator. Add more layers, or change some other parameters. It could be anything :( Good luck!
On the other hand, the generator loss will be relatively higher than the discriminator loss. In this script, it oscillates in the range 0.1 to 4.
If you see any of the D losses staying > 15 (when batch size is 32), the training is screwed.
In case of G loss > 15, see if it escapes within 30 batches. If it stays there for too long, it isn't good, I think.
If you're seeing high G loss, it could mean the generator can't keep up with the discriminator. You might need to increase its LR. (It must still be lower than the discriminator's LR, though.)
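
The rules of thumb above for D and G loss can be turned into a small check to run on the logged losses. This is only a sketch: the function names are made up, the thresholds 15 and 30 come from the text, and d_patience (how many batches count as "staying") is an assumption, since the text gives no number for it.

```python
def trailing_run(losses, limit):
    """Count how many of the most recent losses in a row exceed limit."""
    run = 0
    for value in reversed(losses):
        if value <= limit:
            break
        run += 1
    return run


def check_losses(d_losses, g_losses, limit=15.0, g_patience=30, d_patience=5):
    """Return warnings for per-mini-batch loss lists (oldest first).

    d_patience is an assumed value; limit and g_patience follow the text.
    """
    warnings = []
    if trailing_run(d_losses, limit) >= d_patience:
        warnings.append("D loss is staying above the limit: training has likely failed")
    if trailing_run(g_losses, limit) > g_patience:
        warnings.append("G loss has not escaped the limit: consider restarting")
    return warnings


healthy = check_losses([0.1] * 50, [2.0] * 50)
stuck = check_losses([0.1] * 50, [2.0] * 10 + [16.0] * 40)
print(healthy)
print(stuck)
```

Calling something like this every few batches makes it easier to stop a doomed run early and go back to weights saved before the divergence.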
One final piece of the training I was missing was the parameter in BatchNormalization. I found out about it at this link: https://github.com/shekkizh/neuralnetworks.thought-experiments/blob/master/Generative%20Models/GAN/Readme.md
- Interestingly, in PyTorch the momentum parameter for BatchNorm is 0.1 according to the API documents, while in Keras it is 0.99. I'm not sure if 0.1 in PyTorch actually means 1 - 0.1. I didn't look into the PyTorch backend implementation.
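
As far as the two API documents describe the running-statistics update, the two libraries use opposite conventions: Keras multiplies the old moving average by momentum, while PyTorch multiplies the new batch statistic by momentum. Under that reading, 0.1 in PyTorch corresponds to 0.9 in Keras, not to the Keras default of 0.99. A plain-Python sketch of the two update rules (illustrative functions, not the libraries' actual code):

```python
def keras_style_update(moving, batch_stat, momentum=0.99):
    # Keras convention: momentum weights the OLD moving average.
    return moving * momentum + batch_stat * (1.0 - momentum)


def pytorch_style_update(running, batch_stat, momentum=0.1):
    # PyTorch convention: momentum weights the NEW batch statistic.
    return (1.0 - momentum) * running + momentum * batch_stat


moving, batch_mean = 0.0, 1.0

# Same step size: Keras 0.9 behaves like PyTorch 0.1.
print(keras_style_update(moving, batch_mean, momentum=0.9))
print(pytorch_style_update(moving, batch_mean, momentum=0.1))

# The Keras default of 0.99 moves ten times more slowly.
print(keras_style_update(moving, batch_mean, momentum=0.99))
```

So when porting a GAN recipe between the two, the BatchNorm momentum needs to be converted as 1 - momentum rather than copied over.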

Comments

There is some BC issue with new versions of Keras and OpenCV. Downgrading

Sorry, did not fully achieve my previous PR https://github.com/forcecore/Keras-GAN-Animeface-Character/pull/3

Here's a fix to downgrade Keras from 2.2 to 2.0 to avoid this issue: Number of trainable weights seem to change after model compilation

opened by pitpit 1
Minibatch discrimination missing bias term
The minibatch discrimination proposed here indicates that a bias term is added before concatenating the kernels to the main input. As I'm not 100% sure which implementation is the correct one, I would suggest to check and improve your code accordingly, with an update in the build method

self.b = self.add_weight(shape=(self.nb_kernels,), initializer=keras.initializers.zeros(), name='bias', regularizer=self.W_regularizer, trainable=True, constraint=self.W_constraint)

and in the call method

minibatch_features += self.b
opened by HitLuca 1
