AttGAN: Facial Attribute Editing by Only Changing What You Want (IEEE TIP 2019)

Zhenliang He

Last update: Dec 14, 2022

News

11 Jan 2020: We cleaned up the code to make it more readable! The old version is here: v1.

AttGAN
(TIP Nov. 2019, arXiv Nov. 2017)

TensorFlow implementation of AttGAN: Facial Attribute Editing by Only Changing What You Want.

Other implementations of AttGAN
- AttGAN-PyTorch by Yu-Jing Lin
- AttGAN-PaddlePaddle by ceci3 and zhumanyu (AttGAN is one of the official reproduced models of PaddlePaddle)
Closely related works
- An excellent work built upon our code - STGAN (CVPR 2019) by Ming Liu
- Changing-the-Memorability (CVPR 2019 MBCCV Workshop) by acecreamu
- Fashion-AttGAN (CVPR 2019 FSS-USAD Workshop) by Qing Ping
An unofficial demo video of AttGAN by 王一凡

Exemplar Results

See results.md for more results; we also try higher resolutions and more attributes (all 40 attributes!).
Inverting 13 attributes respectively

From left to right: Input, Reconstruction, Bald, Bangs, Black_Hair, Blond_Hair, Brown_Hair, Bushy_Eyebrows, Eyeglasses, Male, Mouth_Slightly_Open, Mustache, No_Beard, Pale_Skin, Young
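
"Inverting 13 attributes respectively" can be read as producing one edited label vector per attribute, each differing from the input's labels in exactly one position. The following is a minimal NumPy sketch of that reading, assuming binary 0/1 labels; `invert_each` and the example `source` vector are hypothetical and are not taken from the repository.

```python
import numpy as np

# The 13 attributes listed above, in the same order.
ATT_NAMES = ["Bald", "Bangs", "Black_Hair", "Blond_Hair", "Brown_Hair",
             "Bushy_Eyebrows", "Eyeglasses", "Male", "Mouth_Slightly_Open",
             "Mustache", "No_Beard", "Pale_Skin", "Young"]


def invert_each(labels):
    """Return an (n, n) array whose row i is `labels` with only bit i flipped."""
    labels = np.asarray(labels, dtype=np.int64)
    n = labels.shape[0]
    # XOR with the identity matrix flips exactly one entry per row.
    return np.bitwise_xor(np.tile(labels, (n, 1)), np.eye(n, dtype=np.int64))


# Hypothetical source labels: only "Young" is set.
source = np.zeros(len(ATT_NAMES), dtype=np.int64)
source[ATT_NAMES.index("Young")] = 1

edited = invert_each(source)  # one target vector per output column
```

Each row of `edited` would then be the target attribute vector for one column of the result grid.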

Usage

Environment
- Python 3.6
- TensorFlow 1.15
- OpenCV, scikit-image, tqdm, oyaml
- we recommend Anaconda or Miniconda; you can then create the AttGAN environment with the commands below
```
conda create -n AttGAN python=3.6

source activate AttGAN

conda install opencv scikit-image tqdm tensorflow-gpu=1.15

conda install -c conda-forge oyaml
```
- NOTICE: if you create a new conda environment, remember to activate it before any other command
```
source activate AttGAN
```
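
After activating the environment, a quick check that the listed packages are importable can save a failed run later. This is a small sketch using only the standard library; the mapping from package names to import names (`cv2`, `skimage`, and so on) reflects the usual import names of these packages, and `missing_packages` is a hypothetical helper, not part of the repository.

```python
import importlib.util
import sys

# Package name as listed above -> module name used in `import`.
REQUIRED = {
    "tensorflow": "tensorflow",
    "OpenCV": "cv2",
    "scikit-image": "skimage",
    "tqdm": "tqdm",
    "oyaml": "oyaml",
}


def missing_packages(required=REQUIRED):
    """Return the names of packages whose modules cannot be found."""
    return [name for name, module in required.items()
            if importlib.util.find_spec(module) is None]


print("Python %d.%d" % sys.version_info[:2])
print("Missing packages:", missing_packages())
```

An empty list means every module was found; it does not verify the installed versions.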
Data Preparation
- Option 1: CelebA-unaligned (higher quality than the aligned data, 10.2GB)
  - download the dataset
    - img_celeba.7z (move to ./data/img_celeba/img_celeba.7z): Google Drive or Baidu Netdisk (password rp0s)
    - annotations.zip (move to ./data/img_celeba/annotations.zip): Google Drive
  - unzip and process the data
```
7z x ./data/img_celeba/img_celeba.7z/img_celeba.7z.001 -o./data/img_celeba/

unzip ./data/img_celeba/annotations.zip -d ./data/img_celeba/

python ./scripts/align.py
```
- Option 2: CelebA-HQ (we use the data from CelebAMask-HQ, 3.2GB)
  - CelebAMask-HQ.zip (move to ./data/CelebAMask-HQ.zip): Google Drive or Baidu Netdisk
  - unzip and process the data
```
unzip ./data/CelebAMask-HQ.zip -d ./data/

python ./scripts/split_CelebA-HQ.py
```

Run AttGAN

training (see examples.md for more training commands)

# for CelebA
CUDA_VISIBLE_DEVICES=0 \
python train.py \
--load_size 143 \
--crop_size 128 \
--model model_128 \
--experiment_name AttGAN_128

# for CelebA-HQ
CUDA_VISIBLE_DEVICES=0 \
python train.py \
--img_dir ./data/CelebAMask-HQ/CelebA-HQ-img \
--train_label_path ./data/CelebAMask-HQ/train_label.txt \
--val_label_path ./data/CelebAMask-HQ/val_label.txt \
--load_size 128 \
--crop_size 128 \
--n_epochs 200 \
--epoch_start_decay 100 \
--model model_128 \
--experiment_name AttGAN_128_CelebA-HQ
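
The README does not say how `--n_epochs` and `--epoch_start_decay` interact. A common reading, assumed here and not verified against train.py, is a learning rate that stays constant until `epoch_start_decay` and then decays linearly to zero at `n_epochs`. The sketch below illustrates that assumption only; `linear_decay_lr` and the `base_lr` value are hypothetical.

```python
def linear_decay_lr(epoch, base_lr=2e-4, n_epochs=200, epoch_start_decay=100):
    """Constant rate before `epoch_start_decay`, then linear decay to 0.

    Assumed schedule for illustration; `base_lr` is a placeholder value.
    """
    if epoch < epoch_start_decay:
        return base_lr
    return base_lr * (n_epochs - epoch) / (n_epochs - epoch_start_decay)


# Learning rate at a few epochs for the CelebA-HQ settings above.
schedule = {epoch: linear_decay_lr(epoch) for epoch in (0, 99, 100, 150, 200)}
```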

testing

single attribute editing (inversion)

# for CelebA
CUDA_VISIBLE_DEVICES=0 \
python test.py \
--experiment_name AttGAN_128

# for CelebA-HQ
CUDA_VISIBLE_DEVICES=0 \
python test.py \
--img_dir ./data/CelebAMask-HQ/CelebA-HQ-img \
--test_label_path ./data/CelebAMask-HQ/test_label.txt \
--experiment_name AttGAN_128_CelebA-HQ

multiple attribute editing (inversion) example

# for CelebA
CUDA_VISIBLE_DEVICES=0 \
python test_multi.py \
--test_att_names Bushy_Eyebrows Pale_Skin \
--experiment_name AttGAN_128
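
Conceptually, the command above inverts every attribute named in `--test_att_names` at once and leaves the others untouched. A minimal sketch of that label manipulation follows, assuming binary 0/1 labels; `invert_attributes` is a hypothetical helper and does not reproduce test_multi.py.

```python
def invert_attributes(labels, att_names, test_att_names):
    """Return a copy of `labels` with the named attributes flipped (0 <-> 1)."""
    unknown = [name for name in test_att_names if name not in att_names]
    if unknown:
        raise ValueError("unknown attribute names: %s" % unknown)
    edited = list(labels)
    for name in test_att_names:
        i = att_names.index(name)
        edited[i] = 1 - edited[i]
    return edited


att_names = ["Bald", "Bangs", "Black_Hair", "Blond_Hair", "Brown_Hair",
             "Bushy_Eyebrows", "Eyeglasses", "Male", "Mouth_Slightly_Open",
             "Mustache", "No_Beard", "Pale_Skin", "Young"]
# Hypothetical labels for one image: No_Beard and Young are set.
labels = [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 1]

edited = invert_attributes(labels, att_names, ["Bushy_Eyebrows", "Pale_Skin"])
```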

attribute sliding example

# for CelebA
CUDA_VISIBLE_DEVICES=0 \
python test_slide.py \
--test_att_name Pale_Skin \
--test_int_min -2 \
--test_int_max 2 \
--test_int_step 0.5 \
--experiment_name AttGAN_128
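
The three `--test_int_*` flags describe a range of attribute intensities. A minimal sketch of the values such a range would cover, assuming the upper bound is included (the README does not say whether test_slide.py includes it); `slide_values` is a hypothetical helper.

```python
import numpy as np


def slide_values(test_int_min=-2.0, test_int_max=2.0, test_int_step=0.5):
    """Evenly spaced intensities from min to max, both ends included."""
    n_steps = int(round((test_int_max - test_int_min) / test_int_step))
    return test_int_min + test_int_step * np.arange(n_steps + 1)


values = slide_values()  # intensities for the Pale_Skin example above
```

Computing the number of steps first, instead of calling `np.arange` with a float step directly, avoids the end point being dropped or duplicated by rounding.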

loss visualization

CUDA_VISIBLE_DEVICES='' \
tensorboard \
--logdir ./output/AttGAN_128/summaries \
--port 6006

convert trained model to .pb file

python to_pb.py --experiment_name AttGAN_128

Using Trained Weights
- alternative trained weights (move to ./output/*.zip)
  - AttGAN_128.zip (987.5MB)
    - including G, D, and the state of the optimizer
  - AttGAN_128_generator_only.zip (161.5MB)
    - G only
  - AttGAN_384_generator_only.zip (91.1MB)
- unzip the file (AttGAN_128.zip for example)
```
unzip ./output/AttGAN_128.zip -d ./output/
```
- testing (see above)
Example for Custom Dataset
- AttGAN-Cartoon

Citation

If you find AttGAN useful in your research work, please consider citing:

@ARTICLE{8718508,
author={Z. {He} and W. {Zuo} and M. {Kan} and S. {Shan} and X. {Chen}},
journal={IEEE Transactions on Image Processing},
title={AttGAN: Facial Attribute Editing by Only Changing What You Want},
year={2019},
volume={28},
number={11},
pages={5464-5478},
keywords={Face;Facial features;Task analysis;Decoding;Image reconstruction;Hair;Gallium nitride;Facial attribute editing;attribute style manipulation;adversarial learning},
doi={10.1109/TIP.2019.2916751},
ISSN={1057-7149},
month={Nov},}

Comments

TypeError

Hello, I have downloaded the trained model and am trying to test it, but I am getting the following error. Can you please suggest what went wrong?

I am testing on Google Colab and using only images 182000 to 182637. TypeError: Input 'filename' of 'ReadFile' Op has type float32 that does not match expected type of string.

opened by shbnm21 21
Unable to use different number of images

Hello. I am using the HD CelebA 384 dataset with the provided 384_shortcut1_inject1_none_hd model. I am trying to use a custom number of images instead of all 202599 images. I modified the list_attr_celeba.txt file to include only the first 20 images and put these 20 images in ./data/img_crop_celeba/*.jpg. However, this is the error I get:

TypeError: Input 'filename' of 'ReadFile' Op has type float32 that does not match expected type of string.

I also tried to train with only 20 images and get the same error. I get no errors when running train/test for all 202599 images.

opened by githubusername001 10
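
Both reports above involve a reduced image list. One possible cause, stated here as an assumption and not confirmed in either thread, is that the list of image paths ends up empty: an empty array has a float dtype by default, which would not match an op that expects strings. The sketch below shows that default and a guard against it; `checked_image_paths` is a hypothetical helper, not repository code.

```python
import os

import numpy as np

# An empty array defaults to a float dtype, not a string dtype.
print(np.array([]).dtype)  # float64


def checked_image_paths(img_dir, names):
    """Return existing image paths as a string array; fail loudly if none exist."""
    candidates = [os.path.join(img_dir, name) for name in names]
    paths = np.array([p for p in candidates if os.path.isfile(p)], dtype=str)
    if paths.size == 0:
        raise ValueError("no listed image was found under %r" % img_dir)
    return paths
```

Calling such a check before building the input pipeline would turn the dtype error into a message that points at the label file or image directory.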
Questions about the handling of noise z in DTLCGAN with an encoder attached

Hi, I am referring to your DTLCGAN code. Following your last reply, I added an encoder to it and have some questions about your code. In your train.py, the z samples are generated by: z_ipt_samples = [np.stack([np.random.normal(size=[z_dim])] * len(c_ipt_sample)) for i in range(15)], which has shape (15, 18, 100).

Now that I use an encoder, the noise z (as well as the z_ipt used in training) should be replaced by the encoder's output, right?

But what does len(c_ipt_sample) mean here? Do you generate 18 noise vectors for one test sample? Looking at your training samples, the lowest layer of your decision tree does have 18 images (2×3×3 = 18). So why do you generate the test samples from bottom to top rather than the reverse? How can you be certain that these 18 noise vectors all belong to the same person, since you generate from bottom to top?

Besides, should my encoder work in parallel, taking the 18 frontcodes of 18 images and using them for the sampling? This seems wrong, because my 18 frontcodes come from 18 different images (that is, 18 different persons), and the resulting sampling tree was weird (some samples are OK, and I am confused about them). But if I use the frontcode of a single image (the same person) and copy it 18 times, the training samples are all the same, with no change of attributes at all.

opened by XijieJiao 8
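
The expression quoted in the question can be run on its own to see what it produces. The sketch below only demonstrates the behaviour of that one line and is not the author's reply; `n_codes` is a stand-in for `len(c_ipt_sample)`, set to 18 as stated in the question.

```python
import numpy as np

z_dim = 100
n_codes = 18  # stand-in for len(c_ipt_sample)

# Same structure as the quoted line: one z is drawn, then repeated n_codes times.
z_ipt_samples = [np.stack([np.random.normal(size=[z_dim])] * n_codes)
                 for i in range(15)]

stacked = np.array(z_ipt_samples)  # shape (15, 18, 100)

# Inside each of the 15 entries, all 18 rows are copies of the same vector.
rows_identical = all(np.all(z == z[0]) for z in z_ipt_samples)
```

So the 18 rows in one entry are a single noise vector repeated 18 times, and only the 15 entries differ from each other.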
Facial Attribute

Hello, @LynnHo. Can you tell us how to do facial attribute extraction? That is, given any face image as input, how can we get the 40 facial attributes from it?

Thanks.

opened by xyzdcgan 8
The same result for all the attributes.

As written in the title, I obtain a row of identical images without any changes, regardless of the attribute (column). I use a custom dataset organized like CelebA. Could you advise what may cause this?

opened by acecreamu 8
Attribute Classifier for Editing Accuracy/Error

I'm curious what you used for the attribute classifier to measure the attribute editing accuracy and preservation error. Also do you have any plans to release this trained model? Thanks.

opened by tegillis 7
About the performance of pretrained model

The pre-trained model you provided does not perform well on the CelebA-HQ dataset. How many epochs was the pre-trained model trained for, and on what dataset? Another question: my use case is adding glasses to a face, so I need to know whether training a new model from scratch on the CelebA-HQ dataset would help me achieve this. Can we train the model on a single attribute, such as eyeglasses or a smile? Thanks in advance.

opened by alan-ai-learner 4
Attribute Style Manipulation
Hi, thank you for sharing the great project. I found your attribute style manipulation particularly meaningful and useful for my recent research. I saw from a previous issue that you have no plan to open-source the code for this part. I have the following questions:

I could not find anywhere in your paper how you derive θ or the relationship between θ and the image, so how do you get θ in an unsupervised way for each input?

Is the idea of this part (and the way you derive θ) based on the paper 'Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks'? (I found that their code is also not open source.)

If I want to implement this part myself, could you give me some hints on where to start, or any papers and sources I could refer to? (There are very few works on accurate or multiple attribute style manipulation.)

Thank you!
opened by XijieJiao 4
Cannot get a desired result on CelebA-HQ dataset

Hi there,

Your work is interesting. I have a problem. Could you help figure it out?

I applied your method to the CelebA-HQ dataset for single-attribute manipulation, but I cannot get the desired result. The result at the 59th training epoch (the attribute of interest is "Smiling") is shown below. There is no change in the images of the third column.

Thanks and Regards,

opened by EvaFlower 4
Hi, I have a question about the training and the test

First, I appreciate your excellent work and have been interested in your work since 2018.

I have a question about testing and training in your work. To be clear up front, I consider the case where the attribute values are binary.

For training, the value of attributes seems to be -1 or 1. (Read 0 or 1 then, *2 - 1 -> [-1, 1]) (https://github.com/LynnHo/AttGAN-Tensorflow/blob/master/train.py#L161)

On the other hand, the range of attributes is [-2, 2] for test. ( Read 0 or 1 then, *2 - 1 -> [-1, 1], finally *2 -> [-2, 2] ) (https://github.com/LynnHo/AttGAN-Tensorflow/blob/master/train.py#L246, test_int = 2.0)

Is it right that you use different values of the attribute vector in training and testing?

I found that I cannot reproduce the attribute classification result without this trick. However, I can reproduce it by using [-2, 2].

Thanks!

opened by FriedRonaldo 4
Why attributes are encoded into [-1, 1] not [0, 1]

@LynnHo Hi, I have been reading your code these days, and I wonder why the attribute labels have to be mapped into [-1, 1] instead of [0, 1]. It seems to be very important and to have some technical reason, because you marked that code with a comment containing three exclamation marks. Could you share some experimental knowledge about this?

opened by ChengBinJin 4
Applying your code in datasets from masked face to non-masked face

I want to apply this code to CelebA with synthetically masked face images, in order to remove the masks. How can I apply this concept? Can you tell me whether this is possible with your code? If yes, where should I change the code: just train.py and data.py?

opened by Nuha1412 1
Style manipulation not robust, very sensitive to varied parameters

How do you strike a good balance among the various hyper-parameters, such as the different loss weights and the learning rate, when style manipulation is adopted? I found the training of the network very unstable.

I can get style manipulation results on bangs and eyeglasses, but the control is unstable and the sharpness of the images is also affected. The control over eyeglasses only affects the shade; the model has no control over shape and size.

Apart from the hyper-parameters, are there any other places where problems can arise, such as the training settings?

Besides, when implementing style manipulation, do you also use the original attribute loss in addition to the loss of the generated style controller?

Looking forward to your answer. Thank you!

opened by jiaoxijie 2

Releases(v1)

v1(Jan 3, 2020)


Owner

Zhenliang He


FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning PyTorch implementation for the paper: FACIAL: Synthesizing Dynamic Talking

226 Jan 8, 2023

Deepface is a lightweight face recognition and facial attribute analysis (age, gender, emotion and race) framework for python

deepface Deepface is a lightweight face recognition and facial attribute analysis (age, gender, emotion and race) framework for python. It is a hybrid

2 Feb 10, 2022

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search, accepted by IJCAI 2021.

Instance-Aware Latent-Space Search This is a PyTorch implementation of the following paper: Disentangled Face Attribute Editing via Instance-Aware Lat

67 Dec 21, 2022

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing Figure: Joint multi-attribute edits using DyStyle model. Great diversity

74 Dec 3, 2022

Implementation for HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

HFGI: High-Fidelity GAN Inversion for Image Attribute Editing High-Fidelity GAN Inversion for Image Attribute Editing Update: We released the inferenc

371 Dec 30, 2022

Web service for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation based on OpenFace 2.0

OpenGaze: Web Service for OpenFace Facial Behaviour Analysis Toolkit Overview OpenFace is a fantastic tool intended for computer vision and machine le

4 Nov 3, 2022

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

OpenFace 2.2.0: a facial behavior analysis toolkit Over the past few years, there has been an increased interest in automatic facial behavior analysis

5.8k Dec 31, 2022

Automatically measure the facial Width-To-Height ratio and get facial analysis results provided by Microsoft Azure

fwhr-calc-website This project is to automatically measure the facial Width-To-Height ratio and get facial analysis results provided by Microsoft Azur

1 Feb 7, 2022

Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

Talk-to-Edit (ICCV2021) This repository contains the implementation of the following paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog Yumin

221 Jan 7, 2023

Stitch it in Time: GAN-Based Facial Editing of Real Videos

STIT - Stitch it in Time [Project Page] Stitch it in Time: GAN-Based Facial Edit

1.1k Jan 4, 2023

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing (CVPR 2022) This repository provides the official PyTorch impleme

128 Jan 3, 2023

I have created this Virtual Paint Program, in this you can paint(draw) on your screen using hand gestures, created in Python-3 using OpenCV and Mediapipe library. Gestures :- Index Finger for drawing and Index+Middle Finger for changing position and objects.

Virtual-Paint I have created this Virtual Paint Program, in this you can paint(draw) on your screen using hand gestures, created in Python-3. Gestures

6 Sep 22, 2021

In the case of your data having only 1 channel while want to use timm models

Changing the Mind of Transformers for Topically-Controllable Language Generation

😊 Python module for face feature changing

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Code for the TIP 2021 Paper "Salient Object Detection with Purificatory Mechanism and Structural Similarity Loss"
