SMIT: Stochastic Multi-Label Image-to-image Translation
This repository provides a PyTorch implementation of SMIT. SMIT can stochastically translate an input image to multiple domains using only a single generator and a single discriminator. It only needs a target domain (a binary vector, e.g., [0,1,0,1,1] for 5 different domains) and random Gaussian noise.
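As a rough intuition (hypothetical tensor names and shapes, not the repository's actual API), a single forward pass consumes an image, a binary multi-label target vector, and a Gaussian noise code:

```python
import torch

# Illustrative shapes only; the real generator is defined in this repository.
batch, n_domains, style_dim = 4, 5, 20
image = torch.randn(batch, 3, 128, 128)                         # input images
target = torch.tensor([[0., 1., 0., 1., 1.]]).repeat(batch, 1)  # binary multi-label target
noise = torch.randn(batch, style_dim)                           # random Gaussian style code

# fake = generator(image, target, noise)  # one generator covers every domain combination
```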
Paper
SMIT: Stochastic Multi-Label Image-to-image Translation
Andrés Romero¹, Pablo Arbeláez¹, Luc Van Gool², Radu Timofte²
¹ Biomedical Computer Vision (BCV) Lab, Universidad de Los Andes.
² Computer Vision Lab (CVL), ETH Zürich.
Citation
@article{romero2019smit,
title={SMIT: Stochastic Multi-Label Image-to-Image Translation},
author={Romero, Andr{\'e}s and Arbel{\'a}ez, Pablo and Van Gool, Luc and Timofte, Radu},
journal={ICCV Workshops},
year={2019}
}
Dependencies
The code is implemented in PyTorch; the pretrained models below were trained with PyTorch 1.0. Horovod is required only for multi-GPU training (see below).
Usage
Cloning the repository
$ git clone https://github.com/BCV-Uniandes/SMIT.git
$ cd SMIT
Downloading the dataset
To download the CelebA dataset:
$ bash generate_data/download.sh
Train command:
./main.py --GPU=$gpu_id --dataset_fake=CelebA
Each dataset must have its corresponding files under datasets/. All models and figures will be stored at snapshot/models/$dataset_fake/ and snapshot/samples/$dataset_fake/, respectively.
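For orientation, here is a hypothetical sketch of what a per-dataset loader might look like (the class name and attribute-file format are assumptions; the repository's own dataset modules are the reference):

```python
import torch
from PIL import Image
from torch.utils.data import Dataset

class CelebADataset(Dataset):
    """Hypothetical loader: one line per image, e.g. '000001.jpg 0 1 0 1 1'."""

    def __init__(self, lines, transform):
        self.lines = lines
        self.transform = transform

    def __len__(self):
        return len(self.lines)

    def __getitem__(self, idx):
        path, *attrs = self.lines[idx].split()
        image = self.transform(Image.open(path).convert('RGB'))
        label = torch.tensor([float(a) for a in attrs])  # binary attribute vector
        return image, label
```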
Test command:
./main.py --GPU=$gpu_id --dataset_fake=CelebA --mode=test
SMIT expects the .pth weights to be stored at snapshot/models/$dataset_fake/ (alternatively, --pretrained_model=location/model.pth can be provided). If there are several models, it will take the last one in alphabetical order.
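The selection rule can be pictured with a small helper (a sketch, assuming checkpoints are plain .pth state dicts):

```python
import os
from glob import glob

import torch

def load_last_checkpoint(model, model_dir):
    """Load the alphabetically last .pth file in model_dir, mirroring the behavior above."""
    checkpoints = sorted(glob(os.path.join(model_dir, '*.pth')))
    if not checkpoints:
        raise FileNotFoundError('No .pth weights in ' + model_dir)
    model.load_state_dict(torch.load(checkpoints[-1], map_location='cpu'))
    return checkpoints[-1]

# load_last_checkpoint(generator, 'snapshot/models/CelebA/')
```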
Demo:
./main.py --GPU=$gpu_id --dataset_fake=CelebA --mode=test --DEMO_PATH=location/image_jpg/or/location/dir
The demo performs one transformation per attribute, that is, it swaps each attribute with respect to the original input, as in the images below. Accordingly, --DEMO_LABEL can be provided with the real attributes when DEMO_PATH is a single image (if it is not provided, the discriminator acts as a classifier for the real attributes).
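The attribute swap itself can be sketched as flipping one label bit at a time (a hypothetical helper; the demo code in this repo is authoritative):

```python
import torch

def swap_attributes(real_label):
    """Flip one attribute at a time relative to the input's real label.

    real_label: binary vector of shape (n_domains,). Returns one target per attribute.
    """
    targets = real_label.repeat(real_label.size(0), 1)  # (n_domains, n_domains)
    idx = torch.arange(real_label.size(0))
    targets[idx, idx] = 1.0 - targets[idx, idx]         # invert the diagonal entry
    return targets

# Example: label [0, 1, 0] -> targets [[1, 1, 0], [0, 0, 0], [0, 1, 1]]
print(swap_attributes(torch.tensor([0., 1., 0.])))
```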
Pretrained models
Models trained using PyTorch 1.0.
Multi-GPU
For multi-GPU training we use Horovod. Example for training with 4 GPUs:
mpirun -n 4 ./main.py --dataset_fake=CelebA
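For reference, a generic Horovod training setup in PyTorch looks like the sketch below (standard Horovod calls with a stand-in model, not SMIT's actual training loop):

```python
import horovod.torch as hvd
import torch
import torch.nn as nn

hvd.init()                               # one process per GPU when launched via mpirun
torch.cuda.set_device(hvd.local_rank())  # pin each process to its own GPU

model = nn.Linear(10, 5).cuda()          # stand-in module, not the SMIT generator
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4 * hvd.size())

# Average gradients across workers and start all workers from identical state.
optimizer = hvd.DistributedOptimizer(optimizer, named_parameters=model.named_parameters())
hvd.broadcast_parameters(model.state_dict(), root_rank=0)
hvd.broadcast_optimizer_state(optimizer, root_rank=0)
```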
Qualitative Results. Multi-Domain Continuous Interpolation.
First column (original input) -> Last column (opposite attributes: smile, age, gender, sunglasses, bangs, hair color). Top: continuous interpolation for the fake image. Bottom: continuous interpolation for the attention mechanism.
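The continuous interpolation amounts to blending two Gaussian style codes before each generator call; a minimal sketch of the idea (hypothetical generator call):

```python
import torch

z0, z1 = torch.randn(20), torch.randn(20)    # two random Gaussian style codes
for t in torch.linspace(0.0, 1.0, steps=8):
    z = (1.0 - t) * z0 + t * z1              # linear interpolation in noise space
    # fake = generator(image, target, z.unsqueeze(0))  # one frame of the interpolation
```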