Pytorch implementation of the paper: "A Unified Framework for Separating Superimposed Images", in CVPR 2020.

Zhengxia Zou

Last update: Dec 18, 2022

Related tags

Deep Learning Deep-adversarial-decomposition

Overview

Deep Adversarial Decomposition

PDF | Supp | 1min-DemoVideo

Pytorch implementation of the paper: "Deep Adversarial Decomposition: A Unified Framework for Separating Superimposed Images", in CVPR 2020.

In the computer vision field, many tasks can be considered as image layer mixture/separation problems. For example, when we take a picture on rainy days, the image obtained can be viewed as a mixture of two layers: a rain streak layer and a clean background layer. When we look through a transparent glass, we see a mixture of the scene beyond the glass and the scene reflected by the glass.

Separating individual image layers from a single mixed image has long been an important but challenging task. We propose a unified framework named “deep adversarial decomposition” for single superimposed image separation. Our method deals with both linear and non-linear mixtures under an adversarial training paradigm. Considering the layer separating ambiguity that given a single mixed input, there could be an infinite number of possible solutions, we introduce a “Separation-Critic” - a discriminative network which is trained to identify whether the output layers are well-separated and thus further improves the layer separation. We also introduce a “crossroad l1” loss function, which computes the distance between the unordered outputs and their references in a crossover manner so that the training can be well-instructed with pixel-wise supervision. Experimental results suggest that our method significantly outperforms other popular image separation frameworks. Without specific tuning, our method achieves the state of the art results on multiple computer vision tasks, including the image deraining, photo reflection removal, and image shadow removal.

In this repository, we implement the training and testing of our paper based on pytorch and provide several demo datasets that can be used for reproduce the results reported in our paper. With the code, you can also try on your own datasets by following the instructions below.

Our code is partially adapted from the project pytorch-CycleGAN-and-pix2pix.

Requirements

See Requirements.txt.

Setup

Clone this repo:

git clone https://github.com/jiupinjia/Deep-adversarial-decomposition.git 
cd Deep-adversarial-decomposition

Download our demo datasets from 1) Google Drive; or 2) BaiduYun (Key: m9x1), and unzip into the repo directory.

unzip datasets.zip

Please note that in each of our demo datasets, we only uploaded a very small part of the images, which are only used as an example to show how the structure of the file directory is organized. To reproduce the results reported in our paper, you need to download the full versions of these datasets. All datasets used in our experiments are publicly available. Please check out our paper for more details.

Task 1: Image decomposition

On Stanford-Dogs + VGG-Flowers

To train the model:

python train.py --dataset dogsflowers --net_G unet_128 --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --output_auto_enhance

To test the model:

python eval_unmix.py --dataset dogsflowers --ckptdir checkpoints --in_size 128 --net_G unet_128 --save_output

On MNIST + MNIST

To train the model:

python train.py --dataset mnist --net_G unet_64 --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --output_auto_enhance

Task 2: Image deraining

On Rain100H

To train the model:

python train.py --dataset rain100h --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric psnr_gt1

To test the model:

python eval_derain.py --dataset rain100h --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

On Rain800

To train the model:

python train.py --dataset rain800 --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric psnr_gt1

To test the model:

python eval_derain.py --dataset rain800 --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

On DID-MDN

To train the model:

python train.py --dataset did-mdn --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric psnr_gt1

To test the model on DID-MDN:

python eval_derain.py --dataset did-mdn-test1 --ckptdir checkpoints --net_G unet_512 --save_output

To test the model on DDN-1k:

python eval_derain.py --dataset did-mdn-test2 --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

Task 3: Image reflection removal

On Synthesis-Reflection

To train the model (together on all three subsets [defocused, focused, ghosting]):

python train.py --dataset syn3-all --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric psnr_gt1

To test the model:

python eval_dereflection.py --dataset syn3-all --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

You can also train and test separately on the three subsets of Synthesis-Reflection by specifying --dataset above to syn3-defocused, syn3-focused, or syn3-ghosting.

On BDN

To train the model:

python train.py --dataset bdn --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_256 --pixel_loss pixel_loss --metric psnr_gt1

To test the model:

python eval_dereflection.py --dataset bdn --ckptdir checkpoints --net_G unet_256 --in_size 256 --save_output

On Zhang's dataset

To train the model:

python train.py --dataset xzhang --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric psnr_gt1

To test the model:

python eval_dereflection.py --dataset xzhang --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

Task 4: Shadow Removal

On ISTD

To train the model:

python train.py --dataset istd --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_256 --pixel_loss pixel_loss --metric labrmse_gt1

To test the model:

python eval_deshadow.py --dataset istd --ckptdir checkpoints --net_G unet_256 --in_size 256 --save_output

On SRD

To train the model:

python train.py --dataset srd --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_512 --pixel_loss pixel_loss --metric labrmse_gt1

To test the model:

python eval_deshadow.py --dataset srd --ckptdir checkpoints --net_G unet_512 --in_size 512 --save_output

Pretrained Models

The pre-trained models of the above examples can be found in the following link: https://drive.google.com/drive/folders/1Tv4-woRBZOVUInFLs0-S_cV2u-OjbhQ-?usp=sharing

Citation

If you use this code for your research, please cite our paper:

@inproceedings{zou2020deep,
  title={Deep Adversarial Decomposition: A Unified Framework for Separating Superimposed Images},
  author={Zou, Zhengxia and Lei, Sen and Shi, Tianyang and Shi, Zhenwei and Ye, Jieping},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={12806--12816},
  year={2020}
}

Comments

evaluation metric - 6.566 on ISTD in Table 8
This work is very meaningful.

I tested the code provided by your github on the ISTD dataset. The indicators are as follows. Dataset: istd, average rmse-shadow: 8.4986, rmse-non-shadow: 6.3858, rmse-all: 6.7246. This is different from the results provided in Table 8. Can you explain what the problem is? The way I run the code from https://github.com/jiupinjia/Deep-adversarial-decomposition.

python train.py --dataset istd --checkpoint_dir checkpoints --vis_dir val_out --max_num_epochs 200 --batch_size 2 --enable_d1d2 --enable_d3 --enable_synfake --net_G unet_256 --pixel_loss pixel_loss --metric labrmse_gt1 python eval_deshadow.py --dataset istd --ckptdir checkpoints --net_G unet_256 --in_size 256 --save_output

The dataset download from https://github.com/DeepInsight-PCALab/ST-CGAN. The txt file is log file (training+evl). nohup.txt I will appreciate any reply from you.
opened by nachifur 8
About the best_ckpt.pt

Sorry, my computer is too slow, and there is currently no fast enough training equipment. Can you share your trained models for deraining, reflection removal and shadow removal? Thanks a million!

opened by flyz1 7
DID-MDN test error

I have those dirs or files: './eval_output/. /datasets/DID-MDN/val/1_input.jpg' but display the error: [Errno 2] No such file or directory: './eval_output/./datasets/DID-MDN/val/1_input.jpg'

opened by leno0903 4
about the loss function
Hi, Thanks for your good work! But after reading the paper and the code, I'm confused about the loss function，

Where is DM？

I also can not find βC and βM which are in the final objective function formula in this code.
opened by YongshengDong 2
why two Markovian discriminators are used？

This work is very meaningful. I don't know why two discriminators are used here. Will the performance of one discriminator decrease? I did not find relevant experiments and discussions in the article. I will appreciate any reply from you.

opened by nachifur 2
which picture is the output of the DSC model?

This work is very meaningful. But I am confused about which picture is the output of the DSC model? The first line is input. The second line is your model output or DSC model output? The picture provided by https://openaccess.thecvf.com/content_CVPR_2020/supplemental/Zou_Deep_Adversarial_Decomposition_CVPR_2020_supplemental.pdf. I will appreciate any reply from you.

opened by nachifur 2
Table 8 DSC [22] (TPAMI19) 7.100 in ISTD

7.1 is [8] Argan: Attentive recurrent generative adversarial network for shadow detection and removal. [8] retrained DSC and got 7.1. But in DSC [22] (TPAMI19), is 6.67 in ISTD. This should be a small citation error. If you plan to write this article into a journal, you can correct it. Your work is great and the code is complete!

opened by nachifur 1
Hello, I would like to ask about the define_D

Hello, does the D1 D2 D3 in the discriminator mean multi-scale discriminator? self.net_D1 = cycnet.define_D(input_nc=6, ndf=64, netD='n_layers', n_layers_D=2).to(device) self.net_D2 = cycnet.define_D(input_nc=6, ndf=64, netD='n_layers', n_layers_D=2).to(device) self.net_D3 = cycnet.define_D(input_nc=6, ndf=64, netD='n_layers', n_layers_D=3).to(device) About D1 D2 D3 I don't seem to understand why it is divided into 3 discriminators, because their parameters seem to be the same.

opened by SUTwu 0
About rain100h

Hello, I want to train Rain100H so how should I place the data, I want to know how to place the data I make errors in every round of evaluation and I think it's a data set placement problem.

opened by BALDman66 1
Need a clarification regarding train/validation split.

Hi, I computed the PSNR using your pre-trained checkpoint file on dogsflowers dataset. I'm able to get the same PSNR on the validation set. However, I find that PSNR on validation data is better than PSNR on training data which is surprising to me. I evaluated the model 3 times each on training and validation data. I get 31.41+-0.01 on validation and 30.46+-0.01 on training data.

Secondly, when I train the model from scratch, my validation PSNR reaches only 24 where as the training PSNR reaches 31. I'm sharing the notebook which I've used to compute the PSNR. I'm also sharing the screeshots of the notebook when evaluated on training data and on validation data. PSNR is gettiing reported on the output of 7th cell.

For flowers data, I've used 'trn1' and 'val1' for training and validation splits respectively. For the dogs, I randomly separated the images into two groups while keeping 20% of data for validation and 80% for training.

It would help me immensely if you could explain why the training data performs poorly vis a vis the validation data or where I'm doing things incorrectly :). Notebook.zip

On Validation data

On Training Data

opened by ashesh-0 0
Decompose an image to three images

Dear professor: I follow your program to do colour unmixing. Now I unmix an image with two colours to two image with a single colour. But can I decompose an image with three colors to three images on which ther e is only one color? Any suggestion be ok. Thank you! In other words, if the image includes three colors, like12_input, I can't unmix it correctly.

opened by SikangSHU 0
DogFlowers and LSUN datasets

Would you please upload the datasets for training and testing on superimposed image separation? It will help me a lot for both convenient and fair experimental comparison.

opened by Le2Hu 1

Owner

Zhengxia Zou

Postdoc at the University of Michigan. Research interest: computer vision and applications in remote sensing, self-driving, and video games.

GitHub

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

PyTorch implementation of SFNet This is the implementation of the paper "SFNet: Learning Object-aware Semantic Correspondence". For more information,

87 Dec 30, 2022

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

FFD Source Code Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face M

88 Nov 22, 2022

This repository contains the code for the CVPR 2020 paper "Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision"

697 Jan 6, 2023

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

L2F - Learning to Forget for Meta-Learning Sungyong Baik, Seokil Hong, Kyoung Mu Lee Source code for CVPR 2020 paper "Learning to Forget for Meta-Lear

29 May 22, 2022

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

TTNet-Pytorch The implementation for the paper "TTNet: Real-time temporal and spatial video analysis of table tennis" An introduction of the project c

438 Dec 29, 2022

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

The Equalization Losses for Long-tailed Object Detection and Instance Segmentation This repo is official implementation CVPR 2021 paper: Equalization

129 Dec 16, 2022

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Dual super-resolution learning for semantic segmentation 2021-01-02 Subpixel Update Happy new year! The 2020-12-29 update of SISR with subpixel conv p

79 Nov 24, 2022

Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

Poplar Implementation of Bundle Adjustment using Gaussian Belief Propagation on Graphcore's IPU Implementation of CVPR 2020 paper: Bundle Adjustment o

34 Dec 5, 2022

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

UDP-Pose This is the pytorch implementation for UDP++, which won the Fisrt place in COCO Keypoint Challenge at ECCV 2020 Workshop. Top-Down Results on

20 Jul 29, 2022

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation Prerequisite Please create and activate the following conda envrionment. To r

87 Jan 8, 2023

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis This is a PyTorch implementation of the Deep Streaming Linear Discriminant

41 Dec 25, 2022

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

Investigating U-NETS With Various Intermediate Blocks For Spectrogram-based Singing Voice Separation A Pytorch Implementation of the paper "Investigat

63 Nov 14, 2022

Pytorch implementation for the EMNLP 2020 (Findings) paper: Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

Path-Generator-QA This is a Pytorch implementation for the EMNLP 2020 (Findings) paper: Connecting the Dots: A Knowledgeable Path Generator for Common

33 Dec 5, 2022

Pytorch implementation of the paper: "A Unified Framework for Separating Superimposed Images", in CVPR 2020.

Related tags

Overview

Deep Adversarial Decomposition

Pytorch implementation of the paper: "Deep Adversarial Decomposition: A Unified Framework for Separating Superimposed Images", in CVPR 2020.

Requirements

Setup

Task 1: Image decomposition

On Stanford-Dogs + VGG-Flowers

On MNIST + MNIST

Task 2: Image deraining

On Rain100H

On Rain800

On DID-MDN

Task 3: Image reflection removal

On Synthesis-Reflection

On BDN

On Zhang's dataset

Task 4: Shadow Removal

On ISTD

On SRD

Pretrained Models

Citation

Comments

On Validation data

On Training Data

Owner

Zhengxia Zou

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

This repository contains the code for the CVPR 2020 paper "Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision"

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

Pytorch implementation for the EMNLP 2020 (Findings) paper: Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "

The pytorch implementation of the paper "text-guided neural image inpainting" at MM'2020

Learning to Simulate Dynamic Environments with GameGAN (CVPR 2020)

Code accompanying "Dynamic Neural Relational Inference" from CVPR 2020

git《Investigating Loss Functions for Extreme Super-Resolution》(CVPR 2020) GitHub:

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)