Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Last update: Dec 26, 2022

Related tags

Deep Learning LDL

Overview

LDL

Paper | Supplementary Material

Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution
Jie Liang*, Hui Zeng*, and Lei Zhang.
In CVPR 2022 (Oral Presentation).

Abstract

Single image super-resolution (SISR) with generative adversarial networks (GAN) has recently attracted increasing attention due to its potentials to generate rich details. However, the training of GAN is unstable, and it often introduces many perceptually unpleasant artifacts along with the generated details. In this paper, we demonstrate that it is possible to train a GAN-based SISR model which can stably generate perceptually realistic details while inhibiting visual artifacts. Based on the observation that the local statistics (e.g., residual variance) of artifact areas are often different from the areas of perceptually friendly details, we develop a framework to discriminate between GAN-generated artifacts and realistic details, and consequently generate an artifact map to regularize and stabilize the model training process. Our proposed locally discriminative learning (LDL) method is simple yet effective, which can be easily plugged in off-the-shelf SISR methods and boost their performance. Experiments demonstrate that LDL outperforms the state-of-the-art GAN based SISR methods, achieving not only higher reconstruction accuracy but also superior perceptual quality on both synthetic and real-world datasets.

Overall illustration of the LDL:

For more details, please refer to our paper.

Getting started

Clone this repo.

git clone https://github.com/csjliang/LDL
cd LDL

Install dependencies. (Python 3 + NVIDIA GPU + CUDA. Recommend to use Anaconda)

pip install -r requirements.txt

Prepare the training and testing dataset by following this instruction.
Prepare the pre-trained models by following this instruction.

Training

First, check and adapt the yml file options/train/LDL/train_Synthetic_LDL.yml (or options/train/LDL/train_Realworld_LDL.yml for real-world image super-resolution), then

Single GPU:

PYTHONPATH="./:${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0 python basicsr/train.py -opt options/train/LDL/train_Synthetic_LDL.yml --auto_resume

PYTHONPATH="./:${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0 python realesrgan/train.py -opt options/train/LDL/train_Realworld_LDL.yml --auto_resume

Distributed Training:

PYTHONPATH="./:${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 --master_port=5678 basicsr/train.py -opt options/train/LDL/train_Synthetic_LDL.yml --launcher pytorch --auto_resume

PYTHONPATH=":${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 --master_port=4321 realesrgan/train.py -opt options/train/LDL/train_Realworld_LDL.yml --launcher pytorch --auto_resume

Training files (logs, models, training states and visualizations) will be saved in the directory ./experiments/{name}

Testing

First, check and adapt the yml file options/test/LDL/test_LDL_Synthetic_x4.yml (or options/test/LDL/test_LDL_Realworld_x4.yml for real-world image super-resolution), then

Calculate metrics and save visual results for synthetic tasks:

PYTHONPATH="./:${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0 python basicsr/test.py -opt options/test/LDL/test_LDL_Synthetic_x4.yml

Save visual results for real-world image super-resolution:

PYTHONPATH="./:${PYTHONPATH}" CUDA_VISIBLE_DEVICES=0 python basicsr/test.py -opt options/test/LDL/test_LDL_Realworld_x4.yml

Evaluating files (logs and visualizations) will be saved in the directory ./results/{name}

The Training and testing steps for scale=2 are similar.

Get Quantitative Metrics

First, check and adapt the settings of the files in metrics, then (take PSNR as an example) run

PYTHONPATH="./:${PYTHONPATH}" python scripts/metrics/table_calculate_psnr_all.py

Other metrics are similar.

License

This project is released under the Apache 2.0 license.

Citation

@inproceedings{jie2022LDL,
  title={Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution},
  author={Liang, Jie and Zeng, Hui and Zhang, Lei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2022}
}

Acknowledgement

This project is built based on the excellent BasicSR project.

Contact

Should you have any questions, please contact me via [email protected].

Comments

The pretrain model of scale=2

Hi, i download the pretrain model file that are scale=4. Can you sharring the scale=2 pretrain model file? The parameter scale=4 isn't change in class RRDBNet as follow:

opened by gaowq2017 3
the difference of type A, B and C

Hi, in figure 5, the sigma values for type A, B, and C patches are 0.25, 0.39, 0.67. These numbers are setted originally or calculated by the input image patch?

opened by jiamingNo1 2
error when training

when I tried to train LDL using train_Realworld_LDL.yml, it shows an error below

Traceback (most recent call last): File "./basicsr/train.py", line 212, in train_pipeline(root_path) File "./basicsr/train.py", line 121, in train_pipeline result = create_train_val_dataloader(opt, logger) File "./basicsr/train.py", line 39, in create_train_val_dataloader train_set = build_dataset(dataset_opt) File "/nas/workspace/anse/code/pytorch/SR/LDL/basicsr/data/init.py", line 34, in build_dataset dataset = DATASET_REGISTRY.get(dataset_opt['type'])(dataset_opt) File "/nas/workspace/anse/code/pytorch/SR/LDL/basicsr/utils/registry.py", line 65, in get raise KeyError(f"No object named '{name}' found in '{self._name}' registry!") KeyError: "No object named 'RealESRGANDataset' found in 'dataset' registry!"

opened by anse3832 2
inference error

hi

when i test swinir in /inference/inference.py (two others are ok)

from basicsr.archs.rrdbnet_arch import RRDBNet from basicsr.archs.srresnet_arch import MSRResNet from basicsr.archs.swinir_arch import SwinIR

#set up model #model = RRDBNet(num_in_ch=3, num_out_ch=3, num_feat=64, num_block=23, num_grow_ch=32) #model = MSRResNet(num_in_ch=3, num_out_ch=3, num_feat=64, num_block=16, upscale=4) model = SwinIR(upscale=4, img_size=(64, 64), window_size=8, img_range=1., depths=[6, 6, 6, 6], embed_dim=180, num_heads=[6, 6, 6, 6], mlp_ratio=2, upsampler='pixelshuffle')

#model.load_state_dict(torch.load(args.model_path)['params'], strict=True) model.load_state_dict(torch.load(args.model_path), strict=True)

ir occured error , whats the SwinIR params

thanks

opened by zhangyunming 1
he teeth have a weird phenomenon

When I train on the face dataset, the teeth have a weird phenomenon。The portrait dataset is FFHQ, and the teeth generated by super-resolution are divided into multiple tooth blocks. May I ask what is the reason for this ?

opened by carfei 1
No basicSR Version?

Obviously, BasicSR is a good tool. However It is true that BasicSR is easy to use, but when I try to apply it, it takes too long to find out where those codes are. If you have time, could you make a version that is not BasicSR? It doesn't matter if you don't have distributed learning or other functions.

opened by puppy9207 0
Training error

The error in the training process seems to be a data problem,but i can not find any reason.There is nothing wrong with the data.There is an error in the middle of the training

opened by carfei 0

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Related tags

Overview

LDL

Paper | Supplementary Material

Abstract

Getting started

Training

Testing

Get Quantitative Metrics

License

Citation

Acknowledgement

Contact

Comments

The pretrain model of scale=2

the difference of type A, B and C

error when training

inference error

he teeth have a weird phenomenon

No basicSR Version?

Training error

Owner

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

[CVPR 2022] Official PyTorch Implementation for "Reference-based Video Super-Resolution Using Multi-Camera Video Triplets"

Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning"

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Official implementation of the paper 'Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution'

Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral)

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.

The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.

Implementation of paper: "Image Super-Resolution Using Dense Skip Connections" in PyTorch

Unofficial pytorch implementation of the paper "Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution"

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

The official implementation of our CVPR 2021 paper - Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach