[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Guangyao Chen

Last update: Oct 5, 2022

Related tags

Overview

Amplitude-Phase Recombination (ICCV'21)

Official PyTorch implementation of "Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain", Guangyao Chen, Peixi Peng, Li Ma, Jia Li, Lin Du, and Yonghong Tian.

Paper: https://arxiv.org/abs/2108.08487

Abstract: Recently, the generalization behavior of Convolutional Neural Networks (CNN) is gradually transparent through explanation techniques with the frequency components decomposition. However, the importance of the phase spectrum of the image for a robust vision system is still ignored. In this paper, we notice that the CNN tends to converge at the local optimum which is closely related to the high-frequency components of the training images, while the amplitude spectrum is easily disturbed such as noises or common corruptions. In contrast, more empirical studies found that humans rely on more phase components to achieve robust recognition. This observation leads to more explanations of the CNN's generalization behaviors in both adversarial attack and out-of-distribution detection, and motivates a new perspective on data augmentation designed by re-combing the phase spectrum of the current image and the amplitude spectrum of the distracter image. That is, the generated samples force the CNN to pay more attention on the structured information from phase components and keep robust to the variation of the amplitude. Experiments on several image datasets indicate that the proposed method achieves state-of-the-art performances on multiple generalizations and calibration tasks, including adaptability for common corruptions and surface variations, out-of-distribution detection and adversarial attack.

Highlights

Fig. 1: More empirical studies found that humans rely on more phase components to achieve robust recognition. However, CNN without effective training restrictions tends to converge at the local optimum related to the amplitude spectrum of the image, leading to generalization behaviors counter-intuitive to humans (the sensitive to various corruptions and the overconfidence of OOD).

Fig. 2: Four pairs of testing sampless selected from in-distribution CIFAR-10 and OOD SVHN that help explain that CNN capture more amplitude specturm than phase specturm for classification: First, in (a) and (b), the model correctly predicts the original image (1st column in each panel), but the predicts are also exchanged after switching amplitude specturm (3rd column in each panel) while the human eye can still give the correct category through the contour information. Secondly, the model is overconfidence for the OOD samples in (c) and (d). Similarly, after the exchange of amplitude specturm, the label with high confidence is also exchanged.

Fig. 3: Two ways of the proposed Amplitude-Phase Recombination: APR-P and APR-S. Motivated by the powerful generalizability of the human, we argue that reducing the dependence on the amplitude spectrum and enhancing the ability to capture phase spectrum can improve the robustness of CNN.

Citation

If you find our work, this repository and pretrained adversarial generators useful. Please consider giving a star ⭐ and citation.

@inproceedings{chen2021amplitude,
    title={Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain},
    author={Chen, Guangyao and Peng, Peixi and Ma, Li and Li, Jia and Du, Lin and Tian, Yonghong},
    booktitle={Proceedings of the IEEE International Conference on Computer Vision},
    year={2021}
}

1. Requirements

Environments

Currently, requires following packages

python 3.6+
torch 1.7.1+
torchvision 0.5+
CUDA 10.1+
scikit-learn 0.22+

Datasets

For even quicker experimentation, there is CIFAR-10-C and CIFAR-100-C. please download these datasets to ./data/CIFAR-10-C and ./data/CIFAR-100-C.

2. Training & Evaluation

To train the models in paper, run this command:

python main.py --aug <augmentations>

Option --aug can be one of None/APR-S. The default training method is APR-P. To evaluate the model, add --eval after this command.

APRecombination for APR-S and mix_data for APR-P can plug and play in other training codes.

3. Results

Fourier Analysis

The standard trained model is highly sensitive to additive noise in all but the lowest frequencies. APR-SP could substantially improve robustness to most frequency perturbations. The code of Heat maps is developed upon the following project FourierHeatmap.

ImageNet-C

Results of ResNet-50 models on ImageNet-C:

	+(APR-P)	+(APR-S)	+(APR-SP)	+DeepAugMent+(ARP-SP)
mCE	70.5	69.3	65.0	57.5

You might also like...

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

NExT-QA We reproduce some SOTA VideoQA methods to provide benchmark results for our NExT-QA dataset accepted to CVPR2021 (with 1 'Strong Accept' and 2

50 Nov 24, 2022

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

赛题背景在商品知识产权领域，知识产权体现为在线商品的设计和品牌。不幸的是，在每一天，存在着非法商户通过一些对抗手段干扰商标识别来逃避侵权，这带来了很高的知识产权风险和财务损失。为了促进先进的多媒体人工智能技术的发展，以保护企业来之不易的创作和想法免受恶意使用和剽窃，因此提出了鲁棒性标识检测挑战赛

65 Dec 22, 2022

Data-Uncertainty Guided Multi-Phase Learning for Semi-supervised Object Detection

An official implementation of paper Data-Uncertainty Guided Multi-Phase Learning for Semi-supervised Object Detection

11 Nov 23, 2022

Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning"

(CVPR 2022) Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning ArXiv This repo contains Official Implementat

24 Nov 1, 2022

Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

Comments

randomness in apr-p and apr-s

Hi

I see that in both APR-P and APR-S there is some randomness in the augmentations. For APR-P, the data is mixed only 40% of the time https://github.com/iCGY96/APR/blob/ca61e352c5ea6e8078776f605c03752abcd29493/core/train.py#L16

and in APR-S, the data is augmented 50% of the time https://github.com/iCGY96/APR/blob/ca61e352c5ea6e8078776f605c03752abcd29493/datasets/APR.py#L25

with additional randomness in the choice of image whose phase is returned https://github.com/iCGY96/APR/blob/ca61e352c5ea6e8078776f605c03752abcd29493/datasets/APR.py#L46

Were these steps used to obtain the results in your paper, as I do not find this randomness mentioned in the paper?

Also, could you briefly describe how APR-SP was implemented - did you first apply APR-S and then apply APR-P?

Thank you, KC

opened by kiranchari 5
Question about the section 5.3 in your paper.

Hi @iCGY96! Thanks for your insightful work! I found it is hard to understand in Section 5.3: Labeled by Amplitude or Phase? In this section, the authors investigated the use of labels amplitude and phase separately. "The model is trained for the sample x combined by the phase spectrum Pxi and the amplitude spectrum Axj by optimizing Eq(7)". How to understand x combined by the phase spectrum P and the amplitude spectrum A. Is it means that you first decouple the x into A and P, then input the A and P to the networks with two FC layers, or input [X, P] and [X, A] into the networks?

opened by CoderHHX 3
Can you share the implementation codes for AutoAttack? (FSGM)

I read your paper very impressive using Fourier transform. I wish I could have the implementations to run the AutoAttack experiments. Can you share your code for FSGM training/eval?

opened by wjun0830 1
dafault path of data_dir is wrong in ./autoattacks/train_fgsm.py

if you have tried ./main.py and prepared datasets in ./data, and you want to train the code in ./autoattacks like me, attention here. in line 26 of train_fgsm.py: parser.add_argument('--data-dir', default='../data', type=str) should be modifieed to: parser.add_argument('--data-dir', default='../data/cifar10', type=str)

opened by changzhang777 0

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Related tags

Overview

Amplitude-Phase Recombination (ICCV'21)

Highlights

Citation

1. Requirements

Environments

Datasets

2. Training & Evaluation

3. Results

Fourier Analysis

ImageNet-C

You might also like...

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

Data-Uncertainty Guided Multi-Phase Learning for Semi-supervised Object Detection

Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning"

Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

《Truly shift-invariant convolutional neural networks》(2021)

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Code for ICCV 2021 paper "Distilling Holistic Knowledge with Graph Neural Networks"

Comments

randomness in apr-p and apr-s

Question about the section 5.3 in your paper.

Can you share the implementation codes for AutoAttack? (FSGM)

dafault path of data_dir is wrong in ./autoattacks/train_fgsm.py

Owner

Guangyao Chen

A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."

Official DGL implementation of "Rethinking High-order Graph Convolutional Networks"

Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting (ICCV, 2021)

This repository implements and evaluates convolutional networks on the Möbius strip as toy model instantiations of Coordinate Independent Convolutional Networks.

Code Release for ICCV 2021 (oral), "AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds"

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

EdMIPS: Rethinking Differentiable Search for Mixed-Precision Neural Networks

Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness

FcaNet: Frequency Channel Attention Networks