《Truly shift-invariant convolutional neural networks》(2021)

Anadi Chaman

Last update: Dec 19, 2022

Related tags

Deep Learning truly_shift_invariant_cnns

Overview

Truly shift-invariant convolutional neural networks [Paper]

Authors: Anadi Chaman and Ivan Dokmanić

Convolutional neural networks were always assumed to be shift invariant, until recently when it was shown that the classification accuracy of a trained CNN can take a serious hit with merely a 1-pixel shift in input image. One of the primary reasons for this problem is the use of downsampling (popularly known as stride) layers in the networks.

In this work, we present Adaptive Polyphase Sampling (APS), an easy-to-implement non-linear downsampling scheme that completely gets rid of this problem. The resulting CNNs yield 100% consistency in classification performance under shifts without any loss in accuracy. In fact, unlike prior works, the networks exhibit perfect consistency even before training, making it the first approach that makes CNNs truly shift invariant.

This repository contains our code in PyTorch to implement APS.

ImageNet training

To train ResNet-18 model with APS on ImageNet use the following commands (training and evaluation with circular shifts).

cd imagenet_exps
python3 main.py --out-dir OUT_DIR --arch resnet18_aps1 --seed 0 --data PATH-TO-DATASET

For training on multiple GPUs:

cd imagenet_exps
python3 main.py --out-dir OUT_DIR --arch resnet18_aps1 --seed 0 --data PATH-TO-DATASET --workers NUM_WORKERS --dist-url tcp://127.0.0.1:FREE-PORT --dist-backend nccl --multiprocessing-distributed --world-size 1 --rank 0

--arch is used to specify the architecture. To use ResNet18 with APS layer and blur filter of size j, pass 'resnet18_apsj' as the argument to --arch. List of currently supported network architectures are here.

--circular_data_aug can be used to additionally train the networks with random circular shifts.

Results are saved in OUT_DIR.

CIFAR-10 training

The following commands run our implementation on CIFAR-10 dataset.

cd cifar10_exps
python3 main.py --arch 'resnet18_aps' --filter_size FILTER_SIZE --validate_consistency --seed_num 0 --device_id 0 --model_folder CURRENT_MODEL_DIRECTORY --results_root_path ROOT_DIRECTORY --dataset_path PATH-TO-DATASET

--data_augmentation_flag can be used to additionally train the networks with randomly shifted images. FILTER_SIZE can take the values between 1 to 7. The list of CNN architectures currently supported can be found here.

The results are saved in the path: ROOT_DIRECTORY/CURRENT_MODEL_DIRECTORY/

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection (ACM MM'21) By Zhuofan Zong, Qianggang Cao, Biao Leng Introduction F

9 Jul 30, 2022

A hue shift helper for OBS

obs-hue-shift A hue shift helper for OBS This is a repo based on the really nice script Hegemege made. The original script can be found https://gist.g

1 Jan 10, 2022

Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

This repository contains the official code of OSTAR in "Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift" (ICLR 2022).

5 Dec 6, 2022

The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

TwoStageAlign The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift Pa

32 Dec 15, 2022

Comments

Perfect shift-invariance with APS (not 100% consistency)

Hello,

I really appreciate your work!

When I run your repo, unfortunately, I cannot receive 100% percent consistency. In the paper, figure 5 shows that I should get 100% percent consistency even at epoch 0. Could you please share the configuration I should run to get this result?

Thanks in advance.

opened by DuyguSerbes 4
Comparison to max-pooling

Hi, interesting work! I was wondering what is the main difference of APS to simple max-pooling (resp. un-max-pooling for your follow-up paper): I guess max pooling is a special case of APS for single-channel tensors? So is the main point of your work that one can extend this concept to multi-channel tensors by choosing the pooling index based on the pixel norm over all channels? Thanks a lot!

opened by mys007 3
License for Code

Hi Anadi,

I came across your paper on arxiv a couple days ago, really cool work! I'm curious to try it out in my project, and I was wondering if it'd be possible to attach a license to the repo?

Thanks!

opened by kkl116 0

《Truly shift-invariant convolutional neural networks》(2021)

Related tags

Overview

Truly shift-invariant convolutional neural networks [Paper]

ImageNet training

CIFAR-10 training

You might also like...

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

A hue shift helper for OBS

Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

Implementation of CVPR 2021 paper "Spatially-invariant Style-codes Controlled Makeup Transfer"

Code for NeurIPS 2021 paper: Invariant Causal Imitation Learning for Generalizable Policies

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

《A-CNN: Annularly Convolutional Neural Networks on Point Clouds》(2019)

Spontaneous Facial Micro Expression Recognition using 3D Spatio-Temporal Convolutional Neural Networks

Comments

Perfect shift-invariance with APS (not 100% consistency)

Comparison to max-pooling

License for Code

Owner

Anadi Chaman

Expressive Power of Invariant and Equivaraint Graph Neural Networks (ICLR 2021)

This repository implements and evaluates convolutional networks on the Möbius strip as toy model instantiations of Coordinate Independent Convolutional Networks.

DIR-GNN - Discovering Invariant Rationales for Graph Neural Networks

Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement (NeurIPS 2020)

Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Implementation of our paper 'RESA: Recurrent Feature-Shift Aggregator for Lane Detection' in AAAI2021.

Official code for "Mean Shift for Self-Supervised Learning"

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

This is official implementaion of paper "Token Shift Transformer for Video Classification".