# Get Fooled for the Right Reason
Official repository for the NeurIPS 2021 paper *Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided Curriculum Learning Approach*.
## Dependencies
- Tensorflow 1.14.0
- Python 3.7
## Datasets
- CIFAR10: https://www.cs.toronto.edu/~kriz/cifar.html
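The CIFAR10 "python version" from the link above ships as pickled batches (`data_batch_1` through `data_batch_5`, plus `test_batch`). The repository's own loader is `cifar10_input.py`; the snippet below is only a minimal, generic sketch of reading that format, and the `datasets/cifar-10-batches-py` path is an assumption about where the archive is unpacked.

```python
# Minimal, generic sketch of reading CIFAR10's pickled "python version"
# batches. The repository's actual loader is cifar10_input.py; the
# datasets/cifar-10-batches-py path below is an assumed unpack location.
import os
import pickle

import numpy as np

def load_batch(path):
    with open(path, "rb") as f:
        batch = pickle.load(f, encoding="bytes")
    # Each batch holds 10000 images as rows of 3072 uint8 values
    # (1024 red, 1024 green, 1024 blue), plus integer labels.
    images = batch[b"data"].reshape(-1, 3, 32, 32).transpose(0, 2, 3, 1)
    labels = np.asarray(batch[b"labels"])
    return images, labels

def load_cifar10(root="datasets/cifar-10-batches-py"):
    train = [load_batch(os.path.join(root, "data_batch_%d" % i)) for i in range(1, 6)]
    x_train = np.concatenate([x for x, _ in train])
    y_train = np.concatenate([y for _, y in train])
    x_test, y_test = load_batch(os.path.join(root, "test_batch"))
    return (x_train, y_train), (x_test, y_test)
```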
## Models
- `modelGTP_cifar10`: https://www.dropbox.com/sh/29n2lt08ypjdw67/AABSZlD8nTM08E-bcZv1mdkOa?dl=0
## Usage
- Install dependencies with `pip install -r requirements.txt`. Preferably, create an anaconda environment first.
- Download and save the datasets in the `datasets/` folder.
- Download and save the model in the `models/` folder.
- Run `python eval_attack.py`.
- The evaluation results will be stored in the `attack_log` directory.
## Note
Using a GPU is highly recommended.
## Code overview
- `model_new.py`: contains code for the IGAM model architectures.
- `cifar10_input.py`: provides utility functions and classes for loading the CIFAR10 dataset.
- `PGD_attack.py`: generates adversarial examples and saves them in `attacks/` (a generic sketch of the attack appears after this list).
- `run_attack.py`: evaluates the model on adversarial examples from `attacks/`.
- `config_attack.py`: parameters for adversarial example evaluation.
- `eval_attack.py`: runs the FGSM, PGD-5, PGD-10, and PGD-20 attacks and logs the results in the `attack_log` directory. You can get results for other attack strengths by modifying the `num_steps` flag in the code.
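For orientation, here is a minimal NumPy sketch of the L-infinity PGD attack that `PGD_attack.py` and `eval_attack.py` are built around. It is not the repository's implementation: `grad_fn`, `epsilon`, and `step_size` are illustrative placeholders (the values follow the common CIFAR10 setting from the Madry et al. challenge), and only `num_steps` corresponds to the flag mentioned above.

```python
# Minimal sketch of L-infinity PGD (not this repository's implementation).
# grad_fn(x, y) must return the gradient of the classification loss w.r.t.
# the input x; the epsilon/step_size defaults are illustrative assumptions.
import numpy as np

def pgd_attack(x, y, grad_fn, epsilon=8.0 / 255, step_size=2.0 / 255, num_steps=20):
    # Random start inside the epsilon ball (PGD-style initialization).
    x_adv = x + np.random.uniform(-epsilon, epsilon, size=x.shape)
    x_adv = np.clip(x_adv, 0.0, 1.0)
    for _ in range(num_steps):
        # Ascend the loss along the sign of the input gradient.
        x_adv = x_adv + step_size * np.sign(grad_fn(x_adv, y))
        # Project back onto the epsilon ball and the valid pixel range.
        x_adv = np.clip(x_adv, x - epsilon, x + epsilon)
        x_adv = np.clip(x_adv, 0.0, 1.0)
    return x_adv
```

Setting `num_steps` to 5, 10, or 20 gives the PGD-5/10/20 attacks that `eval_attack.py` runs; a single step of size `epsilon` without the random start reduces to FGSM.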
## Acknowledgements
Useful code bases we used in our work:
- https://github.com/MadryLab/cifar10_challenge (for adversarial example generation and evaluation)
- https://github.com/ashafahi/free_adv_train (for model code)