Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Sayak Paul

Last update: Dec 11, 2022

Related tags

Deep Learning tensorflow vision robustness adversarial-defense randomized-smoothing denoised-randomized-smoothing

Overview

Denoised-Smoothing-TF

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Denoised Smoothing is a simple and elegant way to (provably) robustify pre-trained image classification models (including the cloud APIs with only query access) and l2 adversarial attacks. This blog post provides a nice introduction to the method. The figure below summarizes what Denoised Smoothing is and how it works:

Source

Take a pre-trained classifier and prepend a pre-trained denoiser with it. Of course, the dataset on which the classifier and the denoiser would need to be trained on the same/similar dataset.
Apply Randomized Smoothing.

Randomized Smoothing is a well-tested method to provably defend against l2 adversarial attacks under a specific radii. But it assumes that a classifier performs well under Gaussian noisy perturbations which may not always be the case.

Note: I utilized many scripts from the official repository of Denoised Smoothing to develop this repository. My aim with this repository is to provide a template for researchers to conduct certification tests with Keras/TensorFlow models. I encourage the readers to check out the original repository, it's really well-developed.

Further notes

The Denoised Smoothing process is demonstrated on the CIFAR-10 dataset.
You can train a classifier quickly with the Train_Classifier.ipynb notebook.
Training the denoiser is demonstrated in the Train_Denoiser.ipynb notebook.
Certification tests are in Certification_Test.ipynb notebook.

All the notebooks can be executed on Colab! You also have the option to train using the free TPUs.

Results

Denoiser with stability objective	Denoiser with MSE objective

As we can see prepending a pre-trained denoiser is extremely helpful for our purpose.

Models

The models are available inside models.tar.gz in the SavedModel format. In the interest of reproducibility, the initial model weights are also provided.

Acknowledgements

Hadi Salman (first author of Denoised Smoothing) for fruitful discussions.
ML-GDE program for providing GCP credits.

Paper citation

@inproceedings{NEURIPS2020_f9fd2624,
 author = {Salman, Hadi and Sun, Mingjie and Yang, Greg and Kapoor, Ashish and Kolter, J. Zico},
 booktitle = {Advances in Neural Information Processing Systems},
 editor = {H. Larochelle and M. Ranzato and R. Hadsell and M. F. Balcan and H. Lin},
 pages = {21945--21957},
 publisher = {Curran Associates, Inc.},
 title = {Denoised Smoothing: A Provable Defense for Pretrained Classifiers},
 url = {https://proceedings.neurips.cc/paper/2020/file/f9fd2624beefbc7808e4e405d73f57ab-Paper.pdf},
 volume = {33},
 year = {2020}
}

Comments

float32

Hello, thank you for implementing this great application. However, for your train_denoiser.ipynb, I ran it on my Google Colab and it gave me a mismatched type of float32. If I change all float32 types to float64, it fixes the error. This can be something to keep in mind in the next update of your code.

opened by le000043 4
Performance comparison

Not sure if I missed anything, Have you conducted experiments comparing your implementation and the original implementation to see if there exist any discrepancies in adversarial defense performance? Thanks

opened by le000043 1

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

PINTO_model_zoo Please read the contents of the LICENSE file located directly under each folder before using the model. My model conversion scripts ar

2.4k Jan 5, 2023

A data-driven approach to quantify the value of classifiers in a machine learning ensemble.

Documentation | External Resources | Research Paper Shapley is a Python library for evaluating binary classifiers in a machine learning ensemble. The

188 Dec 29, 2022

This implements one of result networks from Large-scale evolution of image classifiers

Exotic structured image classifier This implements one of result networks from Large-scale evolution of image classifiers by Esteban Real, et. al. Req

54 Nov 25, 2022

Face recognition with trained classifiers for detecting objects using OpenCV

Face_Detector Face recognition with trained classifiers for detecting objects using OpenCV Libraries required to be installed using pip Command: cv2 n

0 Oct 31, 2021

Demos of essentia classifiers hosted on replicate.ai

essentia-replicate-demos Demos of Essentia models hosted on replicate.ai's MTG site. The models Check our site for a complete list of the models avail

Music Technology Group - Universitat Pompeu Fabra

12 Nov 14, 2022

Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have undergone breast cancer surgery.

1 Dec 28, 2021

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Related tags

Overview

Denoised-Smoothing-TF

Further notes

Results

Models

Acknowledgements

Paper citation

You might also like...

A data-driven approach to quantify the value of classifiers in a machine learning ensemble.

This implements one of result networks from Large-scale evolution of image classifiers

Face recognition with trained classifiers for detecting objects using OpenCV

Demos of essentia classifiers hosted on replicate.ai

Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have undergone breast cancer surgery.

Effect of Different Encodings and Distance Functions on Quantum Instance-based Classifiers

Official PyTorch implementation and pretrained models of the paper Self-Supervised Classification Network

PyTorch implementation and pretrained models for XCiT models. See XCiT: Cross-Covariance Image Transformer

Comments

float32

Performance comparison

Owner

Sayak Paul

Minimal diffusion models - Minimal code and simple experiments to play with Denoising Diffusion Probabilistic Models (DDPMs)

SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems

A certifiable defense against adversarial examples by training neural networks to be provably robust

Hierarchical-Bayesian-Defense - Towards Adversarial Robustness of Bayesian Neural Network through Hierarchical Variational Inference (Openreview)

Dcf-game-infrastructure-public - Contains all the components necessary to run a DC finals (attack-defense CTF) game from OOO

Implementation of Online Label Smoothing in PyTorch

LVI-SAM: Tightly-coupled Lidar-Visual-Inertial Odometry via Smoothing and Mapping

Image Processing, Image Smoothing, Edge Detection and Transforms

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Minimal implementation of PAWS (https://arxiv.org/abs/2104.13963) in TensorFlow.