Bayesian Neural Networks in PyTorch

Jurijs Nazarovs

Last update: May 3, 2022

Related tags

Deep Learning bayesian_nn

Overview

We present the new scheme to compute Monte Carlo estimator in Bayesian VI settings with almost no memory cost in GPU, regardles of the number of samples. Our method is described in the paper (UAI2021): "Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks".

In addition, we provide an implementation framework to make your deterministic network Bayesian in PyTorch.

If you like our work, please click on a star. If you use our code in your research projects, please cite our paper above.

Bayesify your Neural Network

There are 3 main files which help you to Bayesify your deterministic network:

bayes_layers.py - file contains a bayesian implementation of convolution(1d, 2d, 3d, transpose) and linear layers, according to approx posterior from Location-Scale family, i.e. which has 2 parameters mu and sigma. This file contains general definition, independent of specific distribution, as long as distribution contains 2 parameters mu and sigma. It uses forward method defined in vi_posteriors.py file. One of the main arguments for redefined classes is approx_post, which defined which posterior class to use from vi_posteriors.py. Please, specify this name same way as defined class in vi_posteriors.py. For example, if vi_posteriors.py contains class Gaus, then approx_post='Gaus'.
vi_posteriors.py - file describes forward method, including kl term, for different approximate posterior distributions. Current implementation contains following disutributions:

Radial
Gaus

If you would like to implement your own class of distrubtions, in vi_posteriors.py copy one of defined classes and redefine following functions: forward(obj, x, fun=""), get_kl(obj, n_mc_iter, device).

It also contains usefull Utils class which provides

definition of loss functions:
- get_loss_categorical
- get_loss_normal,
different beta coefficients: get_beta for KL term and
allows to turn on/off computing the KL term, with function set_compute_kl. this is useful, when you perform testing/evaluation, and kl term is not required to be computed. In that case it accelerates computations.

Below is an example to bayesify your own network. Note the forward method, which handles situations if a layer is not of a Bayesian type, and thus, does not return kl term, e.g. ReLU(x).

import bayes_layers as bl # important for defining bayesian layers
class YourBayesNet(nn.Module):
    def __init__(self, num_classes, in_channels, 
                 **bayes_args):
        super(YourBayesNet, self).__init__()
        self.conv1 = bl.Conv2d(in_channels, 64,
                               kernel_size=11, stride=4,
                               padding=5,
                               **bayes_args)
        self.classifier = bl.Linear(1*1*128,
                                    num_classes,
                                    **bayes_args)
        self.layers = [self.conv1, nn.ReLU(), self.classifier]
        
    def forward(self, x):
        kl = 0
        for layer in self.layers:
            tmp = layer(x)
            if isinstance(tmp, tuple):
                x, kl_ = tmp
                kl += kl_
            else:
                x = tmp

        x = x.view(x.size(0), -1)
        logits, _kl = self.classifier.forward(x)
        kl += _kl
        
        return logits, kl

Then later in the main file during training, you can either use one of the loss functions, defined in utils as following:

output, kl = model(inputs)
kl = kl.mean()  # if several gpus are used to split minibatch

loss, _ = vi.Utils.get_loss_categorical(kl, output, targets, beta=beta) 
#loss, _ = vi.Utils.get_loss_normal(kl, output, targets, beta=beta) 
loss.backward()

or design your own, e.g.

loss = kl_coef*kl - loglikelihood
loss.backward()

uncertainty_estimate.py - file describes set of functions to perform uncertainty estimation, e.g.

get_prediction_class - function which return the most common class in iterations
summary_class - function creates a summary file with statistics

Current implementation of networks for different problems

Classification

Script bayesian_dnn_class/main.py is the main executable code and all standard DNN models are located in bayesian_dnn_class/models, and are:

AlexNet
Fully Connected
DenseNet
ResNet
VGG

Comments

Out of memory problem

Hello, I am trying to run your code on my pc which has a 32GB RAM CPU and a 3GB GTX 1060 GPU. And I am using my own dataset with 100 * 7 images in total. However, I got 'Out Of Memory' error before the very first epoch for net like vgg11, even if the batch size is 1. With CPU, the error is

RuntimeError: $ Torch: not enough memory: you tried to allocate 0GB. Buy new RAM! at ..\aten\src\TH\THGeneral.cpp:204

With GPU, the error is

RuntimeError: CUDA error: out of memory

I wonder if it is a hardware problem, or is there some restriction for the input size of images for different NNs? If the first case, what will be the basic hardware requirement?

Thank you very much.
good first issue question

opened by urnotmeeto 11

Bayesian Neural Networks in PyTorch

Related tags

Overview

Bayesify your Neural Network

Current implementation of networks for different problems

Classification

You might also like...

A framework that constructs deep neural networks, autoencoders, logistic regressors, and linear networks

Code for "Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks", CVPR 2021

DeepHyper: Scalable Asynchronous Neural Architecture and Hyperparameter Search for Deep Neural Networks

Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

Spearmint Bayesian optimization codebase

Python Environment for Bayesian Learning

Python package for Bayesian Machine Learning with scikit-learn API

Simple, but essential Bayesian optimization package

Bayesian dessert for Lasagne

Comments

Out of memory problem

Owner

Jurijs Nazarovs

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

pyhsmm - library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.

Bayesian Neural Networks in PyTorch

Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

An implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks in PyTorch.

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

Bayesian optimization in PyTorch