A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks)

Greg Dongyoon Han

Last update: Jan 3, 2023

Related tags

Deep Learning pytorch imagenet resnet residual-networks cifar-10 pyramidnet cifar-100

Overview

This repository contains a PyTorch implementation for the paper: Deep Pyramidal Residual Networks (CVPR 2017, Dongyoon Han*, Jiwhan Kim*, and Junmo Kim; * denotes equal contribution). The code in this repository is based on the example provided in PyTorch examples and the nice implementation of Densely Connected Convolutional Networks.

Two other implementations with LuaTorch and Caffe are provided:

A LuaTorch implementation for PyramidNets,
A Caffe implementation for PyramidNets.

Usage examples

To train an additive PyramidNet-200 (alpha=300 with bottleneck) on the ImageNet-1k dataset with 8 GPUs:

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python train.py --data ~/dataset/ILSVRC/Data/CLS-LOC/ --net_type pyramidnet --lr 0.05 --batch_size 128 --depth 200 -j 16 --alpha 300 --print-freq 1 --expname PyramidNet-200 --dataset imagenet --epochs 100

To train an additive PyramidNet-110 (alpha=64 without bottleneck) on the CIFAR-10 dataset with a single GPU:

CUDA_VISIBLE_DEVICES=0 python train.py --net_type pyramidnet --alpha 64 --depth 110 --no-bottleneck --batch_size 32 --lr 0.025 --print-freq 1 --expname PyramidNet-110 --dataset cifar10 --epochs 300

To train an additive PyramidNet-164 (alpha=48 with bottleneck) on the CIFAR-100 dataset with 4 GPUs:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --net_type pyramidnet --alpha 48 --depth 164 --batch_size 128 --lr 0.5 --print-freq 1 --expname PyramidNet-164 --dataset cifar100 --epochs 300

Notes

This implementation contains the training (+test) code for the add-PyramidNet architecture on the ImageNet-1k, CIFAR-10, and CIFAR-100 datasets.
The traditional data augmentation for the ImageNet and CIFAR datasets is used, following fb.resnet.torch.
Example code for ResNet and Pre-ResNet is also included.
For efficient training on the ImageNet-1k dataset, Intel MKL and NVIDIA NCCL are prerequisites. Please check the official PyTorch GitHub repository for installation instructions.

Tracking training progress with TensorBoard

Thanks to the referenced implementation, which supports TensorBoard for tracking training progress efficiently, all experiments can be tracked with tensorboard_logger.

tensorboard_logger can be installed with:

pip install tensorboard_logger

Paper Preview

Abstract

Deep convolutional neural networks (DCNNs) have shown remarkable performance in image classification tasks in recent years. Generally, deep neural network architectures are stacks consisting of a large number of convolution layers, and they perform downsampling along the spatial dimension via pooling to reduce memory usage. At the same time, the feature map dimension (i.e., the number of channels) is sharply increased at downsampling locations, which is essential to ensure effective performance because it increases the capability of high-level attributes. Moreover, this also applies to residual networks and is very closely related to their performance. In this research, instead of using downsampling to achieve a sharp increase at each residual unit, we gradually increase the feature map dimension at all the units to involve as many locations as possible. This is discussed in depth together with our new insights as it has proven to be an effective design to improve the generalization ability. Furthermore, we propose a novel residual unit capable of further improving the classification accuracy with our new network architecture. Experiments on benchmark CIFAR datasets have shown that our network architecture has a superior generalization ability compared to the original residual networks.

Schematic Illustration

We provide a simple schematic illustration comparing several network architectures, which have (a) basic residual units, (b) bottleneck residual units, (c) wide residual units, (d) our pyramidal residual units, and (e) our pyramidal bottleneck residual units.

Experimental Results

The results are readily reproduced and match the performance obtained with the LuaTorch implementation for PyramidNets.

[Figure: comparison of state-of-the-art networks, Top-1 test error rate vs. number of parameters.]

[Table: Top-1 test error rates (%) on the CIFAR datasets.] All PyramidNet results are produced with additive PyramidNets, and α denotes alpha (the widening factor). “Output Feat. Dim.” denotes the feature dimension just before the last softmax classifier.
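
To make α and “Output Feat. Dim.” concrete, here is a minimal sketch of the additive widening rule from the paper: each residual unit adds α/N channels, where N is the total number of units. The function name, the starting width of 16 channels, the rounding to the nearest integer, and the CIFAR depth-to-unit mapping (three stages, 2 convolution layers per basic unit, 3 per bottleneck unit) are assumptions made for illustration; this is not a copy of the code in this repository.

```python
def additive_pyramid_dims(depth, alpha, bottleneck=False, base_dim=16):
    """Channel width of every residual unit under additive widening (sketch)."""
    layers_per_unit = 3 if bottleneck else 2  # assumed CIFAR-style layout
    if (depth - 2) % (3 * layers_per_unit) != 0:
        raise ValueError("depth does not fit the assumed three-stage layout")
    n_units = 3 * ((depth - 2) // (3 * layers_per_unit))  # N in the paper
    step = alpha / n_units  # channels added per unit
    dims = []
    width = float(base_dim)
    for _ in range(n_units):
        width += step  # accumulate as a float, round only for the layer size
        dims.append(int(round(width)))
    return dims


dims = additive_pyramid_dims(depth=110, alpha=48)
print(len(dims), dims[0], dims[-1])
```

Under these assumptions the last unit has 16 + α channels. For bottleneck units the output feature dimension would be four times that, assuming the usual 4x bottleneck expansion.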

ImageNet-1k Pretrained Models

A PyramidNet-101 model (alpha=360) was trained from scratch using the code in this repository; single-crop (224x224) validation error rates are reported:

Network Type	Alpha	# of Params	Top-1 err(%)	Top-5 err(%)	Model File
ResNet-101 (Caffe model)	-	44.7M	23.6	7.1	Original Model
ResNet-101 (LuaTorch model)	-	44.7M	22.44	6.21	Original Model
PyramidNet-v1-101	360	42.5M	21.98	6.20	Download

Note that the widely used ResNet-101 (Caffe model) above was trained on images whose pixel intensities are in [0,255] and centered by the mean image, whereas our PyramidNet-101 was trained on images whose pixel values are standardized.
The model was originally trained with PyTorch-0.4, and the num_batches_tracked keys were excluded for convenience (in PyTorch >= 0.4, a BatchNorm2d layer with track_running_stats enabled stores a num_batches_tracked key in its state).
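
The key exclusion described above can be sketched as a simple filter over a state_dict-like mapping. The helper name and the example keys below are hypothetical, and plain numbers stand in for tensors so the snippet runs without PyTorch.

```python
from collections import OrderedDict


def drop_num_batches_tracked(state_dict):
    """Return a copy of `state_dict` without BatchNorm `num_batches_tracked` entries."""
    return OrderedDict(
        (key, value)
        for key, value in state_dict.items()
        if not key.endswith("num_batches_tracked")
    )


# Hypothetical keys; real checkpoints hold tensors, not floats.
example = OrderedDict([
    ("bn1.weight", 1.0),
    ("bn1.running_mean", 0.0),
    ("bn1.num_batches_tracked", 100),
])
print(list(drop_num_batches_tracked(example)))
```

When loading a checkpoint that lacks these keys into a model that expects them, passing strict=False to load_state_dict is one way to tolerate the missing entries.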

Updates

Some minor bugs are fixed (2018/02/22).
train.py is updated (including ImageNet-1k training code) (2018/04/06).
resnet.py and PyramidNet.py are updated (2018/04/06).
preresnet.py (Pre-ResNet architecture) is uploaded (2018/04/06).
A pretrained model using PyTorch is uploaded (2018/07/09).

Citation

Please cite our paper if PyramidNets are used:

@article{DPRN,
  title={Deep Pyramidal Residual Networks},
  author={Han, Dongyoon and Kim, Jiwhan and Kim, Junmo},
  journal={IEEE CVPR},
  year={2017}
}

If this implementation is useful, please cite or acknowledge this repository in your work.

Contact

Dongyoon Han ([email protected]), Jiwhan Kim ([email protected]), Junmo Kim ([email protected])

Comments

CIFAR pretrained models

Hey, would it be possible to release a pretrained model that achieves ~16% error rate on CIFAR-100? The memory requirement is too big for my current machine to train from scratch. Thanks in advance.

opened by idansc 1
Useful training tips: small bug for Python >= 3.7
async is a reserved keyword in Python 3.7 and later, so target.cuda(async=True) raises a SyntaxError when the scripts are run.

To fix this, change lines 276 and 325 in train.py from

target = target.cuda(async=True)

to

target = target.cuda(non_blocking=True)

Also, remember to use the code path that matches your PyTorch version in train.py, lines 327 to 338.
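
The failure can be checked without PyTorch or a GPU. The sketch below only asks the interpreter whether each call parses; the helper name is hypothetical. To our knowledge async is fully reserved from Python 3.7 onward, so on such interpreters the old call is rejected at parse time, before any training code runs.

```python
def parses_as_expression(source):
    """Return True if `source` is a syntactically valid Python expression here."""
    try:
        # compile() only parses; `target` does not need to exist.
        compile(source, "<snippet>", "eval")
        return True
    except SyntaxError:
        return False


old_call = "target.cuda(async=True)"
new_call = "target.cuda(non_blocking=True)"
print("old call parses:", parses_as_expression(old_call))
print("new call parses:", parses_as_expression(new_call))
```

Because this is a syntax error rather than a runtime error, wrapping the old call in try/except inside train.py would not help; the line itself has to be changed.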
opened by reverseuniverse 0
