Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Woojeong Kim

Last update: Dec 30, 2022

Related tags

Deep Learning neuron-merging

Overview

Neuron Merging: Compensating for Pruned Neurons

Pytorch implementation of Neuron Merging: Compensating for Pruned Neurons, accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020).

Requirements

To install requirements:

conda env create -f ./environment.yml

Python environment & main libraries:

python 3.8
pytorch 1.5.0
scikit-learn 0.22.1
torchvision 0.6.0

LeNet-300-100

To test LeNet-300-100 model on FashionMNIST, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script:

model type: original | prune | merge
pruning criterion : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

For example, to test the model after pruning 50% of the neurons with $l_1$-norm criterion, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t prune -c l1-norm -r 0.5

To test the model after merging , run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t merge -c l1-norm -r 0.5

VGG-16

To test VGG-16 model on CIFAR-10, run:

bash scripts/VGG16_CIFAR10.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

As a pretrained model on CIFAR-100 is not included, you must train it first. To train VGG-16 on CIFAR-100, run:

bash scripts/VGG16_CIFAR100_train.sh

All the hyperparameters are as described in the supplementary material.

After training, to test VGG-16 model on CIFAR-100, run:

bash scripts/VGG16_CIFAR100.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

ResNet

To test ResNet-56 model on CIFAR-10, run:

bash scripts/ResNet56_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

To test WideResNet-40-4 model on CIFAR-10, run:

bash scripts/WideResNet_40_4_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

Results

Our model achieves the following performance on (without fine-tuning) :

Image classification of LeNet-300-100 on FashionMNIST

Baseline Accuracy : 89.80%

Pruning Ratio	Prune ($l_1$-norm)	Merge
50%	88.40%	88.69%
60%	85.17%	86.92%
70%	71.26%	82.75%
80%	66.76	80.02%

Image classification of VGG-16 on CIFAR-10

Baseline Accuracy : 93.70%

Criterion	Prune	Merge
$l_1$-norm	88.70%	93.16%
$l_2$-norm	89.14%	93.16%
$l_2$-GM	87.85%	93.10%

Citation

@inproceedings{kim2020merging,
  title     = {Neuron Merging: Compensating for Pruned Neurons},
  author    = {Kim, Woojeong and Kim, Suhyun and Park, Mincheol and Jeon, Geonseok},
  booktitle = {Advances in Neural Information Processing Systems 33},
  year      = {2020}
}

You might also like...

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

UDP-Pose This is the pytorch implementation for UDP++, which won the Fisrt place in COCO Keypoint Challenge at ECCV 2020 Workshop. Top-Down Results on

20 Jul 29, 2022

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective This is the pytorch implementation of our paper "[Beta R-CNN: Looking into Pede

35 Sep 8, 2021

Official implementation of "GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators" (NeurIPS 2020)

GS-WGAN This repository contains the implementation for GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators (NeurIPS

46 Nov 9, 2022

Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)

Diverse Image Captioning with Context-Object Split Latent Spaces This repository is the PyTorch implementation of the paper: Diverse Image Captioning

34 Nov 21, 2022

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

Likelihood-Regret Official implementation of Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020. T

33 Oct 12, 2022

Official Pytorch implementation of 'GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network' (NeurIPS 2020)

Official implementation of GOCor This is the official implementation of our paper : GOCor: Bringing Globally Optimized Correspondence Volumes into You

71 Nov 18, 2022

《Dual-Resolution Correspondence Network》(NeurIPS 2020)

Comments

Question
안녕하세요. 훌륭한 연구 결과 공유해주셔서 감사합니다.

다름이 아니라, 공유해주신 레포의 코드를 보면서 궁금증이 생겨 질문 남깁니다.

Q) Decompose 할 때 Pretrained model의 정보가 필요한 이유는 무엇인가요? 논문 내 기재된 알고리즘을 봤을 때 해당 부분은 없었던 것 같은데... Pretrained Model이 아니라 L1, L2 Norm이나 Geometric Median 에 의해 계산된 점수에 의해 결정되는 것이 아닌지요?

질문을 한글로 기재했으나, 답변이 정리되는 대로 영문으로 재작성하겠습니다~!
opened by planemanner 2
why code need retrain is not identity as the paper

I read your paper and the innovation is one-shot and data-free get the better results than only pruning ,so I very interesting your work.when I download the code then run and find it will enter the train.so I want to know how to configure the args to get the model the no retrain

opened by lndawn 4
how about the steps of train, prune and merge?
I've tried train the model VGG, set args.retrain False, and then I want to run the function Decompose, so changed the args.retrain to True, while, the VGG cfg changes

if args.dataset == 'cifar10': cfg = [32, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M', 256, 256, 256, 'M', 256, 256, 256]

but when train in the begin, the default cfg is

defaultcfg = { 11: [64, 'M', 128, 'M', 256, 256, 'M', 512, 512, 'M', 512, 512],

so can not load the trained model to prune
opened by cs-heibao 2

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Requirements

LeNet-300-100

VGG-16

ResNet

Results

Image classification of LeNet-300-100 on FashionMNIST

Image classification of VGG-16 on CIFAR-10

Citation

You might also like...

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Official implementation of "GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators" (NeurIPS 2020)

Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

Official Pytorch implementation of 'GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network' (NeurIPS 2020)

《Dual-Resolution Correspondence Network》(NeurIPS 2020)

(NeurIPS 2020) Wasserstein Distances for Stereo Disparity Estimation

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

Comments

Question

why code need retrain is not identity as the paper

how about the steps of train, prune and merge?

Owner

Woojeong Kim

Neural implicit reconstruction experiments for the Vector Neuron paper

Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

Medical image analysis framework merging ANTsPy and deep learning

Vector Neurons: A General Framework for SO(3)-Equivariant Networks

Binary Stochastic Neurons in PyTorch

A library for finding knowledge neurons in pretrained transformer models.

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

WormMovementSimulation - 3D Simulation of Worm Body Movement with Neurons attached to its body