PyTorch Implementation of Auto-Compressing Subset Pruning for Semantic Image Segmentation
Introduction
ACoSP is an online pruning algorithm that compresses convolutional neural networks during training. It learns to select a subset of channels from convolutional layers through a sigmoid function, as shown in the figure. For each channel, a learnable weight w_i is used to scale its activations.
The segmentation maps show the outputs of compressed PSPNet-50 models trained on Cityscapes; the models are up to 16 times smaller.
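For intuition, here is a minimal, illustrative sketch of such a sigmoid channel mask in PyTorch. The class name, parameterization, and temperature handling are assumptions made for illustration, not the repository's actual implementation (see `acosp/` for that):

```python
import torch
import torch.nn as nn


class SigmoidChannelMask(nn.Module):
    """Illustrative soft channel mask (not the ACoSP implementation):
    each channel i is scaled by sigmoid(w_i / T), where the temperature T
    is annealed towards 0 during training so the mask approaches a binary
    channel selection."""

    def __init__(self, num_channels: int, temperature: float = 1.0):
        super().__init__()
        self.logits = nn.Parameter(torch.zeros(num_channels))  # one w_i per channel
        self.temperature = temperature  # updated externally by the pruning schedule

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (N, C, H, W); broadcast per-channel scales over N, H, W.
        scale = torch.sigmoid(self.logits / self.temperature)
        return x * scale.view(1, -1, 1, 1)
```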
Repository
This repository is a PyTorch implementation of ACoSP based on hszhao/semseg. It was used to run all experiments for the publication and is meant to guarantee reproducibility and auditability of our results.
The training, test, and configuration infrastructure is kept close to semseg, with only minor modifications to improve reproducibility and integrate our pruning code. The model/ package contains the PSPNet50 and SegNet model definitions. The acosp/ package contains all code required to prune during training.
The current configs expect a special folder structure (but can be easily adapted):
- `/data`: datasets and pretrained weights
- `/logs/exp`: folder to store experiments
Installation
- Clone the repository:
  ```
  git clone [email protected]:merantix/acosp.git
  ```
- Install ACoSP including its requirements:
  ```
  pip install .
  ```
Using ACoSP
The implementation of ACoSP is encapsulated in `acosp/`, and using it independently of all other experimentation code is quite straightforward.
- Create a pruner and adapt the model:
  ```python
  from acosp.pruner import SoftTopKPruner
  import acosp.inject

  # Create pruner object
  pruner = SoftTopKPruner(
      starting_epoch=0,
      ending_epoch=100,  # Pruning duration
      final_sparsity=0.5,  # Final sparsity
  )
  # Add sigmoid soft k masks to model
  pruner.configure_model(model)
  ```
- In your training loop, update the temperature of all masking layers:
  ```python
  # Update the temperature in all masking layers
  pruner.update_mask_layers(model, epoch)
  ```
- Convert the soft pruning to hard pruning when `ending_epoch` is reached:
  ```python
  if epoch == pruner.ending_epoch:
      # Convert to binary channel mask
      acosp.inject.soft_to_hard_k(model)
  ```
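Putting the three steps together, a minimal training-loop sketch could look as follows. Here `model`, `num_epochs`, and `train_one_epoch` are placeholders for your own model and training code, not part of the ACoSP API:

```python
import acosp.inject
from acosp.pruner import SoftTopKPruner

pruner = SoftTopKPruner(starting_epoch=0, ending_epoch=100, final_sparsity=0.5)
pruner.configure_model(model)  # insert sigmoid soft-k masks into the model

for epoch in range(num_epochs):
    # Anneal the temperature of all masking layers on schedule
    pruner.update_mask_layers(model, epoch)
    if epoch == pruner.ending_epoch:
        # Freeze the channel selection into binary masks
        acosp.inject.soft_to_hard_k(model)
    train_one_epoch(model)  # placeholder: one regular training epoch
```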
Experiments
- Highlights:
  - All initialization and trained models are available. The structure is:
    ```
    init/                         # initial models
    exp/
    |-- ade20k/                   # ade20k/camvid/cityscapes/voc2012/cifar10
    |   |-- pspnet50_{SPARSITY}/  # sparsity is the relative amount of weights removed, i.e. sparsity=0.75 <==> compression_ratio=4
    |   |   |-- model             # model files
    |   |   |-- ...               # config/train/test files
    |-- evals/                    # all results with class-wise IoU/Acc
    ```
- Hardware Requirements: at least 60 GB (PSPNet50) / 16 GB (SegNet) of GPU RAM; training can be distributed across multiple GPUs.
- Train:
  - Download the related datasets and symlink their paths as follows (alternatively, modify the relevant paths specified in folder `config`):
    ```
    mkdir -p /data
    ln -s /path_to_ade20k_dataset /data/ade20k
    ```
  - Download ImageNet pre-trained models and put them under folder `/data` for weight initialization. Remember to use the right dataset format detailed in FAQ.md.
  - Specify the GPU to use in the config, then start training. (Training with ACoSP has only been carried out on a single GPU; it has not been tested with DDP.) The general structure to access individual configs is as follows:
    ```
    sh tool/train.sh ${DATASET} ${CONFIG_NAME_WITHOUT_DATASET}
    ```
    E.g., to train a PSPNet50 on the ade20k dataset using the config `config/ade20k/ade20k_pspnet50.yaml`, execute:
    ```
    sh tool/train.sh ade20k pspnet50
    ```
- Test:
  - Download trained segmentation models and put them under the folder specified in the config, or modify the specified paths.
  - For full testing (to reproduce the listed performance):
    ```
    sh tool/test.sh ade20k pspnet50
    ```
- Visualization: tensorboardX is incorporated for better visualization.
  ```
  tensorboard --logdir=/logs/exp/ade20k
  ```
- Other:
  - Resources: GoogleDrive LINK contains shared models, visual predictions, and data lists.
  - Models: ImageNet pre-trained models and trained segmentation models can be accessed. Note that our ImageNet pre-trained models differ slightly from the original ResNet implementation in the beginning part.
  - Predictions: Visual predictions of several models can be accessed.
  - Datasets: attributes (`names` and `colors`) are in folder `dataset`, and some sample lists can be accessed.
  - Some FAQs: FAQ.md.
Performance
Description: mIoU and mAcc stand for mean IoU and mean accuracy of each class, respectively (a short sketch of how they are computed follows the parameter list). General parameters shared across the different datasets are listed below:
- Network: {NETWORK} @ ACoSP-{COMPRESSION_RATIO}
- Train Parameters: sync_bn(True), scale_min(0.5), scale_max(2.0), rotate_min(-10), rotate_max(10), zoom_factor(8), aux_weight(0.4), base_lr(1e-2), power(0.9), momentum(0.9), weight_decay(1e-4).
- Test Parameters: ignore_label(255).
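For reference, a minimal sketch of how mIoU and mAcc can be computed from a confusion matrix, using the standard definitions (this helper is illustrative and not taken from the repository):

```python
import numpy as np


def miou_macc(conf: np.ndarray) -> tuple[float, float]:
    """Mean IoU and mean class accuracy from a (C, C) confusion matrix
    with ground-truth classes as rows and predicted classes as columns."""
    tp = np.diag(conf).astype(float)            # correctly classified pixels per class
    gt = conf.sum(axis=1).astype(float)         # pixels per ground-truth class
    pred = conf.sum(axis=0).astype(float)       # pixels per predicted class
    iou = tp / np.maximum(gt + pred - tp, 1.0)  # per-class intersection over union
    acc = tp / np.maximum(gt, 1.0)              # per-class accuracy
    return float(iou.mean()), float(acc.mean())
```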
- ADE20K: Train Parameters: classes(150), train_h(473), train_w(473), epochs(100). Test Parameters: classes(150), test_h(473), test_w(473), base_size(512).
  - Setting: train on the train set (20210 images) and test on the val set (2000 images).

  | Network | mIoU/mAcc |
  | --- | --- |
  | PSPNet50 | 41.42/51.48 |
  | PSPNet50 @ ACoSP-2 | 38.97/49.56 |
  | PSPNet50 @ ACoSP-4 | 33.67/43.17 |
  | PSPNet50 @ ACoSP-8 | 28.04/35.60 |
  | PSPNet50 @ ACoSP-16 | 19.39/25.52 |
- PASCAL VOC 2012: Train Parameters: classes(21), train_h(473), train_w(473), epochs(50). Test Parameters: classes(21), test_h(473), test_w(473), base_size(512).
  - Setting: train on the train_aug set (10582 images) and test on the val set (1449 images).

  | Network | mIoU/mAcc |
  | --- | --- |
  | PSPNet50 | 77.30/85.27 |
  | PSPNet50 @ ACoSP-2 | 72.71/81.87 |
  | PSPNet50 @ ACoSP-4 | 65.84/77.12 |
  | PSPNet50 @ ACoSP-8 | 58.26/69.65 |
  | PSPNet50 @ ACoSP-16 | 48.06/58.83 |
- Cityscapes: Train Parameters: classes(19), train_h(713/512 - PSP/SegNet), train_w(713/1024 - PSP/SegNet), epochs(200). Test Parameters: classes(19), test_h(713/512 - PSP/SegNet), test_w(713/1024 - PSP/SegNet), base_size(2048).
  - Setting: train on the fine_train set (2975 images) and test on the fine_val set (500 images).

  | Network | mIoU/mAcc |
  | --- | --- |
  | PSPNet50 | 77.35/84.27 |
  | PSPNet50 @ ACoSP-2 | 74.11/81.73 |
  | PSPNet50 @ ACoSP-4 | 71.50/79.40 |
  | PSPNet50 @ ACoSP-8 | 66.06/74.33 |
  | PSPNet50 @ ACoSP-16 | 59.49/67.74 |
  | SegNet | 65.12/73.85 |
  | SegNet @ ACoSP-2 | 64.62/73.19 |
  | SegNet @ ACoSP-4 | 60.77/69.57 |
  | SegNet @ ACoSP-8 | 54.34/62.48 |
  | SegNet @ ACoSP-16 | 44.12/50.87 |
- CamVid: Train Parameters: classes(11), train_h(360), train_w(720), epochs(450). Test Parameters: classes(11), test_h(360), test_w(720), base_size(360).
  - Setting: train on the train set (367 images) and test on the test set (233 images).

  | Network | mIoU/mAcc |
  | --- | --- |
  | SegNet | 55.49±0.85/65.44±1.01 |
  | SegNet @ ACoSP-2 | 51.85±0.83/61.86±0.85 |
  | SegNet @ ACoSP-4 | 50.10±1.11/59.79±1.49 |
  | SegNet @ ACoSP-8 | 47.25±1.18/56.87±1.10 |
  | SegNet @ ACoSP-16 | 42.27±1.95/51.25±2.02 |
- CIFAR-10: Train Parameters: classes(10), train_h(32), train_w(32), epochs(50). Test Parameters: classes(10), test_h(32), test_w(32), base_size(32).
  - Setting: train on the train set (50000 images) and test on the test set (10000 images).

  | Network | mAcc |
  | --- | --- |
  | ResNet18 | 89.68 |
  | ResNet18 @ ACoSP-2 | 88.50 |
  | ResNet18 @ ACoSP-4 | 86.21 |
  | ResNet18 @ ACoSP-8 | 81.06 |
  | ResNet18 @ ACoSP-16 | 76.81 |
Citation
If you find the `acosp/` code or trained models useful, please consider citing our work.
For the general training code, please also consider referencing hszhao/semseg.
Question
Some FAQs are collected in FAQ.md. You are welcome to send pull requests or give advice.