DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

weijunhong

Last update: Aug 15, 2022

Related tags

Deep Learning DropNAS

Overview

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

DropNAS, a grouped operation dropout method for one-level DARTS, with better and more stable performance.

Requirements

python-3.5.2
pytorch-1.0.0
torchvision-0.2.0
tensorboardX-2.0
graphviz-0.14

How to use the code

# with the default setting presented in paper, but you may need to adjust the batch size to prevent OOM 
python3 search.py --name cifar10_example --dataset CIFAR10 --gpus 0

Augment

# use the genotype we found on CIFAR10

python3 augment.py --name cifar10_example --dataset CIFAR10 --gpus 0 --genotype "Genotype(
    normal=[[('sep_conv_3x3', 1), ('skip_connect', 0)], [('sep_conv_3x3', 1), ('sep_conv_3x3', 0)], [('sep_conv_3x3', 1), ('sep_conv_3x3', 0)], [('dil_conv_5x5', 4), ('dil_conv_3x3', 1)]],
    normal_concat=range(2, 6),
    reduce=[[('max_pool_3x3', 0), ('sep_conv_5x5', 1)], [('dil_conv_5x5', 2), ('sep_conv_5x5', 1)], [('dil_conv_5x5', 3), ('dil_conv_5x5', 2)], [('dil_conv_5x5', 3), ('dil_conv_5x5', 4)]],
    reduce_concat=range(2, 6)
)"

Results

The following results in CIFAR-10/100 are obtained with the default setting. More results with different arguements and other dataset like ImageNet can be found in the paper.

Dataset	Avg Acc (%)	Best Acc (%)
CIFAR-10	97.42±0.14	97.74
CIFAR-100	83.05±0.41	83.61

The performance of DropNAS and one-level DARTS across different search spaces on CIFAR-10/100.

Dataset	Search Space	DropNAS Acc (%)	one-level DARTS Acc (%)
CIFAR-10	3-skip	97.32±0.10	96.81±0.18
	1-skip	97.33±0.11	97.15±0.12
	original	97.42±0.14	97.10±0.16
CIFAR-100	3-skip	83.03±0.35	82.00±0.34
	1-skip	83.53±0.19	82.27±0.25
	original	83.05±0.41	82.73±0.36

The test error of DropNAS on CIFAR-10 when different operation groups are applied with different drop path rates.

	r_p=1e-5	r_p=3e-5	r_p=1e-4
r_np=1e-5	97.40±0.16	97.28±0.04	97.36±0.12
r_np=3e-5	97.36±0.11	97.42±0.14	97.31±0.05
r_np=1e-4	97.35±0.07	97.31±0.10	97.37±0.16

Found Architectures

CIFAR-10

CIFAR100

Reference

[1] https://github.com/quark0/darts (official implementation of DARTS)

[2] https://github.com/khanrc/pt.darts

[3] https://github.com/susan0199/StacNAS (feature map code used in our paper)

You might also like...

Unofficial implementation of the Involution operation from CVPR 2021

involution_pytorch Unofficial PyTorch implementation of "Involution: Inverting the Inherence of Convolution for Visual Recognition" by Li et al. prese

46 Dec 7, 2022

Unified file system operation experience for different backend

megfile - Megvii FILE library Docs: http://megvii-research.github.io/megfile megfile provides a silky operation experience with different backends (cu

76 Dec 14, 2022

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

mtomo Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation.

24 Mar 2, 2022

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring

2 Dec 28, 2021

Accelerated SMPL operation, commonly used in generate 3D human mesh, STAR included.

SMPL2 An enchanced and accelerated SMPL operation which commonly used in 3D human mesh generation. It takes a poses, shapes, cam_trans as inputs, outp

20 Oct 17, 2022

Liecasadi - liecasadi implements Lie groups operation written in CasADi

liecasadi liecasadi implements Lie groups operation written in CasADi, mainly di

14 Nov 5, 2022

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Deep Image Search - AI-Based Image Search Engine Deep Image Search is an AI-based image search engine that includes deep transfer learning features Ex

139 Jan 1, 2023

EdMIPS: Rethinking Differentiable Search for Mixed-Precision Neural Networks

EdMIPS is an efficient algorithm to search the optimal mixed-precision neural network directly without proxy task on ImageNet given computation budgets. It can be applied to many popular network architectures, including ResNet, GoogLeNet, and Inception-V3.

47 Dec 30, 2022

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective [PDF] Wuyang Chen, Xinyu Gong, Zhangyang Wang In ICLR 2

156 Nov 28, 2022

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

Related tags

Overview

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

Requirements

How to use the code

Results

Found Architectures

Reference

You might also like...

Unofficial implementation of the Involution operation from CVPR 2021

Unified file system operation experience for different backend

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring

Accelerated SMPL operation, commonly used in generate 3D human mesh, STAR included.

Liecasadi - liecasadi implements Lie groups operation written in CasADi

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

EdMIPS: Rethinking Differentiable Search for Mixed-Precision Neural Networks

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Owner

weijunhong

Differentiable architecture search for convolutional and recurrent networks

code for paper "Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?"

R-Drop: Regularized Dropout for Neural Networks

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Pytorch implementation of Learning Rate Dropout.

Unofficial PyTorch implementation of Guided Dropout

Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch

Model search is a framework that implements AutoML algorithms for model architecture search at scale

Densely Connected Search Space for More Flexible Neural Architecture Search (CVPR2020)

[ICLR2021oral] Rethinking Architecture Selection in Differentiable NAS