[CVPR 2021] 'Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator'

Last update: Dec 1, 2022

Overview

This is the entire codebase for the paper "Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator".

In one-shot NAS, sub-networks need to be searched from the supernet to meet different hardware constraints. However, the search cost is high, and N separate searches are needed for N different constraints. In this work, we propose a novel search strategy called architecture generator to search sub-networks by generating them, so that the search process can be much more efficient and flexible. With the trained architecture generator, given target hardware constraints as the input, N good architectures can be generated for N constraints in just one forward pass, without re-searching or supernet retraining. Moreover, we propose a novel single-path supernet, called unified supernet, to further improve search efficiency and reduce the GPU memory consumption of the architecture generator. With the architecture generator and the unified supernet, we propose a flexible and efficient one-shot NAS framework, called Searching by Generating NAS (SGNAS). The search time of SGNAS for N different hardware constraints is only 5 GPU hours, which is 4N times faster than previous SOTA single-path methods. The top-1 accuracy of SGNAS on ImageNet is 77.1%, which is comparable with the SOTAs.
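
As a back-of-the-envelope reading of the cost claim above: if SGNAS needs 5 GPU hours in total and that is 4N times faster than searching once per constraint, the implied baseline cost is about 20 GPU hours per constraint. The sketch below only restates that arithmetic. The function name is hypothetical, and the per-constraint figure is inferred from the abstract rather than reported in it.

```python
# Illustrative arithmetic only; names are hypothetical and not part of the SGNAS codebase.
SGNAS_TOTAL_GPU_HOURS = 5  # stated in the abstract, independent of N


def implied_baseline_gpu_hours(num_constraints: int) -> int:
    """Total baseline cost implied by the '4N times faster' claim for N constraints."""
    speedup = 4 * num_constraints
    return SGNAS_TOTAL_GPU_HOURS * speedup


for n in (1, 3, 10):
    # Columns: N, SGNAS GPU hours, implied baseline GPU hours.
    print(n, SGNAS_TOTAL_GPU_HOURS, implied_baseline_gpu_hours(n))
```

The gap grows linearly with N because the generator is trained once and then queried per constraint, while the implied baseline repeats its search for each constraint.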

Model Zoo

Model	FLOPs (M)	Param (M)	Top-1 (%)	Weights
SGNAS-A	373	6.0	77.1	Google drive
SGNAS-B	326	5.5	76.8	Google drive
SGNAS-C	281	4.7	76.2	Google drive

Requirements

pip3 install -r requirements.txt

[Optional] Convert the ImageNet dataset to LMDB format with utils/folder2lmdb.py
- The LMDB format speeds up the entire training process (30 mins per epoch with 4 GeForce GTX 1080 Ti GPUs)

Getting Started

Search

Training Unified Supernet

For ImageNet training, use the config file ./config_file/imagenet_config.yml. For CIFAR-100 training, use the config file ./config_file/config.yml.
Set the hyperparameter warmup_epochs in the config file to specify the number of epochs for training the unified supernet.

python3 search.py --cfg [CONFIG_FILE] --title [EXPERIMENT_TITLE]

Training Architecture Generator

For ImageNet training, use the config file ./config_file/imagenet_config.yml. For CIFAR-100 training, use the config file ./config_file/config.yml.
Set the hyperparameter warmup_epochs in the config file to 0 to skip the supernet training, and set the hyperparameter search_epochs to specify the number of epochs for training the architecture generator.

python3 search.py --cfg [CONFIG_FILE] --title [EXPERIMENT_TITLE]
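
The two search stages share one entry point and differ only in the epoch settings. The sketch below is an illustration, not part of the repository: it records those settings as plain dictionaries and builds the documented command line. The value 50 is borrowed from a user report for CIFAR-100 and is an assumption, as are the helper name `search_command` and the example experiment titles.

```python
# Hypothetical helper. Only `--cfg`, `--title`, `warmup_epochs`, and `search_epochs`
# come from the README; the epoch values and titles below are assumptions.
STAGES = {
    "supernet": {"warmup_epochs": 50, "search_epochs": 0},   # train the unified supernet
    "generator": {"warmup_epochs": 0, "search_epochs": 50},  # skip warmup, train the generator
}


def search_command(config_file: str, title: str) -> list:
    """Build the argument list for the documented search.py invocation."""
    return ["python3", "search.py", "--cfg", config_file, "--title", title]


for stage, epochs in STAGES.items():
    # The epoch values must be written into the config file by hand before each run.
    command = search_command("./config_file/config.yml", f"cifar100_{stage}")
    print(stage, epochs, " ".join(command))
```

Keeping the stage settings in one place makes it harder to start the generator stage with a nonzero warmup_epochs by accident.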

Train From Scratch

CIFAR10 or CIFAR100

Set train_portion in ./config_file/config.yml to 1

python3 train_cifar.py --cfg [CONFIG_FILE] --flops [TARGET_FLOPS] --title [EXPERIMENT_TITLE]

ImageNet

Set the target FLOPs and the corresponding config file path in run_example.sh

bash ./run_example.sh

Validate

ImageNet

SGNAS-A

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 365 --se True --activation hswish

SGNAS-B

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 320 --se True --activation hswish

SGNAS-C

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 275 --se True --activation hswish
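
The three validation commands above differ only in the value of --target_flops. A minimal sketch of a wrapper that captures this is given below; the helper name `validate_command` is hypothetical, while the flags and FLOPs targets are copied from the commands above.

```python
# Hypothetical helper; flags and FLOPs targets are copied from the README commands.
TARGET_FLOPS = {"SGNAS-A": 365, "SGNAS-B": 320, "SGNAS-C": 275}


def validate_command(model: str, val_path: str, checkpoint: str, config_file: str) -> list:
    """Build the validate.py argument list for one of the released models."""
    return [
        "python3", "validate.py", val_path,
        "--checkpoint", checkpoint,
        "--config_path", config_file,
        "--target_flops", str(TARGET_FLOPS[model]),
        "--se", "True",
        "--activation", "hswish",
    ]


for name in TARGET_FLOPS:
    # Bracketed placeholders are kept as in the README; substitute real paths before running.
    print(" ".join(validate_command(name, "[VAL_PATH]", "[CHECKPOINT_PATH]", "[CONFIG_FILE]")))
```

The resulting list can be passed to `subprocess.run` once the placeholders are replaced with real paths.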

Reference

Citation

@InProceedings{sgnas,
author = {Sian-Yao Huang and Wei-Ta Chu},
title = {Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator},
booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition},
year = {2021}
}

Comments

I got a top-1 validation accuracy of 0.7102. Hope to get some help.
Thanks for your great work. I tried to reproduce your experiment on CIFAR-100.

First, I set warmup_epochs=50 and search_epochs=0 in cifar_config.yml to train the supernet.

Second, I set warmup_epochs=0 and search_epochs=50 to train the generator.

Finally, I ran 'python3 train_cifar.py --cfg cifar_config.yml --flops 100' to generate the model and retrain it.

However, the top-1 validation accuracy I got is 0.7102. Please tell me what I did wrong.

[Figures: supernet training curve; search curve; retraining curve]
opened by zanonShao 1
An error occurs when running train_cifar.py.

Error:

Traceback (most recent call last):
  File "/home/lcf/project/SGNAS/train_cifar.py", line 97, in <module>
    arch_param = generator(prior, normalize_hardware_constraint, noise)
  File "/home/lcf/anaconda3/envs/DALLE/lib/python3.6/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
TypeError: forward() takes 3 positional arguments but 4 were given

Code:

arch_param = generator(prior, normalize_hardware_constraint, noise)

def forward(self, x, hc):
    y = x.view(1, 1, *x.shape)
    ..........
    return y

TypeError: forward() takes 3 positional arguments but 4 were given
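
The mismatch in this report can be reproduced without PyTorch: the call passes three inputs, while the quoted forward() accepts only two besides self. The sketch below uses a small stand-in class for the call dispatch of torch.nn.Module; the class names and string arguments are placeholders, and it is not the repository's generator.

```python
# Minimal stand-in for torch.nn.Module's call dispatch; no PyTorch required.
class Module:
    def __call__(self, *args, **kwargs):
        # Forward all call arguments to forward(), as nn.Module does.
        return self.forward(*args, **kwargs)


class Generator(Module):
    # Signature quoted in the report: two inputs besides self.
    def forward(self, x, hc):
        return (x, hc)


def call_with_noise():
    """Call the generator with three inputs, as train_cifar.py does in the report."""
    generator = Generator()
    try:
        generator("prior", "constraint", "noise")
    except TypeError as err:
        return str(err)
    return None


message = call_with_noise()
print(message)
```

The count in the message includes self, which is why a call with three inputs is reported as four arguments. One possible cause is that the script and the generator class come from different versions of the code, although the report does not confirm this.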

opened by MachineCF 3
Compatibility with later versions of PyTorch

With PyTorch 1.8, the original implementation raises "RuntimeError: A view was created in no_grad mode and is being modified inplace with grad mode enabled".

opened by GongXinyuu 1
