A "gym" style toolkit for building lightweight Neural Architecture Search systems

Jack Turner

Last update: Nov 5, 2022

Related tags

Overview

A "gym" style toolkit for building lightweight Neural Architecture Search systems. I know, the name is awful.

Installation

Preferred option: Install from source:

git clone [email protected]:jack-willturner/gymnastics.git
cd gymnastics
python setup.py install

To install the latest release version:

pip install gymnastics

If you want to use NAS-Bench-101, follow the instructions here to get it set up.

Overview

Over the course of the final year of my PhD I worked a lot on Neural Architecture Search (NAS) and built a bunch of tooling to make my life easier. This is an effort to standardise the various features into a single framework and provide a "gym" style toolkit for comparing various algorithms.

The key use cases for this library are:

test out new predictors on various NAS benchmarks
visualise the cells/graphs of your architectures
add new operations to NAS spaces
add new backbones to NAS spaces

The framework revolves around three key classes:

Model
Proxy
SearchSpace

The anatomy of NAS

We can break down NAS spaces into three separate components: the skeleton or backbone of the network, the possible cells that can fill the skeletons, and the possible operations that can fill the cells. NAS papers and benchmarks all define their own versions of each of these variables. Our goal here is to de-couple the "search strategy" from the "search space" by allowing NAS designers to test out their technique on many NAS search spaces very easily. Specifically, the goal is the provide an easy interface for defining each column of the picture above.

Obligatory builder pattern README example

Using gymnastics we can very easily reconstruct NAS spaces (the goal being that it's easy to define new and exciting ones).

For example, here's how easy it is to redefine the NATS-Bench / NAS-Bench-201 search space:

best_score: best_score = score best_model = model best_model.show_picture() ">

from gymnastics.searchspace import SearchSpace, CellSpace, Skeleton
from gymnastics.searchspace.ops import Conv3x3, Conv1x1, AvgPool2d, Skip, Zeroize

search_space = SearchSpace(
    CellSpace(
        ops=[Conv3x3, Conv1x1, AvgPool2d, Skip, Zeroize], num_nodes=4, num_edges=6
    ),
    Skeleton(
        style=ResNetCIFAR,
        num_blocks=[5, 5, 5],
        channels_per_stage=[16, 32, 64],
        strides_per_stage=[1, 2, 2],
        block_expansion=1
    ),
)


# create an accuracy predictor
from gymnastics.proxies import NASWOT
from gymnastics.datasets import CIFAR10Loader

proxy = NASWOT()
dataset = CIFAR10Loader(path="~/datasets/cifar10", download=False)

minibatch, _ = dataset.sample_minibatch()

best_score = 0.0
best_model = None

# try out 10 random architectures and save the best one
for i in range(10):

    model = search_space.sample_random_architecture()

    y = model(minibatch)

    score = proxy.score(model, minibatch)

    if score > best_score:
        best_score = score
        best_model = model

best_model.show_picture()

Which prints:

Have a look in examples/ for more examples.

NAS-Benchmarks

If you have designed a new proxy for accuracy and want to test its performance, you can use the benchmarks available in benchmarks/.

The interface to the benchmarks is exactly the same as the above example for SearchSpace.

For example, here we score networks from the NDS ResNet space using random input data:

import torch
from gymnastics.benchmarks import NDSSearchSpace
from gymnastics.proxies import Proxy, NASWOT

search_space = NDSSearchSpace(
    "~/nds/data/ResNet.json", searchspace="ResNet"
)

proxy: Proxy = NASWOT()
minibatch: torch.Tensor = torch.rand((10, 3, 32, 32))

scores = []

for _ in range(10):
    model = search_space.sample_random_architecture()
    scores.append(proxy.score(model, minibatch))

Additional supported operations

In addition to the standard NAS operations we include a few more exotic ones, all in various states of completion:

Op	Paper	Notes
conv	-	params: kernel size
gconv	-	+ params: group
depthwise separable	pdf	+ no extra params needed
mixconv	pdf	+ params: needs a list of kernel_sizes
octaveconv	pdf	Don't have a sensible way to include this as a single operation yet
shift	pdf	no params needed
ViT	pdf
Fused-MBConv	pdf
Lambda	pdf

Repositories that use this framework

NASWOT

Alternatives

If you are looking for alternatives to this library, there are a few which I will try to keep a list of here:

You might also like...

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets This is the official PyTorch implementation for the paper Rapid Neural A

48 Dec 26, 2022

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

94 Oct 26, 2022

A "gym" style toolkit for building lightweight Neural Architecture Search systems

Related tags

Overview

Installation

Overview

The anatomy of NAS

Obligatory builder pattern README example

NAS-Benchmarks

Additional supported operations

Repositories that use this framework

Alternatives

You might also like...

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

Deep Multimodal Neural Architecture Search

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)

"NAS-Bench-301 and the Case for Surrogate Benchmarks for Neural Architecture Search".

Code release to accompany paper "Geometry-Aware Gradient Algorithms for Neural Architecture Search."

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Owner

Jack Turner

code for paper "Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?"

[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search

PocketNet: Extreme Lightweight Face Recognition Network using Neural Architecture Search and Multi-Step Knowledge Distillation

Transfer style api - An API to use with Tranfer Style App, where you can use two image and transfer the style

Densely Connected Search Space for More Flexible Neural Architecture Search (CVPR2020)

DeepHyper: Scalable Asynchronous Neural Architecture and Hyperparameter Search for Deep Neural Networks

Model search is a framework that implements AutoML algorithms for model architecture search at scale

Fast Neural Style for Image Style Transform by Pytorch

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang