Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

NVIDIA Corporation

Last update: Dec 17, 2022

Related tags

Deep Learning machine-learning deep-neural-networks deep-learning hyperparameter-optimization hyperparameter-tuning automl

Overview

Milano

(This is a research project, not an official NVIDIA product.)

Documentation

https://nvidia.github.io/Milano

Milano (Machine learning autotuner and network optimizer) is a tool for enabling machine learning researchers and practitioners to perform massive hyperparameters and architecture searches.

You can use it to:

Tune your model on a cloud backend of your choice
Benchmark Auto-ML algorithms (see how to add new search algorithm)

Your script can use any framework of your choice, for example, TensorFlow, PyTorch, Microsoft Cognitive Toolkit etc. or no framework at all. Milano only requires minimal changes to what your script accepts via command line and what it returns to stdout.

Currently supported backends:

Azkaban - on a single multi-GPU machine or server with Azkaban installed
AWS - Amazon cloud using GPU instances
SLURM - any cluster which is running SLURM

Prerequisites

Linux
Python 3
Ensure you have Python version 3.5 or later with packages listed in the requirements.txt file.
Backend with NVIDIA GPU

How to Get Started

Install all dependencies with the following command pip install -r requirements.txt.
Follow this mini-tutorial for local machine or this mini-tutorial for AWS

Visualize

We provide a script to convert the csv file output into two kinds of graphs:

Graphs of each hyperparameter with the benchmark (e.g. valid perplexity)
Color graphs that show the relationship between any two hyperparameters and the benchmark

To run the script, use:

python3 visualize.py --file [the name of the results csv file] 
                     --n [the number of samples to visualize]
                     --subplots [the number of subplots to show in a plot]
                     --max [the max value of benchmark you care about]

You might also like...

Comments

Range of integer values

Right now, you can specify params either as discrete values or continuous values (range). It'd be nice to be able to specify that a param is within a range but only takes integer values.

For example, if I specify:

"--nhid": { "type": "integer_range", "min": 100, "max": 400 },

It'll only search for integer values between 100 and 400, inclusive.

Thanks!

opened by chiphuyen 2

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

Related tags

Overview

Milano

Documentation

Prerequisites

How to Get Started

Visualize

You might also like...

Pytorch implementation of "Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet"

Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

This python-based package offers a way of creating a parametric OpenMC plasma source from plasma parameters.

Solving SMPL/MANO parameters from keypoint coordinates.

Evolving neural network parameters in JAX.

MM1 and MMC Queue Simulation using python - Results and parameters in excel and csv files

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

Vrcwatch - Supply the local time to VRChat as Avatar Parameters through OSC

Semi-automated OpenVINO benchmark_app with variable parameters

Comments

Range of integer values

Owner

NVIDIA Corporation

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Hyper-parameter optimization for sklearn

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

Facilitating Database Tuning with Hyper-ParameterOptimization: A Comprehensive Experimental Evaluation

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

(Arxiv 2021) NeRF--: Neural Radiance Fields Without Known Camera Parameters