Manifold-Mixup implementation for fastai V2

Overview

Manifold Mixup

Unofficial implementation of Manifold Mixup (Proceedings of ICML 2019) for fast.ai (V2), based on Shivam Saboo's PyTorch implementation of manifold mixup and fastai's input mixup implementation, plus some improvements/variants that I developed with lessw2020.

This package provides four additional callbacks for the fastai learner:

  • ManifoldMixup which implements ManifoldMixup
  • OutputMixup which implements a variant that applies the mixup only to the output of the last layer (this was shown to perform better both in a benchmark and in an independent blog post)
  • DynamicManifoldMixup which lets you use manifold mixup with a schedule that increases the difficulty progressively
  • DynamicOutputMixup which lets you use output mixup with a schedule that increases the difficulty progressively

Usage

For a minimal demonstration of the various callbacks and their parameters, see the Demo notebook.

Mixup

To use manifold mixup, you need to import manifold_mixup and pass the corresponding callback to the cbs argument of your learner:

from manifold_mixup import *

learner = Learner(data, model, cbs=ManifoldMixup())
learner.fit(8)

The ManifoldMixup callback takes three parameters (see the example after this list):

  • alpha=0.4 the parameter of the beta law used to sample the interpolation weight
  • use_input_mixup=True whether to also apply mixup to the inputs
  • module_list=None can be used to pass an explicit list of target modules
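
For example, here is a minimal sketch (reusing the data and model placeholders from the snippet above) that lowers alpha and disables input mixup:

learner = Learner(data, model, cbs=ManifoldMixup(alpha=0.2, use_input_mixup=False))
learner.fit(8)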

The OutputMixup variant takes only the alpha parameter.
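
It is used the same way (a minimal sketch, reusing the data and model placeholders from above):

learner = Learner(data, model, cbs=OutputMixup(alpha=0.4))
learner.fit(8)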

Dynamic mixup

Dynamic callbacks, which are available via dynamic_mixup, take three parameters instead of the single alpha parameter:

  • alpha_min=0.0 the initial, minimum, value for the parameter of the beta law used to sample the interpolation weight (we recommend keeping it at 0)
  • alpha_max=0.6 the final, maximum, value for the parameter of the beta law used to sample the interpolation weight
  • scheduler=SchedCos the scheduling function to describe the evolution of alpha from alpha_min to alpha_max

The default schedulers are SchedLin, SchedCos, SchedNo, SchedExp and SchedPoly. See the Annealing section of fastai's documentation for more information on the available schedulers, ways to combine them and how to provide your own.
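
As an illustration, here is a hedged sketch of a dynamic callback with an explicit scheduler (the data and model names are placeholders; SchedCos is one of the built-in fastai schedulers listed above):

from dynamic_mixup import *
from fastai.callback.schedule import SchedCos

learner = Learner(data, model, cbs=DynamicManifoldMixup(alpha_min=0., alpha_max=0.6, scheduler=SchedCos))
learner.fit(8)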

Notes

Which modules will be instrumented by ManifoldMixup?

ManifoldMixup tries to establish a sensible list of modules on which to apply mixup:

  • it uses a user-provided module_list if one is given
  • otherwise it uses only the modules wrapped with ManifoldMixupModule
  • if none are found, it defaults to modules with Block or Bottleneck in their name (targeting mostly resblocks)
  • finally, if needed, it defaults to all modules that are not included in the non_mixable_module_types list

The non_mixable_module_types list contains mostly recurrent layers, but you can add elements to it in order to define module classes that should not be used for mixup (do not hesitate to open an issue or a PR to add common modules to the default list). Both manual selection mechanisms are sketched below.
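
As an illustration, here is a hedged sketch of the two manual selection mechanisms; it assumes that ManifoldMixupModule(module) is a transparent wrapper exported by manifold_mixup, and the toy model and data are placeholders:

import torch.nn as nn
from fastai.vision.all import *
from manifold_mixup import *

# toy model: the wrapped convolution is the only module picked up automatically
model = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    ManifoldMixupModule(nn.Conv2d(16, 32, 3, padding=1)), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, 10))
learner = Learner(data, model, cbs=ManifoldMixup())

# alternatively, pass an explicit list of target modules
learner = Learner(data, model, cbs=ManifoldMixup(module_list=[model[0], model[2]]))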

When can I use OutputMixup?

OutputMixup applies the mixup directly to the output of the last layer. This only works if the loss function contains something like a softmax (and not when the raw output is used directly, as it is for regression).

Thus, OutputMixup cannot be used for regression.

A note on skip-connections / residual-blocks

ManifoldMixup (this does not apply to OutputMixup) is greatly degraded when applied inside a residual block, because the mixed-up values become incoherent with the output of the skip connection (which has not been mixed).

While this implementation is equipped to work around the problem for U-Net and ResNet-like architectures, you might run into problems (negligible improvements over the baseline) with other network structures. In that case, the best way to apply manifold mixup is to manually select the modules to be instrumented, as in the sketch below.
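
For instance, with a torchvision-style ResNet one could restrict mixup to the outputs of whole residual stages rather than to the blocks inside them (a hedged sketch; the 'layer1' to 'layer4' names are those of torchvision ResNets and may differ for your architecture, and data is a placeholder dataloaders object):

from fastai.vision.all import *
from manifold_mixup import *
from torchvision.models import resnet50

model = resnet50()

# instrument only the outputs of complete stages, so that mixup never happens inside a skip connection
stage_names = ('layer1', 'layer2', 'layer3', 'layer4')
target_modules = [module for name, module in model.named_modules() if name in stage_names]

learner = Learner(data, model, cbs=ManifoldMixup(module_list=target_modules))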

For more unofficial fastai extensions, see the Fastai Extensions Repository.

Comments
  • Search for both basic and bottleneck blocks (to fix "no known network structure detected" warning with ResNet-50 and other similar models)

    Currently, when trying to use ResNet-50 (the same issue happens with ResNeSt-50 and many other similar models) from the timm library https://github.com/rwightman/pytorch-image-models, I get the following warning:

    "Manifold mixup: no known network structure detected, 126 modules will be used for mixup"

    This happens because ResNet-50 and deeper variants do not contain any BasicBlock modules and use Bottleneck blocks instead. With this patch I get this message:

    "Manifold mixup: Block structure detected, 16 modules will be used for mixup."

    I also get a better error_rate than when 126 modules were used for mixup.

    opened by Lissanro 3
  • NameError: name 'Collection' is not defined

    Thank you for your excellent work! I just ran demo.py with the newest fastai (changing the fastai2 imports to fastai). However, when I import manifold_mixup, I get an error that looks like this:

    NameError                                 Traceback (most recent call last)
    <ipython-input> in <module>
          1 from fastai.vision.all import *
    ----> 2 from manifold_mixup import *

    ~/study/ManifoldMixupV2/manifold_mixup.py in <module>
         79 # Manifold Mixup
         80
    ---> 81 class ManifoldMixup(Callback):
         82     "Callback that mixes a random inner layer and the target."
         83     run_after,run_valid = [Normalize],False

    ~/study/ManifoldMixupV2/manifold_mixup.py in ManifoldMixup()
         82     "Callback that mixes a random inner layer and the target."
         83     run_after,run_valid = [Normalize],False
    ---> 84     def __init__(self, alpha:float=0.4, use_input_mixup:bool=True, module_list:Collection=None):
         85         """
         86         alpha is the parameter for the beta law.

    NameError: name 'Collection' is not defined

    Could you help me with this?
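
    A likely fix (a hedged suggestion, not verified against the repository) is to add the missing import at the top of manifold_mixup.py, since recent fastai star imports no longer re-export Collection:

        from typing import Collection  # makes the Collection type hint used in ManifoldMixup.__init__ resolvable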

    opened by heng-yin 2
  • With learner.to_fp16() I get error "RuntimeError: expected dtype Float but got dtype Half"

    FP16 is important for better performance and for preserving VRAM; some larger models cannot be trained at all without it on my video card. The built-in MixUp() works without issues with to_fp16(), but I get the following error with ManifoldMixup() (full error log: https://pastebin.com/HPMLwg6F):

    ~/Jupyter/manifold_mixup.py in hook_mixup(self, module, input, output)
        140         if not self.mixup_has_been_applied: # performs mixup
        141             output_dims = len(output.size())
    --> 142             output = torch.lerp(output[self.shuffle], output, weight=unsqueeze(self.lam, n=output_dims-1))
        143             self.mixup_has_been_applied = True
        144             return output
    
    RuntimeError: expected dtype Float but got dtype Half
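
    A possible workaround (a hedged sketch, not verified against the repository) is to cast the interpolation weight to the dtype of the activations before calling torch.lerp in hook_mixup:

        output = torch.lerp(output[self.shuffle], output,
                            weight=unsqueeze(self.lam, n=output_dims-1).to(output.dtype))  # match Half/Float dtypes under to_fp16()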
    
    opened by Lissanro 1