MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Bhanu

Last update: Jan 16, 2022

Related tags

Deep Learning mixrnet

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Using mixup data augmentation as reguliraztion and tuning the hyper parameters of ResNet 50 models to achieve 94.57% test accuracy on CIFAR-10 Dataset. Link to paper

network	error %
resnet-50	6.97
resnet-110	6.61
resnet-164	5.93
resnet-1001	7.61
This method	5.43

Overview

Change the wandb api key to valid api key.
Python 3.8 and pytorch 1.9 (works on older versions as well)
main.py is to train model
sweep.py and sweep_config.py are for hyperparameter optimization for experiment tracking wandb is used please change api key
pred.py is to run the trained model on the custom data. (Appropriately provide model paths)

Important

If you want to run sweep.py then you must use wandb apikey and if you want to run main.py use wandb to log the experiment for comparision else comment out wandb part.

Training


# Start training with:

python main.py (Added --run_name optional argument for better tracking experiments)

  

# You can manually resume the training with:

python main.py --resume --lr=0.01

Hyperparameters sweep


# Start sweep with:

python sweep.py

  

# Provide appropriate hyperparameters range in sweep_config.py (Config written in py file to use the power of math package for sweep configs)

Running on custom dataset


# Convert traget data of (N*32*32*3) into (N*3*32*32) shape and pass through the model:

python pred.py (Provide path of the saved models)

Other files

mixup.py contains functions to claculate loss of mixup predictions as you cant use nn.CrossEntropyLoss
utils.py contain somehelper functions
dataloader.py is a torch class based dataloader of our train data (CIFAR-10 data)
private_loader.py is a torch class based dataloader of our private data.
Transformations are done using torchtransforms in main.py and sweep.py files depending on usage.

Hyper-parameter optimization for sklearn

hyperopt-sklearn Hyperopt-sklearn is Hyperopt-based model selection among machine learning algorithms in scikit-learn. See how to use hyperopt-sklearn

1.4k Jan 1, 2023

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

Query Embedding on Hyper-Relational Knowledge Graphs This repository contains the code used for the experiments in the paper Query Embedding on Hyper-

19 Jul 26, 2022

MM1 and MMC Queue Simulation using python - Results and parameters in excel and csv files

implementation of MM1 and MMC Queue on randomly generated data and evaluate simulation results then compare with analytical results and draw a plot curve for them, simulate some integrals and compare results and run monte carlo algorithm with them

1 Jan 19, 2022

Adaout is a practical and flexible regularization method with high generalization and interpretability

Adaout Adaout is a practical and flexible regularization method with high generalization and interpretability. Requirements python 3.6 (Anaconda versi

1 Feb 9, 2022

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks (SDPoint) This repository contains the cod

17 Jul 4, 2022

[ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization Kaidi Cao, Yining Chen, Junwei Lu, Nikos Arechiga, Adrien Gaidon, Tengyu Ma

29 Oct 20, 2022

Training vision models with full-batch gradient descent and regularization

Stochastic Training is Not Necessary for Generalization -- Training competitive vision models without stochasticity This repository implements trainin

32 Jan 6, 2023

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks. Code, based on the PyTorch framework, for reprodu

3 Dec 27, 2022

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

sam4onnx A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for

6 May 15, 2022

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Related tags

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Overview

Important

Training

Hyperparameters sweep

Running on custom dataset

Other files

You might also like...

Hyper-parameter optimization for sklearn

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

MM1 and MMC Queue Simulation using python - Results and parameters in excel and csv files

Adaout is a practical and flexible regularization method with high generalization and interpretability

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

[ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Training vision models with full-batch gradient descent and regularization

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Owner

Bhanu

3D ResNets for Action Recognition (CVPR 2018)

Facilitating Database Tuning with Hyper-ParameterOptimization: A Comprehensive Experimental Evaluation

Code for ACL2021 paper Consistency Regularization for Cross-Lingual Fine-Tuning.

Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark

ICLR 2021, Fair Mixup: Fairness via Interpolation

A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

SUPERVISED-CONTRASTIVE-LEARNING-FOR-PRE-TRAINED-LANGUAGE-MODEL-FINE-TUNING - The Facebook paper about fine tuning RoBERTa with contrastive loss

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP