Cl datasets - PyTorch image dataloaders and utility functions to load datasets for supervised continual learning

berjaoui

Last update: Aug 28, 2022

Overview

Continual learning datasets

Introduction

This repository contains PyTorch image dataloaders and utility functions to load datasets for supervised continual learning. Currently supported datasets:

MNIST
Pairwise-MNIST
Fashion-MNIST
not-MNIST (letters version of MNIST, see EMNIST for more detail)
CIFAR-10
CIFAR-100
German Traffic Signs
Street View House Numbers (SVHN)
Incremental CIFAR-100
Incremental TinyImageNet

Features

The provided interface simplifies typical data loading for supervised continual learning scenarios.

Dataset order, additional training data (for replay buffers) and test data (for global metrics computation) can all be specified.
A batch-balancing feature is also available to ensure that every training batch contains data from all available classes.
The size and number of channels of the training data can be specified. Transformations are added so that input data always has the same size and number of channels. If a single channel is specified, color images are converted to grayscale; if 3 channels are specified, single-channel images are replicated across the three channels. Bicubic interpolation or linear subsampling is applied to reach the specified size.
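
The batch-balancing idea can be illustrated without any framework. The sketch below is not the package's implementation: `balanced_batches` is a hypothetical helper that assumes integer labels and samples with replacement, first drawing one index per class and then filling the rest of the batch at random.

```python
import random
from collections import defaultdict


def balanced_batches(labels, batch_size, seed=0):
    """Yield lists of sample indices in which every class appears at least once.

    Hypothetical helper for illustration only; not part of cl_datasets.
    """
    rng = random.Random(seed)

    # Group sample indices by their class label.
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    classes = sorted(by_class)
    if batch_size < len(classes):
        raise ValueError("batch_size must be at least the number of classes")

    # Roughly one pass over the data (sampling is done with replacement).
    n_batches = len(labels) // batch_size
    for _ in range(n_batches):
        # Guarantee one sample per class, then fill the remainder at random.
        batch = [rng.choice(by_class[c]) for c in classes]
        batch += rng.choices(range(len(labels)), k=batch_size - len(batch))
        rng.shuffle(batch)
        yield batch
```

A list of such batches could be passed to a `torch.utils.data.DataLoader` through its `batch_sampler` argument, so that a rare class (for example, samples kept in a replay buffer) is never missing from a batch.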

Installation

Clone the repository to your machine.
Install the package:

pip install -e cl_datasets/

Note: Please use Python 3.8 or above.

Example

from cl_datasets import getDatasets

datasets = ['svhn', 'cifar10', 'fashion', 'mnist']
batchSize = 32
dataSize = (32, 32)  # target image size
nChannels = 3        # 1 for grayscale, 3 for color

dataloaders = getDatasets(datasets, batchSize, dataSize, nChannels)

# Each entry holds the train and test loaders of one task.
for train_test_loaders in dataloaders:
    trainLoader, testLoader = train_test_loaders
    ...
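
One way the (trainLoader, testLoader) pairs from the example might drive a continual learning experiment is sketched below. It trains on each task in order and, after each task, evaluates on every task seen so far. `run_sequential`, `train_step` and `evaluate` are hypothetical names introduced for illustration; the package does not provide them.

```python
def run_sequential(dataloaders, train_step, evaluate):
    """Train on each task in order and evaluate on every task seen so far.

    `train_step(batch)` and `evaluate(test_loader)` are hypothetical callables
    supplied by the user; they are not part of cl_datasets.
    Returns one row of scores per task, with one score per task seen so far.
    """
    seen_test_loaders = []
    history = []
    for train_loader, test_loader in dataloaders:
        # Fit the current task.
        for batch in train_loader:
            train_step(batch)
        # Re-evaluate all earlier tasks to expose forgetting.
        seen_test_loaders.append(test_loader)
        history.append([evaluate(loader) for loader in seen_test_loaders])
    return history
```

Row `k` of the returned history holds the scores on tasks `0..k` after training on task `k`, which is the usual starting point for computing average accuracy or forgetting.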

List of possible datasets for training tasks

Full datasets

Description	Dataset string
MNIST	"mnist" or "MNIST"
not-MNIST	"notMnist" or "notMNIST"
Fashion-MNIST	"fashion"
SVHN	"svhn"
CIFAR-10	"cifar10"
CIFAR-100	"cifar100"
German Traffic Signs	"traffic"

Incremental datasets

Description	Dataset string
Pairwise-MNIST	"mnist_xy" (e.g. "mnist_01")
Incremental CIFAR-100 (10 classes per task)	"cifar100_i" (e.g. "cifar100_4")
Incremental TinyImageNet (10 classes per task)	"TIN_i" (e.g. "TIN_3")
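
The naming scheme for incremental datasets can be made explicit with a small parser. This is a sketch under assumptions: it treats `x` and `y` in "mnist_xy" as two single digits naming the paired classes, and `i` as an integer task index. `parse_incremental_name` is a hypothetical helper, not a function of the package.

```python
import re


def parse_incremental_name(name):
    """Split an incremental dataset string into (family, task specification).

    Hypothetical helper; the digit and index interpretation is an assumption.
    """
    # Pairwise MNIST: two single digits, e.g. "mnist_01" -> classes 0 and 1.
    pairwise = re.fullmatch(r"mnist_(\d)(\d)", name)
    if pairwise:
        return "mnist", (int(pairwise.group(1)), int(pairwise.group(2)))

    # Incremental CIFAR-100 / Tiny ImageNet: an integer task index.
    indexed = re.fullmatch(r"(cifar100|TIN)_(\d+)", name)
    if indexed:
        return indexed.group(1), int(indexed.group(2))

    raise ValueError(f"not an incremental dataset string: {name!r}")
```

For example, `parse_incremental_name("cifar100_4")` returns the family "cifar100" with task index 4, while a full-dataset string such as "svhn" raises `ValueError`.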

