PyTorch implementation of the REMIND method from our ECCV-2020 paper "REMIND Your Neural Network to Prevent Catastrophic Forgetting"

Overview

REMIND Your Neural Network to Prevent Catastrophic Forgetting

This is a PyTorch implementation of the REMIND algorithm from our ECCV-2020 paper. An arXiv pre-print of our paper is available.

REMIND (REplay using Memory INDexing) is a novel brain-inspired streaming learning model that uses tensor quantization to efficiently store hidden representations (e.g., CNN feature maps) for later replay. REMIND implements this compression using Product Quantization (PQ) and outperforms existing models on the ImageNet and CORe50 classification datasets. Further, we demonstrate REMIND's robustness by pioneering streaming Visual Question Answering (VQA), in which an agent must answer questions about images.

Formally, REMIND takes an input image and passes it through the frozen layers of a network to obtain tensor representations (feature maps). It then quantizes the tensors via PQ and stores the resulting indices in memory for replay. The decoder reconstructs a subset of previously stored tensors from their indices to train the plastic layers of the network before inference. We restrict the size of REMIND's replay buffer and use a uniform random storage policy.
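
To make the mechanism concrete, here is a minimal sketch of the quantize/store/reconstruct cycle using faiss. It is an illustration under assumed settings (random features, 32 codebooks of 256 codes each), not the repo's exact pipeline:

import numpy as np
import faiss

# Feature maps from the frozen backbone: (N, H, W, C), quantized per spatial location.
num_channels = 512
feats = np.random.normal(size=(1000, 7, 7, num_channels)).astype(np.float32)
flat = feats.reshape(-1, num_channels)

num_codebooks = 32  # sub-quantizers; each handles 512/32 = 16 dimensions
nbits = 8           # 2^8 = 256 codes per codebook

pq = faiss.ProductQuantizer(num_channels, num_codebooks, nbits)
pq.train(flat)                  # fit codebooks on base-initialization features

codes = pq.compute_codes(flat)  # uint8 indices -- this is what the buffer stores
recon = pq.decode(codes)        # approximate features reconstructed for replay
recon = recon.reshape(feats.shape)

Storing the uint8 codes instead of float32 features is what yields the memory savings: under these example settings, each 512-d vector costs 32 bytes instead of 2048.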

[Figure: REMIND overview]

Dependencies

⚠️ ⚠️ For unknown reasons, our code does not reproduce our results with PyTorch versions newer than 1.3.1. Please follow the instructions below to ensure reproducibility.

We have tested the code with the following packages and versions:

  • Python 3.7.6
  • PyTorch (GPU) 1.3.1
  • torchvision 0.4.2
  • NumPy 1.18.5
  • FAISS (CPU) 1.5.2
  • CUDA 10.1 (also works with CUDA 10.0)
  • Scikit-Learn 0.23.1
  • Scipy 1.1.0
  • NVIDIA GPU

We recommend setting up a conda environment with these same package versions:

conda create -n remind_proj python=3.7
conda activate remind_proj
conda install numpy=1.18.5
conda install pytorch=1.3.1 torchvision=0.4.2 cudatoolkit=10.1 -c pytorch
conda install faiss-cpu=1.5.2 -c pytorch

Setup ImageNet-2012

The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset has 1000 categories and 1.2 million images. The images do not need to be preprocessed or packaged in any database, but the validation images need to be moved into appropriate subfolders (step 3 below handles this with a script).

  1. Download the images from http://image-net.org/download-images

  2. Extract the training data:

    mkdir train && mv ILSVRC2012_img_train.tar train/ && cd train
    tar -xvf ILSVRC2012_img_train.tar && rm -f ILSVRC2012_img_train.tar
    find . -name "*.tar" | while read NAME ; do mkdir -p "${NAME%.tar}"; tar -xvf "${NAME}" -C "${NAME%.tar}"; rm -f "${NAME}"; done
    cd ..
  3. Extract the validation data and move images to subfolders:

    mkdir val && mv ILSVRC2012_img_val.tar val/ && cd val && tar -xvf ILSVRC2012_img_val.tar
    wget -qO- https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh | bash
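
To sanity-check the resulting layout (a suggested check, not part of the repo), both splits should expose 1000 class subfolders:

from torchvision.datasets import ImageFolder

# Run from the ImageNet root containing train/ and val/
train = ImageFolder('train')
val = ImageFolder('val')
assert len(train.classes) == 1000 and len(val.classes) == 1000
print(len(train.samples), 'train images /', len(val.samples), 'val images')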

Training REMIND on ImageNet (Classification)

We provide the necessary files to train REMIND on the exact ImageNet ordering used in our paper (listed in imagenet_class_order.txt), as well as steps for running REMIND on an alternative ordering.

To train REMIND on the ImageNet ordering from our paper, follow the steps below:

  1. Run run_imagenet_experiment.sh to train REMIND on the ordering from our paper. Note: this will use our ordering and the associated files provided in imagenet_files.

To train REMIND on a different ImageNet ordering, follow the steps below:

  1. Generate a text file containing one class name per line in the desired order (see the sketch after this list).
  2. Run make_numpy_imagenet_label_files.py to generate the necessary numpy files for the desired ordering using the text file from step 1.
  3. Run train_base_init_network.sh to train an offline model using the desired ordering and label files generated in step 2 on the base init data.
  4. Run run_imagenet_experiment.sh using the label files from step 2 and the ckpt file from step 3 to train REMIND on the desired ordering.
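
For step 1, a minimal sketch of generating a shuffled ordering file, assuming the class names are the training subfolder names (the output filename is arbitrary):

import os
import random

classes = sorted(os.listdir('train'))  # class subfolders created during setup
random.seed(0)                         # fix the seed so the ordering is reproducible
random.shuffle(classes)

with open('my_imagenet_class_order.txt', 'w') as f:
    f.write('\n'.join(classes) + '\n')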

Files generated from the streaming experiment:

  • *.json files containing incremental top-1 and top-5 accuracies
  • *.pth files containing incremental model predictions/probabilities
  • *.pth files containing incremental REMIND classifier (F) weights
  • *.pkl files containing PQ centroids and incremental buffer data (e.g., latent codes)
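
A short sketch for inspecting the accuracy files after a run (the output directory below is hypothetical; point it at your experiment's save path):

import json
from pathlib import Path

# Hypothetical output directory; filenames vary with the experiment settings.
for result_file in sorted(Path('./streaming_experiment').glob('*.json')):
    with open(result_file) as f:
        results = json.load(f)
    print(result_file.name, '->', results)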

To continue training REMIND from a previous ckpt:

We save incremental weights and associated data for REMIND after each evaluation cycle, which allows REMIND to continue training from these saved files (e.g., after a machine crash). To do this in run_imagenet_experiment.sh:

  1. Set the --resume_full_path argument to the path where the previous REMIND model was saved.
  2. Set the --streaming_min_class argument to the class REMIND left off on.
  3. Run run_imagenet_experiment.sh

Training REMIND on VQA Datasets

We use the gensen library for question features. Execute the following steps to set it up:

cd ${GENSENPATH} 
git clone git@github.com:erobic/gensen.git
cd ${GENSENPATH}/data/embedding
chmod +x glove2h5.sh && ./glove2h5.sh
cd ${GENSENPATH}/data/models
chmod +x download_models.sh && ./download_models.sh

Training REMIND on CLEVR

Note: For convenience, we pre-extract all features, including the PQ-encoded features. This requires 140 GB of free space, assuming images are deleted after feature extraction.

  1. Download and extract CLEVR images+annotations:

    wget https://dl.fbaipublicfiles.com/clevr/CLEVR_v1.0.zip
    unzip CLEVR_v1.0.zip
  2. Extract question features

    • Set up gensen and download the GloVe embeddings as described above under "Training REMIND on VQA Datasets".
    
    • Edit vqa_experiments/clevr/extract_question_features_clevr.py, setting the DATA_PATH variable to point to the CLEVR dataset and GENSEN_PATH to point to the gensen repository, then extract the features: python vqa_experiments/clevr/extract_question_features_clevr.py

    • Pre-process the CLEVR questions: edit the PATH variable in vqa_experiments/clevr/preprocess_clevr.py to point to the directory where CLEVR was extracted, then run the script.

  3. Extract image features, train PQ encoder and extract encoded features

    • Extract image features: python -u vqa_experiments/clevr/extract_image_features_clevr.py --path /path/to/CLEVR
    • In vqa_experiments/clevr/pq_encoding_clevr.py, set the values of PATH and streaming_type (either 'iid' or 'qtype')
    • Train PQ encoder and extract features: python vqa_experiments/clevr/pq_encoding_clevr.py
  4. Train REMIND

    • Edit data_path in vqa_experiments/configs/config_CLEVR_streaming.py
    • Run ./vqa_experiments/run_clevr_experiment.sh (Set DATA_ORDER to either qtype or iid to define the data order)

Training REMIND on TDIUC

Note: For convenience, we pre-extract all features, including the PQ-encoded features. This requires around 170 GB of free space, assuming images are deleted after feature extraction.

  1. Download TDIUC

    cd ${TDIUC_PATH}
    wget https://kushalkafle.com/data/TDIUC.zip && unzip TDIUC.zip
    cd TDIUC && python setup.py --download Y # You may need to change print '' statements to print('')
    
  2. Extract question features

    • Edit vqa_experiments/tdiuc/extract_question_features_tdiuc.py, setting the DATA_PATH variable to point to the TDIUC dataset and GENSEN_PATH to point to the gensen repository, then extract the features: python vqa_experiments/tdiuc/extract_question_features_tdiuc.py

    • Pre-process the TDIUC questions: edit the PATH variable in vqa_experiments/tdiuc/preprocess_tdiuc.py to point to the directory where TDIUC was extracted, then run the script.

  3. Extract image features, train PQ encoder and extract encoded features

    • Extract image features: python -u vqa_experiments/tdiuc/extract_image_features_tdiuc.py --path /path/to/TDIUC
    • In vqa_experiments/tdiuc/pq_encoding_tdiuc.py, set the values of PATH and streaming_type (either 'iid' or 'qtype')
    • Train the PQ encoder and extract features: python vqa_experiments/tdiuc/pq_encoding_tdiuc.py
  4. Train REMIND

    • Edit data_path in vqa_experiments/configs/config_TDIUC_streaming.py
    • Run ./vqa_experiments/run_tdiuc_experiment.sh (Set DATA_ORDER to either qtype or iid to define the data order)

Citation

If using this code, please cite our paper.

@inproceedings{hayes2020remind,
  title={REMIND Your Neural Network to Prevent Catastrophic Forgetting},
  author={Hayes, Tyler L and Kafle, Kushal and Shrestha, Robik and Acharya, Manoj and Kanan, Christopher},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2020}
}

Comments
  • How to reproduce the performance of streaming ImageNet models?

    Hi! I want to reproduce the performance of the streaming ImageNet models. I followed the steps in the README to run run_imagenet_experiment.sh (with the same ImageNet ordering), but the result I get after training does not seem to match the performance reported in the paper.

    opened by HaitaoWen 4
  • I wonder if there's a tiny mistake in "make_numpy_imagenet_label_files.py"?

        # compute mapping from user order to default pytorch order
        map = []
        for v in lines:
            ix = default_class_order.index(v)
            map.append(ix)
        # relabel all samples and save to numpy files
        new_train_labels = np.empty_like(default_train_labels)
        new_val_labels = np.empty_like(default_val_labels)
        for i in range(len(new_train_labels)):
            new_train_labels[i] = map[default_train_labels[i]]
        for i in range(len(new_val_labels)):
            new_val_labels[i] = map[default_val_labels[i]]
    

    In this part of make_numpy_imagenet_label_files.py, the map list stores the mapping from the user order to the default PyTorch order. But when computing the new labels for the samples, the key passed into map is the default PyTorch label. So I wonder if the map list should instead store the mapping from the default label to the user order?

    opened by fania98 2
  • Can this lifelong learning approach be applied to regression?

    It's a great honor to be the first questioner of this project. I want to use incremental learning on a regression problem, and I wonder if REMIND would work well in that setting.

    opened by xujinjiang 1
  • Memory usage by Product Quantizer

    I have gone through your code, and as mentioned in the paper you used faiss.ProductQuantizer to quantize the feature vectors. Can you tell me how to determine the memory usage of the product quantizer, as reported in your paper?

    
    import numpy as np
    import faiss

    num_channels = 512
    train_data_base_init = np.random.normal(size=(2000, 7, 7, num_channels))
    train_data_base_init = train_data_base_init.astype(np.float32)

    # flatten spatial locations so each 512-d vector is quantized independently
    train_data_base_init = np.reshape(train_data_base_init, (-1, num_channels))
    num_samples = len(train_data_base_init)

    codebook_size = 256
    num_codebooks = 32

    nbits = int(np.log2(codebook_size))
    pq = faiss.ProductQuantizer(num_channels, num_codebooks, nbits)

    pq.train(train_data_base_init)

    # how to determine the size of pq (the size of the product quantizer model)?
    # or did you just report the size of latent_dict (the replay buffer only) in the REMIND paper?
    # latent_dict stores the corresponding code and label of each feature.
    # latent_dict, line 110 -> https://github.com/tyler-hayes/REMIND/blob/master/image_classification_experiments/imagenet_base_initialization.py
    

    @tyler-hayes @erobic

    opened by SouBanerjee 1