PyTorch code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised DA

Viraj Prabhu

Last update: Dec 24, 2022

Related tags

Deep Learning SENTRY

Overview

PyTorch Code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation

Viraj Prabhu, Shivam Khare, Deeksha Kartik, Judy Hoffman

Many existing approaches for unsupervised domain adaptation (UDA) focus on adapting under only data distribution shift and offer limited success under additional cross-domain label distribution shift. Recent work based on self-training using target pseudolabels has shown promise, but on challenging shifts pseudolabels may be highly unreliable and using them for self-training may cause error accumulation and domain misalignment. We propose Selective Entropy Optimization via Committee Consistency (SENTRY), a UDA algorithm that judges the reliability of a target instance based on its predictive consistency under a committee of random image transformations. Our algorithm then selectively minimizes predictive entropy to increase confidence on highly consistent target instances, while maximizing predictive entropy to reduce confidence on highly inconsistent ones. In combination with pseudolabel-based approximate target class balancing, our approach leads to significant improvements over the state-of-the-art on 27/31 domain shifts from standard UDA benchmarks as well as benchmarks designed to stress-test adaptation under label distribution shift.

Setup and Dependencies
Usage
Reference
License

Setup and Dependencies

Create an anaconda environment with Python 3.6: conda create -n sentry python=3.6.8 and activate: conda activate sentry
Navigate to the code directory: cd code/
Install dependencies: pip install -r requirements.txt

And you're all set up!

Usage

Download data

Data for SVHN->MNIST is downloaded automatically via PyTorch. Data for other benchmarks can be downloaded from the following links. The splits used for our experiments are already included in the data/ folder):

DomainNet
OfficeHome
VisDA2017 (only train and validation needed)

Pretrained checkpoints

To reproduce numbers reported in the paper, we include a a few pretrained checkpoints. We include checkpoints (source and adapted) for SVHN to MNIST (DIGITS) in the checkpoints directory. Source and adapted checkpoints for Clipart to Sketch adaptation (from DomainNet) and Real_World to Product adaptation (from OfficeHome RS-UT) can be downloaded from this link, and should be saved to the checkpoints/source and checkpoints/SENTRY directory as appropriate.

Train and adapt model

Natural label distribution shift: Adapt a model from to for a given (where benchmark may be DomainNet, OfficeHome, VisDA, or DIGITS), as follows:

python train.py --id <experiment_id> \
                --source <source> \
                --target <target> \
                --img_dir <image_directory> \
                --LDS_type <LDS_type> \
                --load_from_cfg True \
                --cfg_file 'config/<benchmark>/<cfg_file>.yml' \
                --use_cuda True

SENTRY hyperparameters are provided via a sentry.yml config file in the corresponding config/<benchmark> folder (On DIGITS, we also provide a config for baseline adaptation via DANN). The list of valid source/target domains per-benchmark are:

DomainNet: real, clipart, sketch, painting
OfficeHome_RS_UT: Real_World, Clipart, Product
OfficeHome: Real_World, Clipart, Product, Art
VisDA2017: visda_train, visda_test
DIGITS: Only svhn (source) to mnist (target) adaptation is currently supported.

Pass in the path to the parent folder containing dataset images via the --img_dir <name_of_directory> flag (eg. --img_dir '~/data/DomainNet'). Pass in the label distribution shift type via the --LDS_type flag: For DomainNet, OfficeHome (standard), and VisDA2017, pass in --LDS_type 'natural' (default). For OfficeHome RS-UT, pass in --LDS_type 'RS_UT'. For DIGITS, pass in --LDS_type as one of IF1, IF20, IF50, or IF100, to load a manually long-tailed target training split with a given imbalance factor (IF), as described in Table 4 of the paper.

To load a pretrained DA checkpoint instead of training your own, additionally pass --load_da True and --id <benchmark_name> to the script above. Finally, the training script will log performance metrics to the console (average and aggregate accuracy), and additionally plot and save some per-class performance statistics to the results/ folder.

Note: By default this code runs on GPU. To run on CPU pass: --use_cuda False

Reference

If you found this code useful, please consider citing:

@article{prabhu2020sentry
   author = {Prabhu, Viraj and Khare, Shivam and Kartik, Deeksha and Hoffman, Judy},
   title = {SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation},
   year = {2020},
   journal = {arXiv preprint: 2012.11460},
}

Acknowledgements

We would like to thank the developers of PyTorch for building an excellent framework, in addition to the numerous contributors to all the open-source packages we use.

License

MIT

Comments

Reproducing source-only results on DomainNet
Hi SENTRY authors, congrats on the impressive results from your recent paper and thank you for sharing your code with us! As a starting point I am trying to reproduce the DomainNet source-only Sketch -> Real results, which in your paper is listed as 77% average per-class accuracy. I'm currently running this command with your repo to attempt to reproduce this performance:

python train.py --id source-only-domainnet --source sketch --target real --img_dir <local path to DomainNet> --LDS_type natural --use_cuda True --cnn ResNet50FS --optimizer SGD --lr 1e-2 --wd 5e-4 --num_epochs 100 --batch_size 16 --temperature 0.05 --l2_normalize True

All the hyperparameters other than num_epochs I believe I have taken from your paper, but I may have misread/missed something. The current target average per-class accuracy I'm getting is 66%. Would you be able to share the exact command/hyperparameters you used to obtain the 77% transfer accuracy from sketch to real? Thank you so much!
opened by robbiejones96 4
what it means when committee size k is larger than the augmented number N in Table 6 of the paper?

Hi, I'd like to know what it means when committee size k is larger than the augmented number N in Table 6 of the paper? I think the committee size k should be smaller than the augmented number.

opened by fake-warrior8 1
how can I reproduce svhn->mnist for LDS_type=IF50

Hi, thanks for sharing.

But I found in Digit Dataset, we only can reproduce LDS_type=IF20 as in Checkpoints folder there only has mnist_ixs_IF20.pkl.

So what we can do if we want to reproduce other result, like LDS_type=IF50/100. How can we make .pkl files like yours? Could you help us with this problem?

Thanks a lot.

opened by CYDping 1
error happend when reproducing digits

Hi, thank you very much for sharing.

However, I met error when trying svhn - > MNIST on Digits . As shown in the figure.

Do you know how to solve this problem?

opened by CYDping 1
Most results are reproducible!!

I could reproduce most results given in the paper on Office Home and domainNet datasets. Well done on this great work and thanks for making the code available to public!

As a side note, I noticed that not all the methods you compare against use the same evaluation process, like center crop on Cliparts. It would be great if you could prepare a comparison across all baselines using the same evaluation methods.

opened by tarun005 1

PyTorch code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised DA

Related tags

Overview

PyTorch Code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation

Viraj Prabhu, Shivam Khare, Deeksha Kartik, Judy Hoffman

Table of Contents

Setup and Dependencies

Usage

Download data

Pretrained checkpoints

Train and adapt model

Reference

Acknowledgements

License

Comments

Reproducing source-only results on DomainNet

what it means when committee size k is larger than the augmented number N in Table 6 of the paper?

how can I reproduce svhn->mnist for LDS_type=IF50

error happend when reproducing digits

Most results are reproducible!!

Owner

Viraj Prabhu

pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination"

Selective Wavelet Attention Learning for Single Image Deraining

Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

NudeNet: Neural Nets for Nudity Classification, Detection and selective censoring

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Semi-supervised Domain Adaptation via Minimax Entropy

PyTorch code accompanying our paper on Maximum Entropy Generators for Energy-Based Models

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021

Unsupervised Video Interpolation using Cycle Consistency

Implementation of "Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency"

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Official PyTorch code for WACV 2022 paper "CFLOW-AD: Real-Time Unsupervised Anomaly Detection with Localization via Conditional Normalizing Flows"

AntroPy: entropy and complexity of (EEG) time-series in Python

ICLR21 Tent: Fully Test-Time Adaptation by Entropy Minimization

noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment