[ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition

Wang Tan

Last update: Dec 31, 2022

Related tags

Deep Learning CaaM

Overview

CaaM

This repo contains the codes of training our CaaM on NICO/ImageNet9 dataset. Due to my recent limited bandwidth, this codebase is still messy, which will be further refined and checked recently.

0. Bibtex

If you find our codes helpful, please cite our paper:

@inproceedings{wang2021causal,
  title={Causal Attention for Unbiased Visual Recognition},
  author={Wang, Tan and Zhou, Chang and Sun, Qianru and Zhang, Hanwang},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2021}
}

1. Preparation

Installation: Python3.6, Pytorch1.6, tensorboard, timm(0.3.4), scikit-learn, opencv-python, matplotlib, yaml
Dataset:

NICO: Please download from https://drive.google.com/file/d/1topMf4xqLpbhI1X6fs3hf8_M1ytieLqP/view?usp=sharing, we remove the damaged images in original NICO and rename the images. The construction details of our proposed subset are in our Appendix.
ImageNet9: Please follow the usual practice to download the ImageNet (ILSVRC2015) dataset.

Please remember to change the data path in the config file.

2. Evaluation:

For ResNet18 on NICO dataset

CUDA_VISIBLE_DEVICES=0 python train.py -cfg conf/ours_resnet18_multilayer2_bf0.02_noenv_pw5e5.yaml -debug -gpu -eval pretrain_model/nico_resnet18_ours_caam-best.pth

The results will be: Val Score: 0.4638461470603943 Test Score: 0.4661538600921631

For T2T-ViT7 on NICO dataset

CUDA_VISIBLE_DEVICES=0,1 python train.py -cfg conf/ours_t2tvit7_bf0.02_s4_noenv_pw5e4.yaml -debug -gpu -multigpu -eval pretrain_model/nico_t2tvit7_ours_caam-best.pth

The results will be: Val Score: 0.3799999952316284 Test Score: 0.3761538565158844

For ImageNet-9 dataset

Similarly, the pretrained model is in pretrain_model. Please note that on ImageNet9, we report the best performance for the 3 metrics in our paper. The pretrained model is for bias and unbias and we did not save the model for the best ImageNet-A.

3. Train

To perform training, please run the sh file in scripts. For example:

sh scripts/run_baseline_resnet18.sh

4. An interesting finding

Recently I found an interesting thing by accident. The mixup added on the baseline model would not bring much performance improvements (see Table 1. in the main paper). However, when performing mixup based on our CaaM, the performance can be further boosted.

Specifically, you can active the mixup by:

sh scripts/run_ours_resnet18_mixup.sh

This can make our CaaM achieve about 50~51% Val & Test accuracy on NICO dataset.

Acknowledgement

Special thanks to the authors of ReBias and IRM, and the datasets used in this research project.

If you have any question or find any bug, please kindly email me.

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

WASP2 (Currently in pre-development): Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis Requ

2 Aug 11, 2022

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

[AAAI22] Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification We point out the overlooked unbiasedness in long-tailed clas

28 Oct 18, 2022

Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones

HaloNet - Pytorch Implementation of the Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones. This re

189 Nov 22, 2022

NP DRAW paper released code

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation This repo contains the official implementation for the NP-DRAW paper.

22 Mar 13, 2022

Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training.

Updates (2020/06/21) Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training. Pyr

1.3k Jan 4, 2023

Comments

Question for the partition weights learning

Thank you sharing this amazing work.

I have one question for the partition weight \theta learning in Eq(7), we may claim that the parameter \theta is learning mappings from image ids to the context(environment) space in \mathbb{R}^M.

I am just curious that if it is possible to directly learn a function that maps from the original feature space \mathcal{X} to the context space, such that theta will not be parameter matrix but a parameterized model?

I assume it could be, however a difference between the current implementation and parameterized model is, we may need multiple model for each X. As the paper has shown that CAAM can be plugged in any intermediate layers in the models, thus for different intermediate X, we need to learn different parameterized models. Different from that, a global parameter matrix \theta is more efficient, since it can be shared across different intermediate representation of X. Please let me know if there is anything missing.

Thank you in advance.

opened by rowedenny 0

[ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition

Related tags

Overview

CaaM

0. Bibtex

1. Preparation

2. Evaluation:

3. Train

4. An interesting finding

Acknowledgement

You might also like...

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones

NP DRAW paper released code

Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training.

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

This repository contains code released by Google Research.

Code for "Causal autoregressive flows" - AISTATS, 2021

Code for NeurIPS 2021 paper: Invariant Causal Imitation Learning for Generalizable Policies

Comments

Question for the partition weights learning

Owner

Wang Tan

Toward Spatially Unbiased Generative Models (ICCV 2021)

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

Official Pytorch implementation of "Unbiased Classification Through Bias-Contrastive and Bias-Balanced Learning (NeurIPS 2021)

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering".

This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"