CM-NAS
Official PyTorch code for the ICCV 2021 paper "CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification".
Visible-infrared person re-identification (VI-ReID) aims to match cross-modality pedestrian images, breaking through the limitation of single-modality person ReID in dark environments. To mitigate the impact of the large modality discrepancy, existing works manually design various two-stream architectures to separately learn modality-specific and modality-sharable representations. Such a manual design routine, however, highly depends on massive experiments and empirical practice, which is time-consuming and labor-intensive. In this paper, we systematically study the manually designed architectures and identify that appropriately separating Batch Normalization (BN) layers is the key to bringing a great boost to cross-modality matching. Based on this observation, the essential objective is to find the optimal separation scheme for each BN layer. To this end, we propose a novel method, named Cross-Modality Neural Architecture Search (CM-NAS). It consists of a BN-oriented search space in which standard optimization can be fulfilled subject to the cross-modality task. Equipped with the searched architecture, our method outperforms state-of-the-art counterparts on both benchmarks, improving Rank-1/mAP by 6.70%/6.13% on SYSU-MM01 and by 12.17%/11.23% on RegDB.
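The core idea, deciding for each BN layer whether it is shared across modalities or separated per modality, can be illustrated with a small PyTorch sketch. The module name `SearchableBN`, the `modality` argument, and the `separate` flag below are hypothetical illustrations of the concept, not the repository's actual implementation.

```python
import torch
import torch.nn as nn

class SearchableBN(nn.Module):
    """Hypothetical sketch: a BN layer that is either shared across the
    visible/infrared modalities or separated into two modality-specific BNs.
    The boolean `separate` plays the role of one decision in the search space."""

    def __init__(self, num_features, separate=False):
        super().__init__()
        self.separate = separate
        if separate:
            self.bn_vis = nn.BatchNorm2d(num_features)  # visible branch
            self.bn_inf = nn.BatchNorm2d(num_features)  # infrared branch
        else:
            self.bn = nn.BatchNorm2d(num_features)      # shared by both modalities

    def forward(self, x, modality):
        # `modality` is assumed to be 'vis' or 'inf' for the current mini-batch
        if not self.separate:
            return self.bn(x)
        return self.bn_vis(x) if modality == 'vis' else self.bn_inf(x)

# minimal usage example
if __name__ == '__main__':
    layer = SearchableBN(64, separate=True)
    feat_vis = torch.randn(4, 64, 32, 16)
    out = layer(feat_vis, modality='vis')
    print(out.shape)  # torch.Size([4, 64, 32, 16])
```

In CM-NAS, the search is over exactly this kind of share-or-separate decision for every BN layer in the backbone.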
Requirements
Our experiments are conducted under the following environment:
- Python 3.7
- Pytorch == 1.3.1
- torchvision == 0.4.2
Model Zoo
The searched configurations and the trained models can be downloaded from this link.
Dataset | Protocol | Rank-1 | mAP | Protocol | Rank-1 | mAP | Trained Model |
---|---|---|---|---|---|---|---|
SYSU-MM01 | All-Single | 61.99% | 60.02% | Indoor-Single | 67.01% | 72.95% | Google Drive |
RegDB | Vis-to-Inf | 84.54% | 80.32% | Inf-to-Vis | 82.57% | 78.31% | Google Drive |
Note that the results may fluctuate slightly due to the random splitting of the datasets.
Search
Code will be released soon.
Train
Before training, please download the searched configurations; a sketch of how such a configuration might be loaded is shown after the steps below.
- For SYSU-MM01, first run `./utils/pre_process_sysu.py` to prepare the data, then configure the `data_root` path in `train_sysu.sh` and run `train_sysu.sh`.
- For RegDB, configure the `data_root` path in `train_regdb.sh` and run `train_regdb.sh`.
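Purely as an illustration (the actual file format of the released configurations is defined by the repository and may differ), the sketch below assumes a searched configuration is a JSON list of booleans, one per BN layer, and shows how it could be turned into shared or modality-specific BN blocks. All paths, names, and layer widths are placeholders.

```python
import json
import torch.nn as nn

# Hypothetical format: the searched configuration is assumed here to be a JSON
# list of booleans, one per BN layer (True = use modality-specific BNs,
# False = share a single BN). The real files in the Model Zoo may differ.

def load_bn_config(path):
    with open(path, 'r') as f:
        return json.load(f)  # e.g. [true, false, true, ...]

def build_bn_layers(channels_per_layer, bn_config):
    """Create one shared or separated BN block per layer according to the config."""
    assert len(channels_per_layer) == len(bn_config)
    layers = []
    for num_features, separate in zip(channels_per_layer, bn_config):
        if separate:
            # two modality-specific BNs, selected by modality at forward time
            layers.append(nn.ModuleDict({
                'vis': nn.BatchNorm2d(num_features),
                'inf': nn.BatchNorm2d(num_features),
            }))
        else:
            layers.append(nn.BatchNorm2d(num_features))  # shared BN
    return nn.ModuleList(layers)

# usage sketch (placeholder path and layer widths)
# bn_config = load_bn_config('searched_config_sysu.json')
# bn_layers = build_bn_layers([64, 128, 256, 512], bn_config)
```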
Test
Before testing, please download the searched configurations and the trained models; a generic sketch of the Rank-1/mAP evaluation is shown after the steps below.
- For SYSU-MM01, configure the `data_root` path in `test_sysu.sh` and run `test_sysu.sh`.
- For RegDB, configure the `data_root` path in `test_regdb.sh` and run `test_regdb.sh`.
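For reference, the snippet below is a generic re-ID evaluation sketch, not the repository's test code: it computes Rank-1 and mAP from query/gallery features by cosine-similarity ranking. The camera-aware filtering used by the official SYSU-MM01 and RegDB protocols is omitted, and all names and dimensions are placeholders.

```python
import numpy as np

def evaluate_rank1_map(query_feats, query_ids, gallery_feats, gallery_ids):
    """Generic re-ID metric sketch: rank the gallery by cosine similarity,
    then compute Rank-1 accuracy and mean Average Precision (mAP).
    All inputs are NumPy arrays; protocol-specific camera filtering is omitted."""
    # L2-normalize so the dot product equals cosine similarity
    q = query_feats / np.linalg.norm(query_feats, axis=1, keepdims=True)
    g = gallery_feats / np.linalg.norm(gallery_feats, axis=1, keepdims=True)
    sim = q @ g.T                                   # (num_query, num_gallery)

    rank1_hits, average_precisions = [], []
    for i in range(sim.shape[0]):
        order = np.argsort(-sim[i])                 # gallery sorted by similarity
        matches = (gallery_ids[order] == query_ids[i]).astype(np.float32)
        if matches.sum() == 0:
            continue                                # no valid gallery sample for this query
        rank1_hits.append(matches[0])
        precision_at_hits = np.cumsum(matches) / (np.arange(len(matches)) + 1)
        average_precisions.append((precision_at_hits * matches).sum() / matches.sum())

    return float(np.mean(rank1_hits)), float(np.mean(average_precisions))

# usage sketch with random features (IDs and dimensions are placeholders)
# r1, mAP = evaluate_rank1_map(np.random.randn(10, 512), np.arange(10) % 5,
#                              np.random.randn(50, 512), np.arange(50) % 5)
```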
License
CM-NAS is released under the Apache License 2.0. Please see the LICENSE file for more information.
Citation
Please consider citing our paper in your publications if this project helps your research. The BibTeX entry is as follows.
@inproceedings{Fu2021CMNAS,
title = {CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification},
author = {Chaoyou Fu and Yibo Hu and Xiang Wu and Hailin Shi and Tao Mei and Ran He},
booktitle = {ICCV},
year = {2021}
}
Acknowledgements
This repo is based on the following repo; many thanks to the authors.