Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

NAVER/LINE Vision

Last update: Dec 6, 2022

Related tags

Deep Learning deep-learning pytorch metric-learning image-retrieval deep-metric-learning regularizer cars196 aaai2021 proxy-nca proxy-synthesis proxy-based synthetic-classes norm-softmax

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Geonmo Gu*¹, Byungsoo Ko*¹, Han-Gyu Kim² (* Authors contributed equally.)

¹@NAVER/LINE Vision, ²@NAVER Clova Speech

Paper | Presentation Video | PPT | Poster

Overview

Proxy Synthesis

Proxy Synthesis (PS) is a novel regularizer for any softmax variants and proxy-based losses in deep metric learning.

How it works?

Proxy Synthesis exploits synthetic classes and improves generalization by considering class relations and obtaining smooth decision boundaries.
Synthetic classes mimic unseen classes during training phase as described in below Figure.

Experimental results

Proxy Synthesis improves performance for every loss and benchmark dataset.

Getting Started

Installation

Clone the repository locally

$ git clone https://github.com/navervision/proxy-synthesis

Create conda virtual environment

$ conda create -n proxy_synthesis python=3.7 anaconda
$ conda activate proxy_synthesis

Install pytorch

Install pytorch according to your CUDA version

$ conda install pytorch torchvision cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install faiss

We use faiss library for faster evaluation
Install faiss according to your CUDA version

$ conda install faiss-gpu cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install requirements

$ pip install -r requirements.txt

Prepare Data

Download CARS196 dataset and unzip

$ wget http://imagenet.stanford.edu/internal/car196/car_ims.tgz
$ tar zxvf car_ims.tgz -C ./dataset

Rearrange CARS196 directory by following structure

# Dataset structure
/dataset/carDB/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  test/
    class1/
      img3.jpeg
    class2/
      img4.jpeg

# Rearrange dataset structure
$ python dataset/prepare_cars.py

Train models

Norm-SoftMax loss with CARS196

# Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5

PS + Norm-SoftMax loss with CARS196

# PS + Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
 --dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Proxy-NCA loss with CARS196

# Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5

PS + Proxy-NCA loss with CARS196

# PS + Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Check Test Results

$ tensorboard --logdir=logs --port=10000

Experimental results

We report Recall@1, RP and MAP performances of each loss, which are trained with CARS196 dataset for 8 runs.

Recall@1

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	83.38	83.25	83.25	83.18	83.05	82.90	82.83	82.79	83.08 ± 0.21
PS + Norm-SoftMax	84.69	84.58	84.45	84.35	84.22	83.95	83.91	83.89	84.25 ± 0.31
Proxy-NCA	83.74	83.69	83.62	83.32	83.06	83.00	82.97	82.84	83.28 ± 0.36
PS + Proxy-NCA	84.52	84.39	84.32	84.29	84.22	84.12	83.94	83.88	84.21 ± 0.21

RP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	35.85	35.51	35.28	35.28	35.24	34.95	34.87	34.84	35.23 ± 0.34
PS + Norm-SoftMax	37.01	36.98	36.92	36.74	36.74	36.73	36.54	36.45	36.76 ± 0.20
Proxy-NCA	36.08	35.85	35.79	35.66	35.66	35.63	35.47	35.43	35.70 ± 0.21
PS + Proxy-NCA	36.97	36.84	36.72	36.64	36.63	36.60	36.43	36.41	36.66 ± 0.18

MAP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	25.56	25.56	25.00	24.93	24.90	24.59	24.57	24.56	24.92 ± 0.35
PS + Norm-SoftMax	26.71	26.67	26.65	26.56	26.53	26.52	26.30	26.17	26.51 ± 0.18
Proxy-NCA	25.66	25.52	25.37	25.36	25.33	25.26	25.22	25.04	25.35 ± 0.18
PS + Proxy-NCA	26.77	26.63	26.50	26.42	26.37	26.31	26.25	26.12	26.42 ± 0.20

Performance Graph

Below figure shows performance graph of test set during training.

Reference

Our code is based on SoftTriple repository (Arxiv, Github)

Citation

If you find Proxy Synthesis useful in your research, please consider to cite the following paper.

@inproceedings{gu2020proxy,
    title={Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning},
    author={Geonmo Gu, Byungsoo Ko, and Han-Gyu Kim},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
    year={2021}
}

License

Copyright 2021-present NAVER Corp.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Self-Attention Attribution This repository contains the implementation for AAAI-2021 paper Self-Attention Attribution: Interpreting Information Intera

60 Dec 29, 2022

[AAAI 2021] MVFNet: Multi-View Fusion Network for Efficient Video Recognition

MVFNet: Multi-View Fusion Network for Efficient Video Recognition (AAAI 2021) Overview We release the code of the MVFNet (Multi-View Fusion Network).

114 Nov 27, 2022

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation

76 Dec 5, 2022

Code for "Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation". [AAAI 2021]

Graph Evolving Meta-Learning for Low-resource Medical Dialogue Generation Code to be further cleaned... This repo contains the code of the following p

29 Nov 1, 2022

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

R²SQL The PyTorch implementation of paper Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing. (AAAI 2021) Requirement

60 Dec 31, 2022

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

ATLOP Code for AAAI 2021 paper Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling. If you make use of this co

146 Nov 29, 2022

(AAAI 2021) Progressive One-shot Human Parsing

Comments

corresponding proxy, synthetic proxy and embedding
Hi, According to figure 3 in the README.

Proxy (train) is defined as "self.proxy" in the script https://github.com/navervision/proxy-synthesis/blob/master/loss/proxy_losses.py#L57

Synthetic proxy (train) and Synthetic embedding (train) are return values of https://github.com/navervision/proxy-synthesis/blob/master/loss/proxy_losses.py#L14 (input_l2, proxy_l2)

Is it correct?
opened by pribadihcr 2

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Related tags

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Overview

Proxy Synthesis

How it works?

Experimental results

Getting Started

Installation

Prepare Data

Train models

Norm-SoftMax loss with CARS196

PS + Norm-SoftMax loss with CARS196

Proxy-NCA loss with CARS196

PS + Proxy-NCA loss with CARS196

Check Test Results

Experimental results

Recall@1

RP

MAP

Performance Graph

Reference

Citation

License

You might also like...

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

[AAAI 2021] MVFNet: Multi-View Fusion Network for Efficient Video Recognition

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

Code for "Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation". [AAAI 2021]

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

(AAAI 2021) Progressive One-shot Human Parsing

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection, AAAI 2021.

MVFNet: Multi-View Fusion Network for Efficient Video Recognition (AAAI 2021)

Comments

corresponding proxy, synthetic proxy and embedding

Owner

NAVER/LINE Vision

Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"

Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

An official source code for paper Deep Graph Clustering via Dual Correlation Reduction, accepted by AAAI 2022

Official repository of the AAAI'2022 paper "Contrast and Generation Make BART a Good Dialogue Emotion Recognizer"

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

This is the code for the paper "Contrastive Clustering" (AAAI 2021)