Learning Super-Features for Image Retrieval

NAVER

Last update: Dec 28, 2022

Related tags

Deep Learning fire

Overview

Learning Super-Features for Image Retrieval

This repository contains the code for running our FIRe model presented in our ICLR'22 paper:

@inproceedings{superfeatures,
  title={{Learning Super-Features for Image Retrieval}},
  author={{Weinzaepfel, Philippe and Lucas, Thomas and Larlus, Diane and Kalantidis, Yannis}},
  booktitle={{ICLR}},
  year={2022}
}

License

The code is distributed under the CC BY-NC-SA 4.0 License. See LICENSE for more information. It is based on code from HOW, cirtorch and ASMK that are released under their own license, the MIT license.

Preparation

After cloning this repository, you must also have HOW, cirtorch and ASMK and have them in your PYTHONPATH.

install HOW

git clone https://github.com/gtolias/how
export PYTHONPATH=${PYTHONPATH}:$(realpath how)

install cirtorch

wget "https://github.com/filipradenovic/cnnimageretrieval-pytorch/archive/v1.2.zip"
unzip v1.2.zip
rm v1.2.zip
export PYTHONPATH=${PYTHONPATH}:$(realpath cnnimageretrieval-pytorch-1.2)

install ASMK

git clone https://github.com/jenicek/asmk.git
pip3 install pyaml numpy faiss-gpu
cd asmk
python3 setup.py build_ext --inplace
rm -r build
cd ..
export PYTHONPATH=${PYTHONPATH}:$(realpath asmk)

install dependencies by running:

pip3 install -r how/requirements.txt

data/experiments folders

All data will be stored under a folder fire_data that will be created when running the code; similarly, results and models from all experiments will be stored under folder fire_experiments

Evaluating our ICLR'22 FIRe model

To evaluate on ROxford/RParis our model trained on SfM-120k, simply run

python evaluate.py eval_fire.yml

With the released model and the parameters found in eval_fire.yml, we obtain 90.3 on the validation set, 82.6 and 62.2 on ROxford medium and hard respectively, 85.2 and 70.0 on RParis medium and hard respectively.

Training a FIRe model

Simply run

python train.py train_fire.yml -e train_fire

All training outputs will be saved to fire_experiments/train_fire.

To evaluate the trained model that was saved in fire_experiments/train_fire, simply run:

python evaluate.py eval_fire.yml -e train_fire -ml train_fire

Pretrained models

For reproducibility, we provide the following model weights for the architecture we use in the paper (ResNet50 without the last block + LIT):

Model pre-trained on ImageNet-1K (with Cross-Entropy, the pre-trained model we use for training FIRe) (link)
Model trained on SfM-120k trained with FIRe (link)

They will be automatically downloaded when running the training / testing script.

You might also like...

A Joint Video and Image Encoder for End-to-End Retrieval

Frozen️ in Time ❄️ ️️️️ ⏳ A Joint Video and Image Encoder for End-to-End Retrieval project page | arXiv | webvid-data Repository containing the code,

225 Dec 25, 2022

Instance-level Image Retrieval using Reranking Transformers

Instance-level Image Retrieval using Reranking Transformers Fuwen Tan, Jiangbo Yuan, Vicente Ordonez, ICCV 2021. Abstract Instance-level image retriev

87 Jan 3, 2023

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking We revisit and address issues with Oxford 5k and Paris 6k image retrieval benchm

188 Dec 17, 2022

cisip-FIRe - Fast Image Retrieval

Fast Image Retrieval (FIRe) is an open source image retrieval project release by Center of Image and Signal Processing Lab (CISiP Lab), Universiti Malaya. This project implements most of the major binary hashing methods to date, together with different popular backbone networks and public datasets.

39 Nov 25, 2022

Multimodal commodity image retrieval 多模态商品图像检索

Comments

Huggingface Spaces

Hi, would you be interested in sharing a web demo on Huggingface Spaces for fire?

It would make this model more accessible as it would allow people to try out the model directly from the browser. Some other recent machine learning model repos have set up Spaces for easy access:

github: https://github.com/salesforce/BLIP Spaces: https://huggingface.co/spaces/akhaliq/BLIP

github: https://github.com/facebookresearch/omnivore Spaces: https://huggingface.co/spaces/akhaliq/omnivore

Spaces is completely free, and I can help setup a Gradio Space. Here are some getting started instructions if you'd prefer to do it yourself: https://huggingface.co/blog/gradio-spaces

opened by AK391 3
Gradio Blocks Demo

Hi, thanks for making the gradio demo here: https://huggingface.co/spaces/naver/SuperFeatures, great work. Would you be interested in updating the demo for the gradio blocks competition this month: https://huggingface.co/Gradio-Blocks or to add another model to the competition, thanks!

opened by AK391 0

Lowe’s first-to-second neighbor ratio test

nice work. https://github.com/naver/fire/blob/main/losses.py#L46

    dist2_second2 = torch.argmin(dist2, dim=0)
    ratio1to2 = dist[best2,arange] / dist2_second2. # should it be  dist[best2,arange] /dist2[dist2_second2:arange] ?

opened by sysuzyq 0

Result reproduction problem

I download the trained model fire.pth and just run : python evaluate.py eval_fire.yml Testing open source data roxford5k and rparis6k. But I can not get the results mentioned in the paper.

this is my testing result:

HOW INFO: Evaluated roxford5k: mAP E: 1.02, M: 1.67, H: 0.78 HOW INFO: Evaluated roxford5k: mP@k(1, 5, 10) E: [0. 2.06 2.65], M: [0. 2.29 3. ], H: [0. 0.29 0.43]

HOW INFO: Evaluated rparis6k: mAP E: 1.61, M: 3.98, H: 2.51 HOW INFO: Evaluated rparis6k: mP@k(1, 5, 10) E: [0. 0. 0.], M: [11.43 3.71 3.71], H: [11.43 3.71 3.71]

Could you please tell me why the score I got is so low ?

opened by TouchSkyWf 0

Learning Super-Features for Image Retrieval

Related tags

Overview

Learning Super-Features for Image Retrieval

License

Preparation

Evaluating our ICLR'22 FIRe model

Training a FIRe model

Pretrained models

You might also like...

A Joint Video and Image Encoder for End-to-End Retrieval

Instance-level Image Retrieval using Reranking Transformers

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

cisip-FIRe - Fast Image Retrieval

Multimodal commodity image retrieval 多模态商品图像检索

Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Retrieval.

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

Learning embeddings for classification, retrieval and ranking.

Joint Learning of 3D Shape Retrieval and Deformation, CVPR 2021

Comments

Huggingface Spaces

Gradio Blocks Demo

Lowe’s first-to-second neighbor ratio test

Result reproduction problem

Owner

NAVER

Super-Fast-Adversarial-Training - A PyTorch Implementation code for developing super fast adversarial training

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Static Features Classifier - A static features classifier for Point-Could clusters using an Attention-RNN model

Code for 'Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning', ICCV 2021

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

PyTorch code for 'Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning'

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Activity image-based video retrieval

Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback