Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

Yin Cui

Last update: Oct 1, 2022

Related tags

Deep Learning computer-vision deep-learning tensorflow image-classification transfer-learning fine-grained fine-grained-classification cvpr2018 fine-grained-visual-categorization

Overview

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

TensorFlow code and models for the paper:

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge Belongie
CVPR 2018

This repository contains the code and pre-trained models used in the paper, plus 2 demos that show: 1) the importance of pre-training data for transfer learning; 2) how to calculate the domain similarity between a source domain and a target domain.

Note that we used a mini validation set (./inat_minival.txt) of 9,697 images randomly selected from the original iNaturalist 2017 validation set. The rest of the validation images were combined with the original training set to train the models in the paper, giving 665,473 training images in total.
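The split described above can be sketched in a few lines. This is only an illustration, not the script that produced ./inat_minival.txt: the function name split_minival, the seed and the integer ids are assumptions. The sizes of the original iNaturalist 2017 training set (579,184) and validation set (95,986) are taken from the public dataset statistics, and they are consistent with the 665,473 total stated above.

```python
import random

# Assumed sizes of the original iNaturalist 2017 splits (public dataset statistics).
INAT2017_TRAIN = 579184
INAT2017_VAL = 95986
MINIVAL_SIZE = 9697  # size of ./inat_minival.txt as stated in this README


def split_minival(val_ids, minival_size, seed=0):
    """Randomly hold out `minival_size` validation ids; return (minival, rest)."""
    rng = random.Random(seed)  # hypothetical seed, used only for reproducibility
    minival = set(rng.sample(val_ids, minival_size))
    rest = [i for i in val_ids if i not in minival]
    return sorted(minival), rest


# Stand-in ids; real entries would be image paths from the validation set.
val_ids = list(range(INAT2017_VAL))
minival, rest_of_val = split_minival(val_ids, MINIVAL_SIZE)

# Training set used in the paper = original train + validation images not in minival.
num_train = INAT2017_TRAIN + len(rest_of_val)
print(len(minival), len(rest_of_val), num_train)  # prints: 9697 86289 665473
```

The 86,289 validation images that end up in training are roughly 90% of the original validation set.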

Dependencies:

Python (3.5)
TensorFlow (1.11)
pyemd
scikit-learn
scikit-image

Preparation:

Clone the repo together with its submodules (--recursive):

git clone --recursive https://github.com/richardaecn/cvpr18-inaturalist-transfer.git

Install dependencies. Please refer to the official TensorFlow, pyemd, scikit-learn and scikit-image websites for installation guides.
Download the data and features and unzip them into the same directory as the cloned repo. You should then have two folders, './data' and './feature', in the repo's directory.

Datasets (optional):

In the paper, we used data from 9 publicly available datasets.

We provide a download link that includes the entire CUB-200-2011 dataset and the data splits for the remaining 8 datasets. The provided link contains sufficient data for this repo. If you would like to use the other 8 datasets, please download them from their official websites and put them in the corresponding subfolders under './data'.

Pre-trained Models (optional):

The models were trained using TensorFlow-Slim. We implemented Squeeze-and-Excitation Networks (SENet) under './slim'. The pre-trained models can be downloaded from the following links:

Network	Pre-trained Data	Input Size	Download Link
Inception-V3	ImageNet	299	link
Inception-V3	iNat2017	299	link
Inception-V3	iNat2017	448	link
Inception-V3	iNat2017	299 -> 560 FT¹	link
Inception-V3	ImageNet + iNat2017	299	link
Inception-V3 SE	ImageNet + iNat2017	299	link
Inception-V4	iNat2017	448	link
Inception-V4	iNat2017	448 -> 560 FT²	link
Inception-ResNet-V2	ImageNet + iNat2017	299	link
Inception-ResNet-V2 SE	ImageNet + iNat2017	299	link
ResNet-V2 50	ImageNet + iNat2017	299	link
ResNet-V2 101	ImageNet + iNat2017	299	link
ResNet-V2 152	ImageNet + iNat2017	299	link

¹ This model was trained with 299 input size on train + 90% val and then fine-tuned with 560 input size on 90% val.

² This model was trained with 448 input size on train + 90% val and then fine-tuned with 560 input size on 90% val.

TensorFlow Hub also provides an Inception-V3 (299 input size) pre-trained on the original iNat2017 training set.

Feature Extraction (optional):

Run the following Python script to extract features:

python feature_extraction.py

To run this script, you need to download the checkpoint of Inception-V3 299 trained on iNat2017. The dataset and pre-trained model can be modified in the script.

We provide a download link that includes the features used in the demos of this repo.

Demos

Linear logistic regression on extracted features:

This demo shows the importance of pre-training data for transfer learning. Using features extracted from an Inception-V3 pre-trained on iNat2017, we achieve 89.9% classification accuracy on CUB-200-2011 with simple logistic regression, outperforming most state-of-the-art methods.

LinearClassifierDemo.ipynb
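The idea behind this demo can be sketched with scikit-learn as follows. This is a minimal illustration and not the notebook's code: the features here are synthetic stand-ins (random class centers plus noise), and the dimensions, split and max_iter value are assumptions chosen so the example runs quickly. In the real demo the inputs would be the extracted Inception-V3 features and the CUB-200-2011 labels.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)

# Synthetic stand-in for extracted features: 3 classes, 16-D, well separated.
num_classes, dim, per_class = 3, 16, 40
centers = rng.normal(scale=5.0, size=(num_classes, dim))
labels = np.repeat(np.arange(num_classes), per_class)
features = centers[labels] + rng.normal(size=(labels.size, dim))

# Alternate samples between the train and test splits.
train_idx = np.arange(labels.size) % 2 == 0
test_idx = ~train_idx

# Fit a linear classifier on frozen features; the network itself is not fine-tuned.
clf = LogisticRegression(max_iter=1000)
clf.fit(features[train_idx], labels[train_idx])
accuracy = clf.score(features[test_idx], labels[test_idx])
print("test accuracy: %.3f" % accuracy)
```

Because the classifier is linear and the features are fixed, the resulting accuracy mostly reflects the quality of the pre-trained representation, which is the point the demo makes.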

Calculating domain similarity by Earth Mover's Distance (EMD):

This demo shows how to calculate the domain similarity proposed in the paper. The results correspond to part of Fig. 5 in the original paper.

DomainSimilarityDemo.ipynb
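In the paper, each category is represented by the mean feature of its images and weighted by its share of the domain's images. The distance between a source domain S and a target domain T is the EMD, d(S, T) = sum_ij f_ij d_ij / sum_ij f_ij, with d_ij = ||g(s_i) - g(t_j)|| and f_ij the optimal flow, and the similarity is sim(S, T) = exp(-gamma * d(S, T)), with gamma = 0.01 in the paper's experiments. The sketch below is a hedged illustration of that definition, not the notebook's code: it solves the transport problem with scipy.optimize.linprog instead of pyemd (SciPy is not listed in the dependencies above, although scikit-learn requires it), and the function names and toy inputs are assumptions.

```python
import numpy as np
from scipy.optimize import linprog


def earth_movers_distance(src_weights, tgt_weights, ground_dist):
    """EMD between two weighted sets, solved as a transport linear program.

    src_weights (m,) and tgt_weights (n,) must each sum to 1;
    ground_dist is the (m, n) matrix of pairwise distances.
    """
    m, n = ground_dist.shape
    # Flow variables f_ij are flattened row-major: index = i * n + j.
    a_eq = np.zeros((m + n, m * n))
    for i in range(m):
        a_eq[i, i * n:(i + 1) * n] = 1.0  # sum_j f_ij = src_weights[i]
    for j in range(n):
        a_eq[m + j, j::n] = 1.0           # sum_i f_ij = tgt_weights[j]
    b_eq = np.concatenate([src_weights, tgt_weights])
    # linprog's default bounds (0, None) keep every flow non-negative.
    result = linprog(ground_dist.ravel(), A_eq=a_eq, b_eq=b_eq)
    return result.fun  # total flow is 1, so no further normalization is needed


def domain_similarity(src_feats, src_counts, tgt_feats, tgt_counts, gamma=0.01):
    """sim = exp(-gamma * EMD) between per-class mean features of two domains."""
    src_w = np.asarray(src_counts, dtype=float)
    tgt_w = np.asarray(tgt_counts, dtype=float)
    src_w, tgt_w = src_w / src_w.sum(), tgt_w / tgt_w.sum()
    # Euclidean distance between every source and target class prototype.
    diff = src_feats[:, None, :] - tgt_feats[None, :, :]
    ground_dist = np.linalg.norm(diff, axis=2)
    return np.exp(-gamma * earth_movers_distance(src_w, tgt_w, ground_dist))


# Toy domains: two source classes 10 units apart, one target class at the origin.
src_feats = np.array([[0.0, 0.0], [10.0, 0.0]])
src_counts = np.array([100, 100])
tgt_feats = np.array([[0.0, 0.0]])
tgt_counts = np.array([50])

sim_same = domain_similarity(src_feats, src_counts, src_feats, src_counts)
sim_diff = domain_similarity(src_feats, src_counts, tgt_feats, tgt_counts)
print("same domain: %.4f, different domain: %.4f" % (sim_same, sim_diff))
```

In the toy example, half of the source mass has to travel 10 units to reach the single target class, so the EMD is 5 and the similarity is exp(-0.05); comparing a domain with itself gives an EMD of 0 and a similarity of 1.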

Training and Evaluation

Convert dataset into '.tfrecord':

python convert_dataset.py --dataset_name=cub_200 --num_shards=10
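As a rough illustration of what --num_shards=10 implies, the sketch below splits a list of images into contiguous, near-equal shards. We have not confirmed how convert_dataset.py assigns images to shards; the helper names and the file-name pattern are hypothetical and only follow the style commonly used by TF-Slim dataset conversion scripts. The count of 5,994 is the size of the standard CUB-200-2011 training split.

```python
import math


def shard_ranges(num_items, num_shards):
    """Return [(start, end), ...] index ranges, one per shard."""
    per_shard = int(math.ceil(num_items / float(num_shards)))
    return [(min(s * per_shard, num_items), min((s + 1) * per_shard, num_items))
            for s in range(num_shards)]


def shard_name(dataset_name, split, shard_id, num_shards):
    # Hypothetical naming pattern in the style of TF-Slim conversion scripts.
    return "%s_%s_%05d-of-%05d.tfrecord" % (dataset_name, split, shard_id, num_shards)


ranges = shard_ranges(5994, 10)  # 5,994 = CUB-200-2011 training images
print(ranges[0], ranges[-1])     # prints: (0, 600) (5400, 5994)
print(shard_name("cub_200", "train", 0, 10))
```

With 10 shards, the first nine hold 600 images each and the last holds the remaining 594.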

Train (fine-tune) the model on 1 GPU:

CUDA_VISIBLE_DEVICES=0 ./train.sh

Evaluate the model on another GPU simultaneously:

CUDA_VISIBLE_DEVICES=1 ./eval.sh

Run TensorBoard for visualization:

tensorboard --logdir=./checkpoints/cub_200/ --port=6006

Citation

If you find our work helpful in your research, please cite it as:

@inproceedings{Cui2018iNatTransfer,
  title = {Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning},
  author = {Yin Cui and Yang Song and Chen Sun and Andrew Howard and Serge Belongie},
  booktitle={CVPR},
  year={2018}
}


