Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Gina Wu

Last update: May 26, 2022

Related tags

Deep Learning Deep-RTC

Overview

Deep-RTC [project page]

This repository contains the source code accompanying our ECCV 2020 paper.

Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier
Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos

@inproceedings{Wu20DeepRTC,
	title={Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier},
	author={Tz-Ying Wu and Pedro Morgado and Pei Wang and Chih-Hui Ho and Nuno Vasconcelos},
	booktitle={European Conference on Computer Vision (ECCV)},
	year={2020}
}

Dependencies

Python (3.5.6)
PyTorch (1.2.0)
torchvision (0.4.0)
NumPy (1.15.2)
Pillow (5.2.0)
PyYaml (5.1.2)
tensorboardX (1.8)

Data preparation

CIFAR100 [Raw images] [Long-tail version]
AWA2 [Raw images]
ImageNet [Raw images] [Long-tail version]
iNaturalist [Raw images]

These datasets can be downloaded from the above links. Please organize the images in the hierarchical folders that represent the dataset hierarchy, and put the root folder under prepro/raw. For example,

prepro/raw/imagenet
--abstraction
----bubble
------ILSVRC2012_val_00014026.JPEG
------ILSVRC2012_val_00000697.JPEG
...
--physical_entity
----object
...

While CIFAR100 and iNaturalist have released taxonomies, we built the tree-type taxonomy of AWA2 and ImageNet with WordNet. All the taxonomies are provided in prepro/data/{dataset}/tree.npy, and the data splits are provided in prepro/splits/{dataset}/{split}.json. Please refer to prepro/README.md for more details. After the raw images are managed hierarchically, run

$ ./prepare_data.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. This will automatically generate the data lists for all splits, and build the codeword matrices needed for training Deep-RTC. Note that our codes can be applied to other datasets once they are organized hierarchically.

Training and evaluation

To train and evaluate Deep-RTC, run

$ export PYTHONPATH=${PWD}/prepro:${PYTHONPATH}
$ ./run.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. Our pretrained models can be downloaded here.

Comments

Question regarding CIFAR 100 preparation

Hi, I have a question regarding CIFAR 100 preparation. Usually, we utilize CIFAR 100 as a python-version on their webpage, which is divided into test and train files. However, according to the data preparation section on README.md, it should be saved as jpg with class hierarchy. According to train/valid/test splits in https://github.com/gina9726/Deep-RTC/tree/master/prepro/splits/cifar100, the file format is "Image{:5d}.jpg" and some indices are missing.

Could you tell me how to form CIFAR 100 dataset?

The Long-tailed version link just denotes the various long-tailed versions of tfrecord files.

Thank you.

opened by jd730 1
Fix hier_dataset to use the new 'tree.npy' and 'leaf_nodes.npy'

Hi, Thanks for your great work. I tried to execute the code and figured out there was an issue with the naming of the tree.npy and 'leaf_nodes.npy'.

Thanks Harsh

opened by rangwani-harsh 0

The official codes of "Semi-supervised Models are Strong Unsupervised Domain Adaptation Learners".

SSL models are Strong UDA learners Introduction This is the official code of paper "Semi-supervised Models are Strong Unsupervised Domain Adaptation L

26 Dec 26, 2022

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

MOT Tracked object bounding box association (CenterTrack++) New association method based on CenterTrack. Two new branches (Tracked Size and IOU) are a

36 Oct 4, 2022

The codes and models in 'Gaze Estimation using Transformer'.

GazeTR We provide the code of GazeTR-Hybrid in "Gaze Estimation using Transformer". We recommend you to use data processing codes provided in GazeHub.

65 Dec 27, 2022

codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification

DLCF-DCA codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification. submitted t

15 Aug 30, 2022

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

Swin-Unet The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"(https://arxiv.org/abs/2105.05537). A validatio

869 Jan 7, 2023

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data This repository provides the implementation details for

124 Dec 27, 2022

This is the official repo for TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transformations at CVPR'21. According to some product reasons, we are not planning to release the training/testing codes and models. However, we will release the dataset and the scripts to prepare the dataset.

TransFill-Reference-Inpainting This is the official repo for TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transf

80 Dec 8, 2022

Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021

Towards Diverse Paragraph Captioning for Untrimmed Videos This repository contains PyTorch implementation of our paper Towards Diverse Paragraph Capti

61 Oct 11, 2022

Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

TS-CAM: Token Semantic Coupled Attention Map for Weakly SupervisedObject Localization This is the official implementaion of paper TS-CAM: Token Semant

112 Jan 2, 2023

Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Related tags

Overview

Deep-RTC [project page]

Dependencies

Data preparation

Training and evaluation

You might also like...

The official codes of "Semi-supervised Models are Strong Unsupervised Domain Adaptation Learners".

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

The codes and models in 'Gaze Estimation using Transformer'.

codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021

Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

Comments

Question regarding CIFAR 100 preparation

Fix hier_dataset to use the new 'tree.npy' and 'leaf_nodes.npy'

Owner

Gina Wu

This is my codes that can visualize the psnr image in testing videos.

codes for Image Inpainting with External-internal Learning and Monochromic Bottleneck

Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)

Source codes for the paper "Local Additivity Based Data Augmentation for Semi-supervised NER"

Python codes for Lite Audio-Visual Speech Enhancement.

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Pytorch codes for "Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation"

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"