Official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

Related tags

Deep Learning BALLAD
Overview

BALLAD

This is the official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

image

Requirements

  • Python3
  • Pytorch(1.7.1 recommended)
  • yaml
  • other necessary packages

Datasets

  • ImageNet_LT
  • Places_LT

Download the ImageNet_2014 and Places_365.

Modify the data_root in main.py to refer to your own dataset path.

Training

Phase A

python main.py --cfg ./config/ImageNet_LT/clip_A_rn50.yaml

Phase B

python main.py --cfg ./config/ImageNet_LT/clip_B_rn50.yaml

Testing

python main.py --cfg ./config/ImageNet_LT/test.yaml --test

Acknowledgments

The codes is based on https://github.com/zhmiao/OpenLongTailRecognition-OLTR and motivated by https://github.com/facebookresearch/classifier-balancing.

Comments
  • Approximate Training Time

    Approximate Training Time

    Hi. May I ask how long does it take to train the model with setting "ResNet50×16" on ImageNet-LT using an 8xA100 or 8xV100 server?

    I would appreciate it a lot if you could provide me an approximate training time.

    opened by sandylaker 2
  • Question about the paper.

    Question about the paper.

    Hi @TeleeMa , thanks for your inspiring work in applying the vision-language model to long-tailed recognition. And I have some questions about the precision of the zero-shot CLIP on the Imagenet-LT test split. In the paper, you point out the results are balanced on many-shots (59.4%), medium-shots (57.5%), and low-shots (57.6%) subsets and the overall performance (58.2%). And the performance of the original CLIP on the Imagenet are 73.3%: image So I wonder if there is any difference in settings between BALLAD and CLIP? Looking forward to your reply and thanks in advance~

    opened by waveboo 1
  • Training Config of PlacesLT

    Training Config of PlacesLT

    Hello,

    Great work! Thanks a lot for sharing the codes!

    I can find the config of training on ImageNetLT, but the configs of training on PlaceLT and CIFAR10 are missing. Also, I cannot find the details of training hyperparameters in the original paper.

    Could you please share the config files for PlaceLT and CIFAR10? Or you can simply tell me the details of hyperparameters here, please. Thanks for your help!

    opened by chrisyxue 0
  • Releasing the pre-trained models

    Releasing the pre-trained models

    Thank you for your awesome work, it's very inspiring.

    I just wonder if you have any plan of releasing the checkpoint of your model? It would be nice to have them so we don't need to re-run the experiments. :)

    opened by Duconnor 0
  • The version of CIFAR

    The version of CIFAR

    Thanks for your great work! I have a slight problem, will you release the version of CIFAR? The small dataset will be more helpful and feasible for us to train the model. Thank you!

    opened by madoka109 0
Owner
peng gao
Young Scientist at Shanghai AI Lab
peng gao
Jingju baseline - A baseline model of our project of Beijing opera script generation

Jingju Baseline It is a baseline of our project about Beijing opera script gener

midon 1 Jan 14, 2022
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

This repository contains code for the following two papers: VisualBERT: A Simple and Performant Baseline for Vision and Language (arxiv) with a short

Natural Language Processing @UCLA 463 Dec 9, 2022
Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective Zhengzhuo Xu, Zenghao Chai, Chun Yuan This is the PyTorch implement

Sincere 16 Dec 15, 2022
Code for the AAAI-2022 paper: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification (AAAI 2022) Prerequisite PyTorch >= 1.2.0 P

null 16 Dec 14, 2022
Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

MUGE Multimodal Retrieval Baseline This repo is implemented based on the open_cl

null 47 Dec 16, 2022
Image-generation-baseline - MUGE Text To Image Generation Baseline

MUGE Text To Image Generation Baseline Requirements and Installation More detail

null 23 Oct 17, 2022
Improving Calibration for Long-Tailed Recognition (CVPR2021)

MiSLAS Improving Calibration for Long-Tailed Recognition Authors: Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia [arXiv] [slide] [BibTeX] Introductio

Jia Research Lab 116 Dec 20, 2022
Improving Calibration for Long-Tailed Recognition (CVPR2021)

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Jia Research Lab 19 Apr 28, 2021
Pytorch implementation for "Adversarial Robustness under Long-Tailed Distribution" (CVPR 2021 Oral)

Adversarial Long-Tail This repository contains the PyTorch implementation of the paper: Adversarial Robustness under Long-Tailed Distribution, CVPR 20

Tong WU 89 Dec 15, 2022
Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

Exploring Classification Equilibrium in Long-Tailed Object Detection (LOCE, ICCV 2021) Paper Introduction The conventional detectors tend to make imba

null 52 Nov 21, 2022
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

Large-Scale Long-Tailed Recognition in an Open World [Project] [Paper] [Blog] Overview Open Long-Tailed Recognition (OLTR) is the author's re-implemen

Zhongqi Miao 761 Dec 26, 2022
Awesome Long-Tailed Learning

Awesome Long-Tailed Learning This repo pays specially attention to the long-tailed distribution, where labels follow a long-tailed or power-law distri

Stomach_ache 284 Jan 6, 2023
Improving Calibration for Long-Tailed Recognition (CVPR2021)

MiSLAS Improving Calibration for Long-Tailed Recognition Authors: Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia [arXiv] [slide] [BibTeX] Introductio

DV Lab 116 Dec 20, 2022
Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

[AAAI22] Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification We point out the overlooked unbiasedness in long-tailed clas

PatatiPatata 28 Oct 18, 2022
On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks We provide the code (in PyTorch) and datasets for our paper "On Size-Orient

Zemin Liu 4 Jun 18, 2022
Official Code for ICML 2021 paper "Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline"

Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline Ankit Goyal, Hei Law, Bowei Liu, Alejandro Newell, Jia Deng Internati

Princeton Vision & Learning Lab 115 Jan 4, 2023
Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

Hurdles to Progress in Long-form Question Answering This repository contains the official scripts and datasets accompanying our NAACL 2021 paper, "Hur

Kalpesh Krishna 41 Nov 8, 2022
This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation This repo is the official implementation of "DeciWatch: A Simple Baseline for

null 117 Dec 24, 2022
[ECCVW2020] Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DiMP)

Feel free to visit my homepage Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DIMP) [ECCVW2020 paper] Presentation

Seokeon Choi 35 Oct 26, 2022