A Simple Long-Tailed Rocognition Baseline via Vision-Language Model

Teli Ma

Last update: Jan 20, 2022

Related tags

Deep Learning BALLAD

Overview

BALLAD

This is the official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

Requirements

Python3
Pytorch(1.7.1 recommended)
yaml
other necessary packages

Datasets

ImageNet_LT
Places_LT

Download the ImageNet_2014 and Places_365.

Modify the data_root in main.py to refer to your own dataset path.

Training

Phase A

python main.py --cfg ./config/ImageNet_LT/clip_A_rn50.yaml

Phase B

python main.py --cfg ./config/ImageNet_LT/clip_B_rn50.yaml

Testing

python main.py --cfg ./config/ImageNet_LT/test.yaml --test

Acknowledgments

The codes is based on https://github.com/zhmiao/OpenLongTailRecognition-OLTR and motivated by https://github.com/facebookresearch/classifier-balancing.

Awesome Long-Tailed Learning

Awesome Long-Tailed Learning This repo pays specially attention to the long-tailed distribution, where labels follow a long-tailed or power-law distri

284 Jan 6, 2023

Improving Calibration for Long-Tailed Recognition (CVPR2021)

MiSLAS Improving Calibration for Long-Tailed Recognition Authors: Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia [arXiv] [slide] [BibTeX] Introductio

116 Dec 20, 2022

Code for the AAAI-2022 paper: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification (AAAI 2022) Prerequisite PyTorch = 1.2.0 P

16 Dec 14, 2022

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

[AAAI22] Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification We point out the overlooked unbiasedness in long-tailed clas

28 Oct 18, 2022

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks We provide the code (in PyTorch) and datasets for our paper "On Size-Orient

4 Jun 18, 2022

[ECCVW2020] Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DiMP)

Feel free to visit my homepage Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DIMP) [ECCVW2020 paper] Presentation

35 Oct 26, 2022

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

3 Mar 30, 2022

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps Here is the code for ssbassline model. We also provide OCR results/features/mode

51 Nov 18, 2022

A task-agnostic vision-language architecture as a step towards General Purpose Vision

Towards General Purpose Vision Systems By Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, and Derek Hoiem Overview Welcome to the official code base f

79 Dec 23, 2022

Owner

Teli Ma

GitHub

A Simple Long-Tailed Rocognition Baseline via Vision-Language Model

Related tags

Overview

BALLAD

Requirements

Datasets

Training

Phase A

Phase B

Testing

Acknowledgments

You might also like...

Awesome Long-Tailed Learning

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Code for the AAAI-2022 paper: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification"

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

[ECCVW2020] Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DiMP)

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]

A task-agnostic vision-language architecture as a step towards General Purpose Vision

Owner

Teli Ma

Jingju baseline - A baseline model of our project of Beijing opera script generation

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

Image-generation-baseline - MUGE Text To Image Generation Baseline

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Pytorch implementation for "Adversarial Robustness under Long-Tailed Distribution" (CVPR 2021 Oral)

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)