Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

Jialian Wu

Last update: Jan 6, 2023

Related tags

Deep Learning Forest_RCNN

Overview

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

Official implementation of:

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
Jialian Wu, Liangchen Song, Tiancai Wang, Qian Zhang and Junsong Yuan
In ACM International Conference on Multimedia , Seattle WA, October 12-16, 2020.

Many thanks to mmdetection authors for their great framework!

News

Mar 2, 2021 Update: We test Forest R-CNN on LVIS v1.0 set. Thanks for considering comparing with our method :)

Jan 1, 2021 Update: We propose Forest DetSeg, an extension of original Forest R-CNN. Forest DetSeg extends the proposed method to RetinaNet. While the new work is under review now, the code has been available. More details will come up along with the new paper.

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Forest R-CNN

Inference

# Examples
# single-gpu testing
python tools/test.py configs/lvis/forest_rcnn_r50_fpn.py forest_rcnn_res50.pth --out out.pkl --eval bbox segm

# multi-gpu testing
./tools/dist_test.sh configs/lvis/forest_rcnn_r50_fpn.py forest_rcnn_res50.pth ${GPU_NUM} --out out.pkl --eval bbox segm

Training

# Examples
# single-gpu training
python tools/train.py configs/lvis/forest_rcnn_r50_fpn.py --validate

# multi-gpu training
./tools/dist_train.sh configs/lvis/forest_rcnn_r50_fpn.py ${GPU_NUM} --validate

(Note that we found in our experiments the best result comes up around the 20-th epoch instead of the end of training.)

Forest RetinaNet

Inference

# Examples  
# multi-gpu testing
./tools/dist_test.sh configs/lvis/forest_retinanet_r50_fpn_1x.py forest_retinanet_res50.pth ${GPU_NUM} --out out.pkl --eval bbox segm

Training

# Examples    
# multi-gpu training
./tools/dist_train.sh configs/lvis/forest_retinanet_r50_fpn_1x.py ${GPU_NUM} --validate

Main Results

Instance Segmentation on LVIS v0.5 val set

AP and AP.b denote the mask AP and box AP. r, c, f represent the rare, common, frequent contegoires.

Method	Backbone	AP	AP.r	AP.c	AP.f	AP.b	AP.b.r	AP.b.c	AP.b.f	download
MaskRCNN	R50-FPN	21.7	6.8	22.6	26.4	21.8	6.5	21.6	28.0	model
Forest R-CNN	R50-FPN	25.6	18.3	26.4	27.6	25.9	16.9	26.1	29.2	model
MaskRCNN	R101-FPN	23.6	10.0	24.8	27.6	23.5	8.7	23.1	29.8	model
Forest R-CNN	R101-FPN	26.9	20.1	27.9	28.3	27.5	20.0	27.5	30.4	model
MaskRCNN	X-101-32x4d-FPN	24.8	10.0	26.4	28.6	24.8	8.6	25.0	30.9	model
Forest R-CNN	X-101-32x4d-FPN	28.5	21.6	29.7	29.7	28.8	20.6	29.2	31.7	model

Instance Segmentation on LVIS v1.0 val set

Method	Backbone	AP	AP.r	AP.c	AP.f	AP.b
MaskRCNN	R50-FPN	19.2	0.0	17.2	29.5	20.0
Forest R-CNN	R50-FPN	23.2	14.2	22.7	27.7	24.6

Visualized Examples

Citation

If you find it useful in your research, please consider citing our paper as follows:

@inproceedings{wu2020forest,
title={Forest R-CNN: Large-vocabulary long-tailed object detection and instance segmentation},
author={Wu, Jialian and Song, Liangchen and Wang, Tiancai and Zhang, Qian and Yuan, Junsong},
booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
pages={1570--1578},
year={2020}}

Comments

KeyError:'neg_category_ids'

when i run the CUDA_VISIBLE_DEVICES=1 python tools/test.py configs/lvis/forest_rcnn_r101_32_x4d_fpn.py forest_rcnn_resnext101.pth --out out.pkl --eval bbox segm after loading the dataset, it occur KeyError:'neg_category_ids' how to slove the problem

opened by zhangshen12356 6
Enhanced C++ implementation

The NMS-Resampling method is very useful and easy to understand.

However, I have found that applying the official implementation in Python (see here) really slows the training process.

I have managed to implement it in C++ and integrate it in mmcv, which in my case gives between 3x and 4x speed-ups in the whole training process.

If you are interested, I can create a pull request with the code and the instructions to make use of this faster implementation. I think that this make help pepople using this useful tool.

Best regards!

opened by jesusmolrdv 3
How to get a correct version of mmcv

Could you plz figure out the correct version of mmcv? I have tried 1.2.0, but No module named 'mmcv.cnn.weight_init' I have tried some old versions which have mmcv.cnn.weight_init, but it tells me that all tensors must be on devices[0]

opened by Chauncy-Cai 1
About NMS resampling method

Hello, author, thank you for your work! I want to use NMS resampling method alone to train on lvisv1 data set. Can you provide a new rpn_ head. py file? Of course, it would be better to have complete file code. thank you!

opened by sanmulab 0
Can't train correctly

I follow your install guide and run the training code with "./tools/dist_train.sh configs/lvis/forest_rcnn_r50_fpn.py 1 --validate". However, it always ends with the following error:

Can you explain the reason?

opened by U-Help 0
NMS Resampling-Linear

In the paper, the equation for NMS Resampling-Linear is as following: But, in my opinion, for xi in frequent classes, threshold should be alpha_f+...

opened by U-Help 1

Owner

Jialian Wu

Ph.D. Candidate at SUNY Buffalo

GitHub

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

Large-Scale Long-Tailed Recognition in an Open World [Project] [Paper] [Blog] Overview Open Long-Tailed Recognition (OLTR) is the author's re-implemen

761 Dec 26, 2022

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

赛题背景在商品知识产权领域，知识产权体现为在线商品的设计和品牌。不幸的是，在每一天，存在着非法商户通过一些对抗手段干扰商标识别来逃避侵权，这带来了很高的知识产权风险和财务损失。为了促进先进的多媒体人工智能技术的发展，以保护企业来之不易的创作和想法免受恶意使用和剽窃，因此提出了鲁棒性标识检测挑战赛

65 Dec 22, 2022

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation. Yolov5 is used to detect fire and smoke and unet is used to segment fire.

7 Jan 8, 2023

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

This project is a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

147 Dec 3, 2022

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination The offical implementation for the "NOH-NMS: Improving Pedestrian Detection by

64 Nov 11, 2022

Code for ACM MM2021 paper "Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection"

CTDNet The PyTorch code for ACM MM2021 paper "Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection" Requirements Python 3.6

28 Oct 20, 2022

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection (ACM MM'21) By Zhuofan Zong, Qianggang Cao, Biao Leng Introduction F

9 Jul 30, 2022

Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)

CMaskTrack R-CNN for OVIS This repo serves as the official code release of the CMaskTrack R-CNN model on the Occluded Video Instance Segmentation data

61 Nov 25, 2022

Complete-IoU (CIoU) Loss and Cluster-NMS for Object Detection and Instance Segmentation (YOLACT)

Complete-IoU Loss and Cluster-NMS for Improving Object Detection and Instance Segmentation. Our paper is accepted by IEEE Transactions on Cybernetics

290 Dec 25, 2022

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation This paper has been accepted and early accessed

39 Sep 20, 2022

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Swin Transformer for Object Detection This repo contains the supported code and configuration files to reproduce object detection results of Swin Tran

1.4k Dec 30, 2022

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

Related tags

Overview

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

News

Installation

Forest R-CNN

Inference

Training

Forest RetinaNet

Inference

Training

Main Results

Instance Segmentation on LVIS v0.5 val set

Instance Segmentation on LVIS v1.0 val set

Visualized Examples

Citation

Comments

KeyError:'neg_category_ids'

Enhanced C++ implementation

How to get a correct version of mmcv

About NMS resampling method

Can't train correctly

NMS Resampling-Linear

Owner

Jialian Wu

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Code for ACM MM2021 paper "Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection"

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)

Complete-IoU (CIoU) Loss and Cluster-NMS for Object Detection and Instance Segmentation (YOLACT)

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Object detection and instance segmentation toolkit based on PaddlePaddle.

Res2Net for Instance segmentation and Object detection using MaskRCNN

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Pytorch implementation for "Adversarial Robustness under Long-Tailed Distribution" (CVPR 2021 Oral)

Awesome Long-Tailed Learning

Improving Calibration for Long-Tailed Recognition (CVPR2021)