Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

Jia Research Lab

Last update: Dec 29, 2022

Related tags

Computer Vision SA-AutoAug

Overview

SA-AutoAug

Scale-aware Automatic Augmentation for Object Detection

Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia

[Paper] [BibTeX]

This project provides the implementation for the CVPR 2021 paper "Scale-aware Automatic Augmentation for Object Detection". Scale-aware AutoAug provides a new search space and search metric to find effective data agumentation policies for object detection. It is implemented on maskrcnn-benchmark and FCOS. Both search and training codes have been released. To facilitate more use, we re-implement the training code based on Detectron2.

Installation

For maskrcnn-benchmark code, please follow INSTALL.md for instruction.

For FCOS code, please follow INSTALL.md for instruction.

For Detectron2 code, please follow INSTALL.md for instruction.

Search

(You can skip this step and directly train on our searched policies.)

To search with 8 GPUs, run:

cd /path/to/SA-AutoAug/maskrcnn-benchmark
export NGPUS=8
python3 -m torch.distributed.launch --nproc_per_node=$NGPUS tools/search.py --config-file configs/SA_AutoAug/retinanet_R-50-FPN_search.yaml OURPUT_DIR /path/to/searchlog_dir

Since we finetune on an existing baseline model during search, a baseline model is needed. You can download this model for search, or you can use other Retinanet baseline model trained by yourself.

Training

To train the searched policies on maskrcnn-benchmark (FCOS)

cd /path/to/SA-AutoAug/maskrcnn-benchmark
export NGPUS=8
python3 -m torch.distributed.launch --nproc_per_node=$NGPUS tools/train_net.py --config-file configs/SA_AutoAug/CONFIG_FILE  OUTPUT_DIR /path/to/traininglog_dir

For example, to train the retinanet ResNet-50 model with our searched data augmentation policies in 6x schedule:

cd /path/to/SA-AutoAug/maskrcnn-benchmark
export NGPUS=8
python3 -m torch.distributed.launch --nproc_per_node=$NGPUS tools/train_net.py --config-file configs/SA_AutoAug/retinanet_R-50-FPN_6x.yaml  OUTPUT_DIR models/retinanet_R-50-FPN_6x_SAAutoAug

To train the searched policies on detectron2

cd /path/to/SA-AutoAug/detectron2
python3 ./tools/train_net.py --num-gpus 8 --config-file ./configs/COCO-Detection/SA_AutoAug/CONFIG_FILE OUTPUT_DIR /path/to/traininglog_dir

For example, to train the retinanet ResNet-50 model with our searched data augmentation policies in 6x schedule:

cd /path/to/SA-AutoAug/detectron2
python3 ./tools/train_net.py --num-gpus 8 --config-file ./configs/COCO-Detection/SA_AutoAug/retinanet_R_50_FPN_6x.yaml OUTPUT_DIR output_retinanet_R_50_FPN_6x_SAAutoAug

Results

We provide the results on COCO val2017 set with pretrained models.

Based on maskrcnn-benchmark

Method	Backbone	AP_bbox	Download
Faster R-CNN	ResNet-50	41.8	Model
Faster R-CNN	ResNet-101	44.2	Model
RetinaNet	ResNet-50	41.4	Model
RetinaNet	ResNet-101	42.8	Model
Mask R-CNN	ResNet-50	42.8	Model
Mask R-CNN	ResNet-101	45.3	Model

Based on FCOS

Method	Backbone	AP_bbox	Download
FCOS	ResNet-50	42.6	Model
FCOS	ResNet-101	44.0	Model
ATSS	ResNext-101-32x8d-dcnv2	48.5	Model
ATSS	ResNext-101-32x8d-dcnv2 (1200 size)	49.6	Model

Based on Detectron2

Method	Backbone	AP_bbox	Download
Faster R-CNN	ResNet-50	41.9	Model - Metrics
Faster R-CNN	ResNet-101	44.2	Model - Metrics
RetinaNet	ResNet-50	40.8	Model - Metrics
RetinaNet	ResNet-101	43.1	Model - Metrics
Mask R-CNN	ResNet-50	-	Training
Mask R-CNN	ResNet-101	-	Training

Citing SA-AutoAug

Consider cite SA-Autoaug in your publications if it helps your research.

@inproceedings{saautoaug,
  title={Scale-aware Automatic Augmentation for Object Detection},
  author={Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Acknowledgments

This training code of this project is built on maskrcnn-benchmark, Detectron2, FCOS, and ATSS. The search code of this project is modified from DetNAS. Some augmentation code and settings follow AutoAug-Det. We thanks a lot for the authors of these projects.

Note that:

(1) We also provides script files for search and training in maskrcnn-benchmark, FCOS, and, detectron2.

(2) Any issues or pull requests on this project are welcome. In addition, if you meet problems when applying the augmentations to other datasets or codebase, feel free to contact Yukang Chen ([email protected]).

Comments

模型训练时使用的增强策略，参数和论文中的不一样

感谢作者的伟大贡献，我在detectron2环境下训练了自己的数据集，训练时输出了使用的增强策略，发现和论文中的表9的增强策略有些不一样，这是为什么呢？ {'zoom_out': {'prob': 0.30000000000000004, 'level': 9}, 'zoom_in': {'prob': 0.25, 'level': 3}} {'policies': [[('Color', 0.4, 2), ('translateX', 0.4, 4)], [('Brightness', 0.2, 4), ('rotate', 0.4, 2)], [('Sharpness', 0.4, 2), ('shearX', 0.2, 6)], [('SolarizeAdd', 0.2, 2), ('hflip', 0.5, 1)], [('Color', 0.0, 8), ('translateY', 0.2, 8)]], 'scale_ratios': {'area': [7, 5, 1], 'prob': [3, 3, 3]}} 论文中的表9的增强策略

opened by gfzwytc 2
Translate set to (0, 0) in geometric _augs scripts

The translate variable to be used in the box-level geometric augmentations in the FCOS training scripts is set to (0, 0) here: https://github.com/dvlab-research/SA-AutoAug/blob/e69a0a841b00d638c65647c7efb5e3e1b782bad5/FCOS/fcos_core/augmentations/box_level_augs/geometric_augs.py#L35

opened by deepanshi-s 1
Regarding the geometric bbox augmentation.

Hi, @yukang2017

On this line of code you set the translation value to (0,0) even if you apply the translate augmentation.

That being said translate[0] + translate[1] != 0 will never be True.

Is this the expected behaviour?

opened by gkagkos 1
Question about bounding boxes augmentation

https://github.com/dvlab-research/SA-AutoAug/blob/master/FCOS/fcos_core/augmentations/box_level_augs/geometric_augs.py#L24

I saw the bounding box augmentation codes and I think there are some differences with AutoAugment-Det codes.

https://github.com/tensorflow/tpu/blob/3679ca6b979349dde6da7156be2528428b000c7c/models/official/detection/utils/autoaugment_utils.py#L505

What I understand is AutoAugment-Det implement translate_only_bbox as translate image patch with bounding box unchanged, but SA-AutoAug implement translate bounding boxes also image patches (there may be some redundant object cause of paste)

And just curious why just paste the image patches rather than translate the image patch?

There could be visual artifacts cause of paste on translation

opened by Jasonlee1995 1
Search script with detectron2

Hi I'm curious if there is a search script for detectron2? I know there is a search.sh for maskrcnn framework, but I want to put all my work with detectron2 :D

Thanks for this great work!

opened by james77777778 1
Training with already searched policy.

The 6x schedule is too long for training. If I want to search augmentation policy in the first time and use the searched policy after, How can I do that? @yanwei-li Thanks.

opened by qianwangn 1

Owner

Jia Research Lab

Research lab focusing on CV led by Prof. Jiaya Jia

GitHub

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

BRNet Introduction This is a release of the code of our paper Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds,

86 Oct 5, 2022

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

ST3D Code release for the paper ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection, CVPR 2021 Authors: Jihan Yang*, Shaoshu

224 Dec 28, 2022

deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications

Automatic Weapon Detection Deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications. Loved the pro

4 Mar 4, 2022

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

This is an oriented object detector based on tensorflow object detection API. Most of the code is not changed except for those related to the need of

30 Oct 22, 2022

A novel region proposal network for more general object detection ( including scene text detection ).

DeRPN: Taking a further step toward more general object detection DeRPN is a novel region proposal network which concentrates on improving the adaptiv

Deep Learning and Vision Computing Lab, SCUT

151 Dec 12, 2022

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

YOLOv5_DOTA_OBB YOLOv5 in DOTA_OBB dataset with CSL_label.(Oriented Object Detection) Datasets and pretrained checkpoint Datasets : DOTA Pretrained Ch

1.1k Dec 30, 2022

Morphological edge detection or object's boundary detection using erosion and dialation in OpenCV python

Morphologycal-edge-detection-using-erosion-and-dialation the task is to detect object boundary using erosion or dialation . Here, use the kernel or st

3 Nov 25, 2022

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

1 Jan 1, 2022

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network Introduction This is a tensorflow re-implementation of PSENet: Shape Robu

498 Dec 30, 2022

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

News Python3 implementations of PSENet [1], PAN [2] and PAN++ [3] are released at https://github.com/whai362/pan_pp.pytorch. [1] W. Wang, E. Xie, X. L

1.1k Dec 24, 2022

Image augmentation for machine learning experiments.

imgaug This python library helps you with augmenting images for your machine learning projects. It converts a set of input images into a new, much lar

13.2k Jan 2, 2023

Image augmentation library in Python for machine learning.

Augmentor is an image augmentation library in Python for machine learning. It aims to be a standalone library that is platform and framework independe

4.8k Jan 4, 2023

Fast image augmentation library and easy to use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about library: https://www.mdpi.com/2078-2489/11/2/125

Albumentations Albumentations is a Python library for image augmentation. Image augmentation is used in deep learning and computer vision tasks to inc

11.4k Jan 2, 2023

Geometric Augmentation for Text Image

Text Image Augmentation A general geometric augmentation tool for text images in the CVPR 2020 paper "Learn to Augment: Joint Data Augmentation and Ne

440 Jan 5, 2023

The code for “Oriented RepPoints for Aerail Object Detection”

Oriented RepPoints for Aerial Object Detection The code for the implementation of “Oriented RepPoints”, Under review. (arXiv preprint) Introduction Or

207 Dec 24, 2022

OCR, Object Detection, Number Plate, Real Time

README.md PrePareded anaconda env requirements.txt clova AI → deep text recognition → trained weights (ex, .pth) wpod-net weights (ex, .h5 , .json) ht

7 Dec 6, 2022

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

MTLFace This repository contains the PyTorch implementation and the dataset of the paper: When Age-Invariant Face Recognition Meets Face Age Synthesis

120 Jan 5, 2023

CVPR 2021 Oral paper "LED2-Net: Monocular 360˚ Layout Estimation via Differentiable Depth Rendering" official PyTorch implementation.

LED2-Net This is PyTorch implementation of our CVPR 2021 Oral paper "LED2-Net: Monocular 360˚ Layout Estimation via Differentiable Depth Rendering". Y

83 Jan 4, 2023

Distilling Knowledge via Knowledge Review, CVPR 2021

ReviewKD Distilling Knowledge via Knowledge Review Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia This project provides an implementation for the

194 Dec 28, 2022

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

Related tags

Overview

SA-AutoAug

Installation

Search

Training

Results

Based on maskrcnn-benchmark

Based on FCOS

Based on Detectron2

Citing SA-AutoAug

Acknowledgments

Comments

模型训练时使用的增强策略，参数和论文中的不一样

Translate set to (0, 0) in geometric _augs scripts

Regarding the geometric bbox augmentation.

Question about bounding boxes augmentation

Search script with detectron2

Training with already searched policy.

Owner

Jia Research Lab

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

A novel region proposal network for more general object detection ( including scene text detection ).

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Morphological edge detection or object's boundary detection using erosion and dialation in OpenCV python

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

Image augmentation for machine learning experiments.

Image augmentation library in Python for machine learning.

Fast image augmentation library and easy to use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about library: https://www.mdpi.com/2078-2489/11/2/125

Geometric Augmentation for Text Image

The code for “Oriented RepPoints for Aerail Object Detection”

OCR, Object Detection, Number Plate, Real Time

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

CVPR 2021 Oral paper "LED2-Net: Monocular 360˚ Layout Estimation via Differentiable Depth Rendering" official PyTorch implementation.

Distilling Knowledge via Knowledge Review, CVPR 2021