SWA Object Detection

Last update: Nov 28, 2022

Related tags

Overview

SWA Object Detection

This project hosts the scripts for training SWA object detectors, as presented in our paper:

@article{zhang2020swa,
  title={SWA Object Detection},
  author={Zhang, Haoyang and Wang, Ying and Dayoub, Feras and S{\"u}nderhauf, Niko},
  journal={arXiv preprint arXiv:2012.12645},
  year={2020}
}

The full paper is available at: https://arxiv.org/abs/2012.12645.

Introduction

Do you want to improve 1.0 AP for your object detector without any inference cost and any change to your detector? Let us tell you such a recipe. It is surprisingly simple: train your detector for an extra 12 epochs using cyclical learning rates and then average these 12 checkpoints as your final detection model. This potent recipe is inspired by Stochastic Weights Averaging (SWA), which is proposed in [1] for improving generalization in deep neural networks. We found it also very effective in object detection. In this work, we systematically investigate the effects of applying SWA to object detection as well as instance segmentation. Through extensive experiments, we discover a good policy of performing SWA in object detection, and we consistently achieve ~1.0 AP improvement over various popular detectors on the challenging COCO benchmark. We hope this work will make more researchers in object detection know this technique and help them train better object detectors.

SWA Object Detection: averaging multiple detection models leads to a better one.

Updates

2020.01.08 Reimplement the code and now it is more convenient, more flexible and easier to perform both the conventional training and SWA training. See Instructions.
2020.01.07 Update to MMDetection v2.8.0.
2020.12.24 Release the code.

Installation

This project is based on MMDetection. Therefore the installation is the same as original MMDetection.
Please check get_started.md for installation. Note that you should change the version of PyTorch and CUDA to yours when installing mmcv in step 3 and clone this repo instead of MMdetection in step 4.

If you run into problems with pycocotools, please install it by:

pip install "git+https://github.com/open-mmlab/cocoapi.git#subdirectory=pycocotools"

Usage of MMDetection

MMDetection provides colab tutorial, and full guidance for quick run with existing dataset and with new dataset for beginners. There are also tutorials for finetuning models, adding new dataset, designing data pipeline, customizing models, customizing runtime settings and useful tools.

Please refer to FAQ for frequently asked questions.

Instructions

We add a SWA training phase to the object detector training process, implement a SWA hook that helps process averaged models, and write a SWA config for conveniently deploying SWA training in training various detectors. We also provide many config files for reproducing the results in the paper.

By including the SWA config in detector config files and setting related parameters, you can have different SWA training modes.

Two-pahse mode. In this mode, the training will begin with the traditional training phase, and it continues for epochs. After that, SWA training will start, with loading the best model on the validation from the previous training phase (becasue swa_load_from = 'best_bbox_mAP.pth'in the SWA config).

As shown in swa_vfnet_r50 config, the SWA config is included at line 4 and only the SWA optimizer is reset at line 118 in this script. Note that configuring parameters in local scripts will overwrite those values inherited from the SWA config.

You can change those parameters that are included in the SWA config to use different optimizers or different learning rate schedules for the SWA training. For example, to use a different initial learning rate, say 0.02, you just need to set swa_optimizer = dict(type='SGD', lr=0.02, momentum=0.9, weight_decay=0.0001) in the SWA config (global effect) or in the swa_vfnet_r50 config (local effect).

To start the training, run:
```
./tools/dist_train.sh configs/swa/swa_vfnet_r50_fpn_1x_coco.py 8
```
Only-SWA mode. In this mode, the traditional training is skipped and only the SWA training is performed. In general, this mode should work with a pre-trained detection model which you can download from the MMDetection model zoo.

Have a look at the swa_mask_rcnn_r101 config. By setting only_swa_training = True and swa_load_from = mask_rcnn_pretraind_model, this script conducts only SWA training, starting from a pre-trained detection model. To start the training, run:
```
./tools/dist_train.sh configs/swa/swa_mask_rcnn_r101_fpn_2x_coco.py 8
```

In both modes, we have implemented the validation stage and saving functions for the SWA model. Thus, it would be easy to monitor the performance and select the best SWA model.

Results and Models

For your convenience, we provide the following SWA models. These models are obtained by averaging checkpoints that are trained with cyclical learning rates for 12 epochs.

Model	bbox AP (val)	segm AP (val)	Download
SWA-MaskRCNN-R50-1x-0.02-0.0002-38.2-34.7	39.1, +0.9	35.5, +0.8	model \| config
SWA-MaskRCNN-R101-1x-0.02-0.0002-40.0-36.1	41.0, +1.0	37.0, +0.9	model \| config
SWA-MaskRCNN-R101-2x-0.02-0.0002-40.8-36.6	41.7, +0.9	37.4, +0.8	model \| config
SWA-FasterRCNN-R50-1x-0.02-0.0002-37.4	38.4, +1.0	-	model \| config
SWA-FasterRCNN-R101-1x-0.02-0.0002-39.4	40.3, +0.9	-	model \| config
SWA-FasterRCNN-R101-2x-0.02-0.0002-39.8	40.7, +0.9	-	model \| config
SWA-RetinaNet-R50-1x-0.01-0.0001-36.5	37.8, +1.3	-	model \| config
SWA-RetinaNet-R101-1x-0.01-0.0001-38.5	39.7, +1.2	-	model \| config
SWA-RetinaNet-R101-2x-0.01-0.0001-38.9	40.0, +1.1	-	model \| config
SWA-FCOS-R50-1x-0.01-0.0001-36.6	38.0, +1.4	-	model \| config
SWA-FCOS-R101-1x-0.01-0.0001-39.2	40.3, +1.1	-	model \| config
SWA-FCOS-R101-2x-0.01-0.0001-39.1	40.2, +1.1	-	model \| config
SWA-YOLOv3(320)-D53-273e-0.001-0.00001-27.9	28.7, +0.8	-	model \| config
SWA-YOLOv3(680)-D53-273e-0.001-0.00001-33.4	34.2, +0.8	-	model \| config
SWA-VFNet-R50-1x-0.01-0.0001-41.6	42.8, +1.2	-	model \| config
SWA-VFNet-R101-1x-0.01-0.0001-43.0	44.3, +1.3	-	model \| config
SWA-VFNet-R101-2x-0.01-0.0001-43.5	44.5, +1.0	-	model \| config

Notes:

SWA-MaskRCNN-R50-1x-0.02-0.0002-38.2-34.7 means this SWA model is produced based on the pre-trained Mask RCNN model that has a ResNet50 backbone, is trained under 1x schedule with the initial learning rate 0.02 and ending learning rate 0.0002, and achieves 38.2 bbox AP and 34.7 mask AP on the COCO val2017 respectively. This SWA model acheives 39.1 bbox AP and 35.5 mask AP, which are higher than the pre-trained model by 0.9 bbox AP and 0.8 mask AP respectively. This rule applies to other object detectors.
In addition to these baseline detectors, SWA can also improve more powerful detectors. One example is VFNetX whose performance on the COCO val2017 is improved from 52.2 AP to 53.4 AP (+1.2 AP).
More detailed results including AP₅₀ and AP₇₅ can be found here.

Contributing

Any pull requests or issues are welcome.

Citation

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows:

@article{zhang2020swa,
  title={SWA Object Detection},
  author={Zhang, Haoyang and Wang, Ying and Dayoub, Feras and S{\"u}nderhauf, Niko},
  journal={arXiv preprint arXiv:2012.12645},
  year={2020}
}

Acknowledgment

Many thanks to Dr Marlies Hankel and MASSIVE HPC for supporting precious GPU computation resources!

We also would like to thank MMDetection team for producing this great object detection toolbox.

License

This project is released under the Apache 2.0 license.

References

[1] Averaging Weights Leads to Wider Optima and Better Generalization; Pavel Izmailov, Dmitry Podoprikhin, Timur Garipov, Dmitry Vetrov, Andrew Gordon Wilson; Uncertainty in Artificial Intelligence (UAI), 2018

Comments

how to use of swa repo correctly

Hello, sir. This is a nice work! I want to use this repo to improve my detection performance, but there are some question about using this repo. I have trained my detection model in mmdetection, and get the epoch_12.pth normally, and follow the Only-SWA mode make the config, after training, there are 24 models in my work_dir:

I use swa/get_swa_model.py which use swa_epoch_1.pth to swa_epoch_12.pth to produce swa_1-12.pth, and I use this model to test, the final score decrease a little, is there some problem when I use swa? Very confused, and very appreciate for your reply!!

opened by yustaub 6
Mask RCNN save the best segm model

Hi, really glad that the save_the_best_model is updated. But would like to ask, how can i save the best_segm_mAP.pth, but not best_bbox_mAP.pth? thx:)

opened by yc0619 5
Performance with other optimizer, such as Adam, AdamW

Hi, thanks for your nice work.

I am wondering the performance of SWA with other optimizers, such as Adam, Adamw; Can it achieve consistent performance gain?

If the original network is trained with Adam or Adamw, can SWA (with SGD) improve its performance?

Thanks very much.

opened by xinge008 4
what's the difference between swa_epoch_xx.pth and swa_model_xx.pth?

Hello sir! I appreciate your wonderful work which helps a lot. But there's a question I can't figure out.

When I run Two-pahse mode, after I got 12 traditional checkpoints, I can get 2 checkpoints after each epoch. I wonder the difference between them, and which one should I use?

epoch_1.pth to epoch_12.pth is traditional checkpoints

swa_epoch_xx.pth and swa_model_xx.pth are the checkpoints after swa training

opened by HuXinzhi1004 2
ModuleNotFoundError: No module named 'mmdet'

I have download the master branch of swa_object_detection. Meanwhile i have mmdetection on my pc.

I have try to train mask rcnn r50fpn with the normal mmdetection code. But this error comes always. Appreciate for the help!

opened by yc0619 2
Can I use dist_train.sh to train the original model?

I think I should train the original model at first, then start to train the extra checkpoints.

But the scripts as follows in README.md don't include the script of training the original model. I think the first script as follows only trains the extra checkpoints of 12/24 epochs. And the second script as follows averages these 12/24 checkpoints for final detection model. (1)./tools/dist_train.sh configs/swa/swa_mask_rcnn_r101_fpn_2x_coco.py (2)./swa/get_swa_model.py work_dirs/swa_mask_rcnn_r101_fpn_2x_coco 1 12 --save_dir work_dirs/swa_mask_rcnn

Maybe I can use this script as follow to train the original model. ./tools/dist_train.sh configs/mask_rcnn/mask_rcnn_r101_fpn_2x_coco.py work_dir=mymodel Then I can use this script as follow to train the extra checkpoints. And I must set work_dir to the path where saves the original model. ./tools/dist_train.sh configs/swa/swa_mask_rcnn_r101_fpn_2x_coco.py work_dir=mymodel

Am I right? Thanks very much!

opened by yushanshan05 2
When I run Two-pahse mode I met error

Traceback (most recent call last): File "tools/train.py", line 188, in main() File "tools/train.py", line 184, in main meta=meta) File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmdet-2.12.0-py3.7.egg/mmdet/apis/train.py", line 175, in train_detector runner.run(data_loaders, cfg.workflow) File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmcv/runner/epoch_based_runner.py", line 125, in run epoch_runner(data_loaders[i], **kwargs) File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmcv/runner/epoch_based_runner.py", line 54, in train self.call_hook('after_train_epoch') File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmcv/runner/base_runner.py", line 307, in call_hook getattr(hook, fn_name)(self) File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmdet-2.12.0-py3.7.egg/mmdet/core/evaluation/eval_hooks.py", line 149, in after_train_epoch self.save_best_checkpoint(runner, key_score) File "/home/hxz/anaconda3/envs/mmdnew/lib/python3.7/site-packages/mmdet-2.12.0-py3.7.egg/mmdet/core/evaluation/eval_hooks.py", line 166, in save_best_checkpoint last_ckpt = runner.meta['hook_msgs']['last_ckpt'] KeyError: 'last_ckpt'

opened by HuXinzhi1004 0
when i run only_swa_training i meet a error
Environment

Python: 3.7.10 | packaged by conda-forge | (default, Feb 19 2021, 16:07:37) [GCC 9.3.0] CUDA available: True GPU 0,1: Tesla V100-SXM2-32GB CUDA_HOME: /usr/local/cuda NVCC: Cuda compilation tools, release 10.1, V10.1.168 GCC: gcc (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0 PyTorch: 1.6.0+cu101 PyTorch compiling details: PyTorch built with:

GCC 7.3

C++ Version: 201402

Intel(R) Math Kernel Library Version 2019.0.5 Product Build 20190808 for Intel(R) 64 architecture applications

Intel(R) MKL-DNN v1.5.0 (Git Hash e2ac1fac44c5078ca927cb9b90e1b3066a0b2ed0)

OpenMP 201511 (a.k.a. OpenMP 4.5)

NNPACK is enabled

CPU capability usage: AVX2

CUDA Runtime 10.1

NVCC architecture flags: -gencode;arch=compute_37,code=sm_37;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75

CuDNN 7.6.3

Magma 2.5.2

Build settings: BLAS=MKL, BUILD_TYPE=Release, CXX_FLAGS= -Wno-deprecated -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -fopenmp -DNDEBUG -DUSE_FBGEMM -DUSE_QNNPACK -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DUSE_VULKAN_WRAPPER -O2 -fPIC -Wno-narrowing -Wall -Wextra -Werror=return-type -Wno-missing-field-initializers -Wno-type-limits -Wno-array-bounds -Wno-unknown-pragmas -Wno-sign-compare -Wno-unused-parameter -Wno-unused-variable -Wno-unused-function -Wno-unused-result -Wno-unused-local-typedefs -Wno-strict-overflow -Wno-strict-aliasing -Wno-error=deprecated-declarations -Wno-stringop-overflow -Wno-error=pedantic -Wno-error=redundant-decls -Wno-error=old-style-cast -fdiagnostics-color=always -faligned-new -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, PERF_WITH_AVX512=1, USE_CUDA=ON, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_STATIC_DISPATCH=OFF,

TorchVision: 0.7.0+cu101 OpenCV: 4.5.3 MMCV: 1.3.8 MMCV Compiler: GCC 7.3 MMCV CUDA Compiler: 10.1 MMDetection: 2.14.0+7b5a58b Error traceback SWAHook’object has no attribute 'save_checkpoint'
opened by krisandchris 1
problem in get_swa_model.py

Hi, @hyz-xmaster . Thanks for releasing the code.

When using your utils to get the avg model get_swa_model.py, I've got a problem. For example, if I want to avg chkpts of 13-24 epochs, I would intuitively pass starting=13 and ending=24 in args. However, the code actually gave me the avg of 13-23, because of this line: https://github.com/hyz-xmaster/swa_object_detection/blob/2feb86790a41f91c3167fb282ba47dc295c65658/swa/get_swa_model.py#L28

I think it would be better to use ending_id + 1 instead of ending_id for easier understanding.

opened by lkct 1
compare traditional training result with swa result at same epoch level
nice work to make swa work in object detection! i have one question about same epoch level comparison. the result looks like faster rcnn r50 1x + 1x swa extra training get same result as faster rcnn r50 2x? i think maybe some problem.

faster rcnn r50 1x + 1x swa extra training use cyclic training, but origin faster rcnn r50 2x use step down lr training. this mismatch may lead to differenct converge. i think the best way is to train models from scratch with cyclic training to get a fair comaprison.

swa needs to change batch norm param to match average weight. frozen bn may harm the final ensemble result
opened by Haijunlv 3
question about BN fronze

@hyz-xmaster hi， thank you for your great job. I have read your paper, and it said that batch normalization layers in backbones are frozen. My question is that you just frozen BN in backbone or you fronzen all the layer in backbone.

opened by yongjingli 1

Owner

GitHub

Yolo object detection - Yolo object detection with python

How to run download required files make build_image make download Docker versio

3 Jan 26, 2022

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

MOT-Tracking-by-Detection-Pipeline Tracking-by-Detection形式のMOT(Multi Object Trac

41 Nov 23, 2022

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera. This project prepares training and testing data for various deep learning projects such as 6D object pose estimation projects singleshotpose, as well as object detection and instance segmentation projects.

305 Dec 16, 2022

Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

This is the official PyTorch implementation of our paper: "Joint Object Detection and Multi-Object Tracking with Graph Neural Networks". Our project website and video demos are here.

443 Dec 6, 2022

CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection

CLOCs is a novel Camera-LiDAR Object Candidates fusion network. It provides a low-complexity multi-modal fusion framework that improves the performance of single-modality detectors. CLOCs operates on the combined output candidates of any 3D and any 2D detector, and is trained to produce more accurate 3D and 2D detection results.

254 Dec 16, 2022

Object Detection and Multi-Object Tracking

1.6k Jan 4, 2023

Object tracking and object detection is applied to track golf puts in real time and display stats/games.

Putting_Game Object tracking and object detection is applied to track golf puts in real time and display stats/games. Works best with the Perfect Prac

1 Dec 29, 2021

Auto-Lama combines object detection and image inpainting to automate object removals

Auto-Lama Auto-Lama combines object detection and image inpainting to automate object removals. It is build on top of DE:TR from Facebook Research and

44 Dec 9, 2022

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

Data-Annotation-Tool How to Run this Tool? To run this software, follow the steps: git clone https://github.com/Autonomous-Car-Project/Data-Annotation

13 Aug 18, 2022

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

赛题背景在商品知识产权领域，知识产权体现为在线商品的设计和品牌。不幸的是，在每一天，存在着非法商户通过一些对抗手段干扰商标识别来逃避侵权，这带来了很高的知识产权风险和财务损失。为了促进先进的多媒体人工智能技术的发展，以保护企业来之不易的创作和想法免受恶意使用和剽窃，因此提出了鲁棒性标识检测挑战赛

65 Dec 22, 2022

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. Skin Lesion detection using YOLO This project deal

1 Nov 22, 2021

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

SiamMOT: Siamese Multi-Object Tracking

432 Dec 17, 2022

SWA Object Detection

Related tags

Overview

SWA Object Detection

Introduction

Updates

Installation

Usage of MMDetection

Instructions

Results and Models

Contributing

Citation

Acknowledgment

License

References

Comments

Owner

Yolo object detection - Yolo object detection with python

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection

Object Detection and Multi-Object Tracking

Object tracking and object detection is applied to track golf puts in real time and display stats/games.

Auto-Lama combines object detection and image inpainting to automate object removals

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning (CoRL 2021)

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Detectron2 is FAIR's next-generation platform for object detection and segmentation.