This repo is customized for VisDrone.

Last update: Jul 17, 2022

Related tags

Deep Learning tensorflow object-detection fpn cascade-rcnn visdrone model-ensemble

Overview

Object Detection for VisDrone (object detection in UAV aerial images)

My environment

1. Windows 10 (Linux also works)
2. tensorflow >= 1.12.0
3. python 3.6 (anaconda)
4. cv2
5. ensemble-boxes (pip install ensemble-boxes)

Datasets (XML format for training set)

(1) The datasets are available at https://github.com/VisDrone/VisDrone-Dataset
(2) Download the XML annotations from Baidu Yun (extraction code: ia3f) or Google Drive, and configure them in ./core/config/cfgs.py
(3) Alternatively, generate the VisDrone XML files yourself with ./data/visdrone2xml.py after modifying the path information in that script.
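
The conversion in step (3) can be sketched as follows. This is a minimal illustration, not the code in ./data/visdrone2xml.py: it assumes the standard VisDrone-DET txt layout (`left,top,width,height,score,category,truncation,occlusion` per line) and a Pascal VOC-style XML target. The function name `visdrone_to_voc_xml` and the choice to skip categories 0 (ignored regions) and 11 (others) are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

# VisDrone-DET category ids. Ids 0 ("ignored regions") and 11 ("others") are
# skipped here; that is a choice made for this sketch and may differ from
# what ./data/visdrone2xml.py actually does.
CLASSES = {
    1: "pedestrian", 2: "people", 3: "bicycle", 4: "car", 5: "van",
    6: "truck", 7: "tricycle", 8: "awning-tricycle", 9: "bus", 10: "motor",
}


def visdrone_to_voc_xml(txt, filename, width, height):
    """Convert the text of one VisDrone annotation file to a VOC-style XML string."""
    root = ET.Element("annotation")
    ET.SubElement(root, "filename").text = filename
    size = ET.SubElement(root, "size")
    ET.SubElement(size, "width").text = str(width)
    ET.SubElement(size, "height").text = str(height)
    ET.SubElement(size, "depth").text = "3"

    for line in txt.splitlines():
        # Some annotation lines end with a trailing comma, so drop empty fields.
        fields = [f for f in line.strip().split(",") if f]
        if len(fields) < 8:
            continue  # blank or malformed line
        left, top, w, h, _score, category, truncation, _occlusion = map(int, fields[:8])
        if category not in CLASSES or w <= 0 or h <= 0:
            continue
        obj = ET.SubElement(root, "object")
        ET.SubElement(obj, "name").text = CLASSES[category]
        ET.SubElement(obj, "truncated").text = str(truncation)
        ET.SubElement(obj, "difficult").text = "0"
        box = ET.SubElement(obj, "bndbox")
        # VOC stores corner coordinates, VisDrone stores top-left plus size.
        ET.SubElement(box, "xmin").text = str(left)
        ET.SubElement(box, "ymin").text = str(top)
        ET.SubElement(box, "xmax").text = str(left + w)
        ET.SubElement(box, "ymax").text = str(top + h)
    return ET.tostring(root, encoding="unicode")
```

The image width and height are passed in by the caller (for example after reading the image with cv2), since the VisDrone txt files do not contain them.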

training-set format:

├── VisDrone2019-DET-train
│     ├── Annotation(xml format)
│     ├── JPEGImages

Pretrained Models (ResNet50vd, ResNet101vd)

Download the pretrained models from Baidu Yun (extraction code: krce) or Google Drive, then put them into ./data/pretrained_weights

Train

Modify the parameters in ./core/config/cfgs.py
python train_step.py

Eval

Modify the parameters in ./core/config/cfgs.py
Run python eval_visdrone.py to write the results as txt files, then evaluate them with the official MATLAB tools to get the final results.
Run python eval_model_ensemble.py for the model ensemble. Before running it, set NORMALIZED_RESULTS_FOR_MODEL_ENSEMBLE=True in cfgs.py and run eval_visdrone.py to produce the normalized txt results.
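
The normalized results mentioned above are likely needed because `weighted_boxes_fusion` from the ensemble-boxes package expects boxes as `[x1, y1, x2, y2]` with coordinates scaled to the range [0, 1]. The sketch below shows one way such a conversion could look; the helper names `normalize_box` and `denormalize_box` are hypothetical, and the exact txt layout written by eval_visdrone.py is not documented here.

```python
def normalize_box(left, top, width, height, img_w, img_h):
    """Pixel (left, top, width, height) -> normalized [x1, y1, x2, y2] in [0, 1]."""
    def clip(v):
        return min(max(v, 0.0), 1.0)

    x1 = clip(left / img_w)
    y1 = clip(top / img_h)
    x2 = clip((left + width) / img_w)
    y2 = clip((top + height) / img_h)
    return [x1, y1, x2, y2]


def denormalize_box(box, img_w, img_h):
    """Normalized [x1, y1, x2, y2] -> pixel (left, top, width, height)."""
    x1, y1, x2, y2 = box
    return (x1 * img_w, y1 * img_h, (x2 - x1) * img_w, (y2 - y1) * img_h)
```

Per image, the normalized boxes of each model would then be collected into `boxes_list`, `scores_list`, and `labels_list` and passed to `weighted_boxes_fusion(boxes_list, scores_list, labels_list, weights=..., iou_thr=..., skip_box_thr=...)`, after which the fused boxes are converted back to pixels for the VisDrone toolkit.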

Visualization

Modify the parameters in ./core/config/cfgs.py
Run python image_demo.py to produce the visualized results.

Visualized Result (multi-scale training+multi-scale testing)

Test Result (Validation set):

1. ResNet50-vd

Name	maxDets	Result(s/m)
Average Precision (AP) @( IoU=0.50:0.95)	maxDets=500	31.26%/35.1%
Average Precision (AP) @( IoU=0.50 )	maxDets=500	56.44%/60.29%
Average Precision (AP) @( IoU=0.75 )	maxDets=500	30.13%/35.42%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 1	0.78%/0.58%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 10	6.62%/6.05%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=100	38.21%/40.99%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=500	48.41%/53%

"s" means single-scale training + single-scale testing; "m" means multi-scale training + multi-scale testing.
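
The metric names in these tables follow the COCO-style convention: "IoU=0.50:0.95" means the score is averaged over ten IoU thresholds from 0.50 to 0.95 in steps of 0.05, and maxDets is the maximum number of detections counted per image. As a small illustration (not the official toolkit code), IoU between two boxes and the threshold list can be computed like this:

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Width and height of the overlap are zero when the boxes do not intersect.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# The ten thresholds behind "IoU=0.50:0.95": 0.50, 0.55, ..., 0.95.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]
```

A detection counts as a true positive at a given threshold only if its IoU with a ground-truth box of the same class reaches that threshold, which is why AP at IoU=0.75 is much lower than AP at IoU=0.50.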

2. ResNet101-vd

Name	maxDets	Result(s/m)
Average Precision (AP) @( IoU=0.50:0.95)	maxDets=500	31.7%/35.98%
Average Precision (AP) @( IoU=0.50 )	maxDets=500	56.94%/61.64%
Average Precision (AP) @( IoU=0.75 )	maxDets=500	30.59%/36.13%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 1	0.67%/0.61%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 10	6.29%/6.13%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=100	38.66%/42.33%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=500	49.29%/53.68%

3. Model Ensemble (ResNet101-vd+ResNet50-vd)

Name	maxDets	Result
Average Precision (AP) @( IoU=0.50:0.95)	maxDets=500	36.76%
Average Precision (AP) @( IoU=0.50 )	maxDets=500	62.33%
Average Precision (AP) @( IoU=0.75 )	maxDets=500	37.41%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 1	0.59%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets= 10	6.06%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=100	42.57%
Average Recall (AR) @( IoU=0.50:0.95)	maxDets=500	54.53%

You can download the trained weights (ResNet50vd, ResNet101vd) from Baidu Yun (extraction code: 9u9m) or Google Drive, then put them into ./saved_weights

Reference

1. https://github.com/DetectionTeamUCAS/Faster-RCNN_Tensorflow
2. https://github.com/open-mmlab/mmdetection
3. https://github.com/ZFTurbo/Weighted-Boxes-Fusion
4. https://github.com/kobiso/CBAM-tensorflow-slim
5. https://github.com/SJTU-Thinklab-Det/DOTA-DOAI
6. https://github.com/Viredery/tf-eager-fasterrcnn
7. https://github.com/VisDrone/VisDrone2018-DET-toolkit
8. https://github.com/YunYang1994/tensorflow-yolov3
9. https://github.com/zhpmatrix/VisDrone2018
