YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Adam Van Etten

Last update: Jan 1, 2023

Related tags

Deep Learning yoltv5

Overview

YOLTv5

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks.

YOLTv5 builds upon YOLT and SIMRDWN, and updates these frameworks to use the YOLOv5 version of the YOLO object detection family. This repository has generally similar performance to the Darknet-based YOLTv4 repository. For those users who prefer a PyTorch backend, however, we provide YOLTv5.

Below, we provide examples of how to use this repository with the open-source SpaceNet dataset.

Running YOLTv5

0. Installation (Preliminary)

YOLTv5 is built to execute on a GPU-enabled machine.

cd yoltv5/yolov5
pip install -r requirements.txt 

# update with geo packages
conda install -c conda-forge gdal
conda install -c conda-forge osmnx=0.12 
conda install  -c conda-forge scikit-image
conda install  -c conda-forge statsmodels
pip install torchsummary
pip install utm
pip install numba
pip install jinja2==2.10

1. Train

Training preparation is accomplished via prep_train.py. To train a model, run:

cd /yoltv5
python yolov5/train.py --img 640 --batch 16 --epochs 100 --data yoltv5_train_vehicles_8cat.yaml --weights yolov5l.pt

2. Test

Simply edit yoltv5_test_vehicles_8cat.yaml to point to the appropriate locations, then run the test.sh script:

cd yoltv5
./test.sh ../configs/yoltv5_test_vehicles_8cat.yaml

Outputs will look something like the figure below:

You might also like...

A public available dataset for road boundary detection in aerial images

Topo-boundary This is the official github repo of paper Topo-boundary: A Benchmark Dataset on Topological Road-boundary Detection Using Aerial Images

79 Jan 4, 2023

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

ReDet: A Rotation-equivariant Detector for Aerial Object Detection ReDet: A Rotation-equivariant Detector for Aerial Object Detection (CVPR2021), Jiam

334 Dec 23, 2022

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

UAV-Human Official repository for CVPR2021: UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicle Paper arXiv Res

129 Jan 4, 2023

An algorithm that handles large-scale aerial photo co-registration, based on SURF, RANSAC and PyTorch autograd.

41 Oct 29, 2022

A vision library for performing sliced inference on large images/small objects

SAHI: Slicing Aided Hyper Inference A vision library for performing sliced inference on large images/small objects Overview Object detection and insta

2.3k Jan 4, 2023

Pixel-level Crack Detection From Images Of Levee Systems : A Comparative Study

PIXEL-LEVEL CRACK DETECTION FROM IMAGES OF LEVEE SYSTEMS : A COMPARATIVE STUDY G

2 Jul 23, 2022

moving object detection for satellite videos.

DSFNet: Dynamic and Static Fusion Network for Moving Object Detection in Satellite Videos Algorithm Introduction DSFNet: Dynamic and Static Fusion Net

39 Dec 16, 2022

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

MMdnn MMdnn is a comprehensive and cross-framework tool to convert, visualize and diagnose deep learning (DL) models. The "MM" stands for model manage

5.7k Jan 9, 2023

Bulk2Space is a spatial deconvolution method based on deep learning frameworks

Bulk2Space Spatially resolved single-cell deconvolution of bulk transcriptomes using Bulk2Space Bulk2Space is a spatial deconvolution method based on

60 Dec 27, 2022

Comments

No module named 'eval'

I am trying to run yoltv5 on Anaconda - Windows and using jupyter notebook. ran the requirements.txt file and followed the readme. I was able to train yoltv5. The code filled the ../runs/train/ directory

However, I tried the test script !python test.py C:\Users\Documents\Projects\code\yoltv5\configs\test.yaml and got an error

Traceback (most recent call last): File "C:\Users\Documents\Projects\code\yoltv5\yoltv5\test.py", line 53, in import eval ModuleNotFoundError: No module named 'eval'

I couldn't find an eval.py file, what am I missing ?

Thanks in advance

opened by vmve 0
How did you switch the detection framework of YOLOV5 to YOLT?

Thank you for your work. I have some questions about your project code: I looked at the model you called YoloV5 in the middle, but did not see the operation about model modification mentioned in your article (modifying the image network structure, such as modifying the Stride size, upsampling the image, and Ensemble). The most important thing is that you do not read the model. yaml configuration file like YOLOv5, how did you switch the detection framework of YOLOV5 to YOLT?

opened by Scheaven 0
Extract Spatial Coordinate

How can i apply YOLOt on Geo Tiff satellite images, and extract the real x,y coordinate for the bounding box ?! Not just deal with the image an normal png or jpg image !

opened by AhmedHefnawy 0

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Related tags

Overview

YOLTv5

Running YOLTv5

0. Installation (Preliminary)

1. Train

2. Test

You might also like...

A public available dataset for road boundary detection in aerial images

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

An algorithm that handles large-scale aerial photo co-registration, based on SURF, RANSAC and PyTorch autograd.

A vision library for performing sliced inference on large images/small objects

Pixel-level Crack Detection From Images Of Levee Systems : A Comparative Study

moving object detection for satellite videos.

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

Bulk2Space is a spatial deconvolution method based on deep learning frameworks

Comments

No module named 'eval'

How did you switch the detection framework of YOLOV5 to YOLT?

Extract Spatial Coordinate

Owner

Adam Van Etten

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

Learning Calibrated-Guidance for Object Detection in Aerial Images

Tiny Object Detection in Aerial Images.

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from yolov5 and the size of model is only 1.7M (int8) and 3.3M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B when the input size is 320×320~

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

a practicable framework used in Deep Learning. So far UDL only provide DCFNet implementation for the ICCV paper (Dynamic Cross Feature Fusion for Remote Sensing Pansharpening)

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

An image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testingAn image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testing