Yolov5+SlowFast: Realtime Action Detection
A realtime action detection framework based on PyTorchVideo.
Here are some details about our modification:
- We use yolov5 as the object detector instead of detectron2; it is faster and more convenient.
- We use a tracker (deepsort) to allocate action labels to all objects (with the same IDs) across frames.
- Processing speed reaches 24.2 FPS at an inference batch size of 30 (on a single RTX 2080Ti GPU).
Relevant information: FAIR/PytorchVideo; Ultralytics/Yolov5
Demo comparison between the original (left) and ours (right).
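The tracker-based label allocation mentioned above can be sketched roughly as follows. This is an illustrative outline only, assuming per-clip action predictions keyed by deepsort track ID; `allocate_actions` and the tuple layout are hypothetical, not this repository's actual API:

```python
from typing import Dict, List, Tuple

# A detection: (track_id, bbox). SlowFast produces one action label per
# tracked object per clip; the tracker's stable IDs let that label be
# copied onto every frame in which the object appears.
Detection = Tuple[int, Tuple[int, int, int, int]]

def allocate_actions(
    frames: List[List[Detection]],
    clip_actions: Dict[int, str],
) -> List[List[Tuple[int, Tuple[int, int, int, int], str]]]:
    """Attach the per-ID action label to every detection of that ID."""
    labelled = []
    for dets in frames:
        labelled.append(
            [(tid, box, clip_actions.get(tid, "unknown")) for tid, box in dets]
        )
    return labelled
```

With this scheme an object keeps its action label in every frame of the clip, even in frames where the action model itself was not run.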
Installation
- Create a new Python environment:
  conda create -n env_name python=3.7.11
- Install the requirements:
  pip install -r requirements.txt
- Download the weights file (ckpt.t7) from [deepsort] into this folder:
  ./deep_sort/deep_sort/deep/checkpoint/
- Test on your own video:
  python yolo_slowfast.py --input {path to your video}
The first run of this command may take some time, because the yolov5 code and its weights file are downloaded from torch.hub; keep your network connected.
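Once downloaded, torch.hub keeps the yolov5 repository and weights in a local cache, so later runs work offline. A minimal sketch of how that cache directory is resolved (an approximation of torch.hub's documented defaults: `$TORCH_HOME`, else `$XDG_CACHE_HOME/torch`, else `~/.cache/torch`, with `hub` appended; the helper name is ours):

```python
import os

def torch_hub_cache_dir() -> str:
    """Approximate where torch.hub stores downloaded repos and weights."""
    torch_home = os.environ.get(
        "TORCH_HOME",
        os.path.join(
            os.environ.get("XDG_CACHE_HOME", os.path.expanduser("~/.cache")),
            "torch",
        ),
    )
    return os.path.join(torch_home, "hub")
```

If the download is interrupted, deleting this directory and rerunning the command forces a clean fetch.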
References
Thanks to these great works:
[1] ZQPei/deepsort
[2] AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. paper
[3] SlowFast Networks for Video Recognition. paper
Citation
If you find our work useful, please cite it as follows:
@misc{yolo_slowfast,
  author = {Wu Fan},
  title = {A realtime action detection framework based on PyTorchVideo},
  year = {2021},
  url = {\url{https://github.com/wufan-tb/gmm_dae}}
}
}