Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

Ibai Gorordo

Last update: Nov 14, 2022

Related tags

Deep Learning python opencv computer-vision imagenet object-detection onnx object-localization class-agnostic-detection

Overview

ONNX-ImageNet-1K-Object-Detector

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX. The repository combines a class agnostic object localizer to first detect the objects in the image, and next a ResNet50 model trained on ImageNet is used to label each box.

Original image: https://commons.wikimedia.org/wiki/File:Il_cuore_di_Como.jpg

Why

There are a lot of object detection models, but since most of them are trained in the COCO dataset, most of them can only detect a maximum of 80 classes. This repository proposes a "quick and dirty" solution to be able to detect the 1000 objects available in the ImageNet dataset.

❗ Important ❗

This model uses a lightweight class agnostic object localizer to first detect the objects. Therefore, this repository is not going to behave as well as other object detection models in complex scenes. In those cases, the object localizer will fail quickly and therefore no objects will be detected.
The ResNet50 clasifier is fast in a desktop GPU, however, since it needs to run for each of the detected boxes, the performance might be affected for images with many objects.

Requirements

Check the requirements.txt file.

Installation

pip install -r requirements.txt

ONNX model

Class Agnostic Object Localizer: The original model from TensorflowHub (link at the bottom) was converted to different formats (including .onnx) by PINTO0309, the models can be found in his repository. This repository will automatically download the model if the model is not found in the models folder.
ResNet50 Classifier: The original model from PaddleClas (link at the bottom) was converted to ONNX format using a similar procedure as the one described in this article by PINTO0309. This repository will automatically download the model.

How to use

Image inference:

python image_object_detection.py

Video inference:

python video_object_detection.py

Webcam inference:

python video_object_detection.py

Examples

Macaque Detection

Original image: https://commons.wikimedia.org/wiki/File:Onsen_Monkey.JPG

Christmas Stocking Detection

Original image: https://unsplash.com/photos/paSqTlm3DsA

Burrito Detection

Original image: https://commons.wikimedia.org/wiki/File:Breakfast_burrito_(cropped).jpg

Bridge Detection

Original image: https://commons.wikimedia.org/wiki/File:Bayonne_Bridge_Collins_Pk_jeh-2.JPG

[Inference video Example]

1k.detector.output_Trim.mp4

Original video: https://www.pexels.com/video/a-medusa-jellyfish-swimming-gracefully-underwater-2731905/ (by Vova Krasilnikov)

References

Original Class Agnostic Object Localizer: https://tfhub.dev/google/object_detection/mobile_object_localizer_v1/1
Original Resnet50 (ResNet50_vd_ssld) Classifier from PaddleClass: https://github.com/PaddlePaddle/PaddleClas/blob/release/2.3/docs/zh_CN/algorithm_introduction/ImageNet_models.md
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
PaddlePaddle to ONNX conversion article: https://zenn.dev/pinto0309/scraps/cf319db8fea4c3

Comments

tarfile.ReadError: not a gzip file

Hey tried you object detector and got following error:

File "ONNX-ImageNet-1K-Object-Detector-main\detector1K\utils.py", line 15, in download_gdrive_tar_model tar = tarfile.open("tmp/tmp.tar.gz", "r:gz") File "AppData\Local\Programs\Python\Python39\lib\tarfile.py", line 1629, in open return func(name, filemode, fileobj, **kwargs) File "AppData\Local\Programs\Python\Python39\lib\tarfile.py", line 1686, in gzopen raise ReadError("not a gzip file") tarfile.ReadError: not a gzip file

any idea what might be the problem?

opened by Bernie-R 1

noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

ProSelfLC: CVPR 2021 ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks For any specific discussion or potential fu

57 Dec 4, 2022

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

ONNX msg_chn_wacv20 depth completion Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20 model in

19 Oct 22, 2022

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Overview This is a set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI. Make TFRecords To run t

8 Nov 1, 2022

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS. It currently supports four examples for you to quickly experience the power of ONNX Runtime Web.

58 Dec 18, 2022

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

PINTO_model_zoo Please read the contents of the LICENSE file located directly under each folder before using the model. My model conversion scripts ar

2.4k Jan 5, 2023

An executor that loads ONNX models and embeds documents using the ONNX runtime.

ONNXEncoder An executor that loads ONNX models and embeds documents using the ONNX runtime. Usage via Docker image (recommended) from jina import Flow

2 Mar 15, 2022

A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB, or simply to separate onnx files to any size you want.

sne4onnx A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB, or

10 Aug 30, 2022

Simple ONNX operation generator. Simple Operation Generator for ONNX.

sog4onnx Simple ONNX operation generator. Simple Operation Generator for ONNX. https://github.com/PINTO0309/simple-onnx-processing-tools Key concept V

6 May 15, 2022

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

sam4onnx A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

Related tags

Overview

ONNX-ImageNet-1K-Object-Detector

Why

❗ Important ❗

Requirements

Installation

ONNX model

How to use

Examples

Macaque Detection

Christmas Stocking Detection

Burrito Detection

Bridge Detection

[Inference video Example]

References

You might also like...

noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

An executor that loads ONNX models and embeds documents using the ONNX runtime.

A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB, or simply to separate onnx files to any size you want.

Simple ONNX operation generator. Simple Operation Generator for ONNX.

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Comments

tarfile.ReadError: not a gzip file

Owner

Ibai Gorordo

ONNX-PackNet-SfM: Python scripts for performing monocular depth estimation using the PackNet-SfM model in ONNX

Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Python scripts for performing lane detection using the LSTR model in ONNX

Python scripts for performing road segemtnation and car detection using the HybridNets multitask model in ONNX.

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

Python scripts form performing stereo depth estimation using the HITNET model in ONNX.

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

Example scripts for the detection of lanes using the ultra fast lane detection model in ONNX.