ObjectDrawer-ToolBox: a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system

Last update: Jan 5, 2023

Related tags

Deep Learning ObjectDrawer-ToolBox

Overview

ObjectDrawer-ToolBox

ObjectDrawer-ToolBox is a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system, Object Drawer.

Prerequisites

Python 3.8
ffmpeg

Requirements

opencv-python
ffmpeg-python==0.2.0

Getting Started

Given the video captured by conventional recording devices, the ObjectDrawer-ToolBox is used to sample frames from the video and label the ground plane masks. In the ground plane mask task, you need to label 3 images and only keep ground pixels in these images.

Please run

python label_image.py \
  --video_path /path/to/your/videos.mp4

How to annotate ground plane

Draw a polygon to cover the pixels area which are not belong to groud plane. Tips: red / black line denotes a unfinished / finished polygon.
Press w to delete non-ground pixels when finished a polygon drawing.
Repeat the step 1 & 2, until there are only ground plane areas in the image.

Operation Tips:

Press w to delete pixels.
Press d to finish current annotation and start to label next image.
Press esc to clear candidate regions

select pixels

delete pixels

Annotation result

After ground plane annotation, a zip file named "label_${videos}.zip" is generated in same directory as the input video.
Upload the video ("videos.mp4") and zip file on the website of Object Drawer.

Examples

To clarify the label data format, we provided examples video & label file. You can download these data and submit to the Object Drawer website for testing.

Name	Video	Label File
multi-seat sofa	download (.mp4)	download (.zip)
single sofa	download (.mov)	download (.zip)

Acknowledgements

Thanks to PlenOctree for the octree converter and online viewer. We take U2NeT as the segmentation algorithm. Please consider citing their papers and following their license.

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo This repository includes the source code for our CVPR 2021 paper on multi-view mult

66 Jan 4, 2023

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity, such as gratings, photonic-crystal slabs, metasurfaces, surface-emitting lasers, nano-antennas, and more.

17 Dec 19, 2022

Weakly Supervised 3D Object Detection from Point Cloud with Only Image Level Annotation

SCCKTIM Weakly Supervised 3D Object Detection from Point Cloud with Only Image-Level Annotation Our code will be available soon. The class knowledge t

1 Nov 12, 2021

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

Data-Annotation-Tool How to Run this Tool? To run this software, follow the steps: git clone https://github.com/Autonomous-Car-Project/Data-Annotation

13 Aug 18, 2022

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction TSDF++ is a novel multi-object TSDF formulation that can encode mult

130 Dec 29, 2022

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

ManhattanSLAM Authors: Raza Yunus, Yanyan Li and Federico Tombari ManhattanSLAM is a real-time SLAM library for RGB-D cameras that computes the camera

117 Dec 28, 2022

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction (CVPRW 2022) Yuanhao Cai, Jing Lin, Zudi Lin, Haoqian Wang, Yulun Z

274 Jan 5, 2023

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

About This repository provides data and code for the paper: Scalable Data Annotation Pipeline for High-Quality Large Speech Datasets Development (subm

86 Dec 7, 2022

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

CoReNet CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image. It produces coherent reconstructions, where all objec

80 Dec 25, 2022

ObjectDrawer-ToolBox: a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system

Related tags

Overview

ObjectDrawer-ToolBox

Prerequisites

Requirements

Getting Started

How to annotate ground plane

Annotation result

Examples

Acknowledgements

You might also like...

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

Weakly Supervised 3D Object Detection from Point Cloud with Only Image Level Annotation

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

Owner

PointCloud Annotation Tools, support to label object bound box, ground, lane and kerb

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

This project aim to create multi-label classification annotation tool to boost annotation speed and make it more easier.

A graphical Semi-automatic annotation tool based on labelImg and Yolov5

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning (CoRL 2021)

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving

[ICCV'21] PlaneTR: Structure-Guided Transformers for 3D Plane Recovery