PartImageNet is a large, high-quality dataset with part segmentation annotations

Ju He

Last update: Nov 30, 2022


Overview

PartImageNet: A Large, High-Quality Dataset of Parts

We will release our dataset and scripts soon after cleaning and approval.

Introduction

PartImageNet is a large, high-quality dataset with part segmentation annotations. It consists of 158 classes from ImageNet with approximately 24,000 images. The classes are grouped into 11 super-categories, and the part splits are designed per super-category as shown below. The number in parentheses after each category name is the number of classes in that category.

Category	Annotated Parts
Quadruped (46)	Head, Body, Foot, Tail
Biped (17)	Head, Body, Hand, Foot, Tail
Fish (10)	Head, Body, Fin, Tail
Bird (14)	Head, Body, Wing, Foot, Tail
Snake (15)	Head, Body
Reptile (20)	Head, Body, Foot, Tail
Car (23)	Body, Tire, Side Mirror
Bicycle (6)	Head, Body, Seat, Tire
Boat (4)	Body, Sail
Aeroplane (2)	Head, Body, Wing, Engine, Tail
Bottle (5)	Body, Mouth
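
The table above can also be written as a small lookup structure. The following is a minimal sketch, not part of the released scripts: the names `PARTS_BY_SUPERCATEGORY` and `count_part_labels` are hypothetical, the part names are transcribed from the table (the wheel part is spelled "Tire" here), and the real label names in the annotation files may differ.

```python
# Hypothetical lookup table transcribed from the category table above.
# Keys are super-categories; values are the annotated parts for each.
PARTS_BY_SUPERCATEGORY = {
    "Quadruped": ["Head", "Body", "Foot", "Tail"],
    "Biped": ["Head", "Body", "Hand", "Foot", "Tail"],
    "Fish": ["Head", "Body", "Fin", "Tail"],
    "Bird": ["Head", "Body", "Wing", "Foot", "Tail"],
    "Snake": ["Head", "Body"],
    "Reptile": ["Head", "Body", "Foot", "Tail"],
    "Car": ["Body", "Tire", "Side Mirror"],
    "Bicycle": ["Head", "Body", "Seat", "Tire"],
    "Boat": ["Body", "Sail"],
    "Aeroplane": ["Head", "Body", "Wing", "Engine", "Tail"],
    "Bottle": ["Body", "Mouth"],
}


def count_part_labels(parts_by_category):
    """Count (super-category, part) pairs.

    Parts with the same name in different super-categories
    (e.g. Quadruped Head and Bird Head) are counted separately.
    """
    return sum(len(parts) for parts in parts_by_category.values())


if __name__ == "__main__":
    print(len(PARTS_BY_SUPERCATEGORY))                # 11 super-categories
    print(count_part_labels(PARTS_BY_SUPERCATEGORY))  # 40 part labels
```

Counting each (super-category, part) pair as its own label gives 40 labels. This equals the number of unique 'category_id' values reported in the comments below, which suggests, but does not confirm, that each 'category_id' identifies one such pair.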

The statistics of the train/val/test splits are shown below.

Split	Number of classes	Number of images
Train	109	16540
Val	19	2957
Test	30	4598
Total	158	24095
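
As a quick consistency check on the split table, the per-split counts can be summed and turned into fractions. This is an illustrative sketch only: `SPLITS` and `split_fractions` are hypothetical names, and the numbers are copied from the table above.

```python
# Numbers copied from the split table above; names are hypothetical.
SPLITS = {
    "train": {"classes": 109, "images": 16540},
    "val": {"classes": 19, "images": 2957},
    "test": {"classes": 30, "images": 4598},
}


def split_fractions(splits):
    """Return the share of all images that falls into each split."""
    total = sum(s["images"] for s in splits.values())
    return {name: s["images"] / total for name, s in splits.items()}


total_classes = sum(s["classes"] for s in SPLITS.values())
total_images = sum(s["images"] for s in SPLITS.values())

if __name__ == "__main__":
    print(total_classes, total_images)  # 158 24095
    for name, frac in split_fractions(SPLITS).items():
        print(f"{name}: {frac:.1%}")
```

The class counts add up to 158 and the image counts to 24095, matching the Total row. That the class counts sum exactly to the dataset total is consistent with the splits being disjoint by class, although the table alone does not state this.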

For more detailed statistics, please check out our paper.

Possible Usage

PartImageNet has broad potential and can benefit numerous research fields, although in the paper we only explore its usage in Part Discovery, Few-shot Learning and Semantic Segmentation. We hope that by proposing PartImageNet, we can attract more attention to part-based models and inspire more interesting work. We will release our implementation later as well.

Example Figures


Comments

What is the meaning of 'category_id' in each annotation?

I notice that each annotation has a field named 'category_id'. The total number of unique 'category_id' values is 40, consistently in 'train.json' and 'test.json'. I wonder whether 'category_id' is the id of the object part, but the original paper does not seem to mention it.

opened by hqhQAQ 1

How can I get the part semantic segmentation mask?

When I convert an object segment into a semantic mask using the COCO API, I find some intersecting areas between different objects. How should I resolve the overlapping areas when generating semantic mask labels? Thanks for your reply.

opened by xushilin1 0

When will you provide a script to process images?

This is excellent work! Using cocoapi (https://github.com/cocodataset/cocoapi), I can get this segmentation, but I am not sure how to get only the head of the ibex.

opened by giangnguyen2412 0
