Le dataset des images du projet d'IA de 2021

Last update: Nov 15, 2021

Related tags

Deep Learning face-mask-dataset-ilc-2021

Overview

face-mask-dataset-ilc-2021

Le dataset des images du projet d'IA de 2021, Indiquez vos id git dans la issue pour les droits

TL;DR:

Choisir 200 images JPEG avec environ 1/3 sans masque, 1/3 avec masque, et 1/3 mal mis
Renommer les images avec le hash md5 du fichier
Annoter avec labelimg (ou autre pour fichier xml au format PASCAL-VOC)
commit sur votre branch "contrib_NOM1_NOM2"
Une fois toutes les images annotées, => Pull requests vers branche VALID
Le discord ILC est pratique pour échanger

1. Répartition

Les images sont repertoriées en 3 catégories :

"with_mask", un masque correctment porté et qui recouvre la bouche et le nez
"with_incorrect_mask", un masque porté sous le nez, ou de facon pas très covid-friendly
"without_mask, Un visage sans masque

Le dataset doit faire environ 2300 images qui répartit par 23 doit donner environ 100 images à annoter par personne

2. Gestion des images

Les images doivent être traitées de la sorte :

Le nom correspond au md5sum du fichier
Les masques rajoutés en mode photoshop sont à proscrire pour des raisons de performances
on recherche les images similaires par exemple à l’aide du script python compare_images
La répartition des images doivent être équilibrés (environ le même nombre d'image dans chaque catégorie à 100 images près)

3. Pour commit

L'idée va être d'avoir une branche "VALID" pour ajouter toutes les images en attentes de validation et de ne garder la branche "main" que pour le résultat final. Pensez à bien mettre renseigner vos avancés dans vos commits et pull request. -> Chaque binome ajoutera sur sa branche "contrib_NOM1_NOM2", et on effectuera un pull request vers la branche "VALID" une fois les 200 images ajoutées et annotées

4. Les outils qui vont bien

Pour annoter les images : labelimg
Pour trouver les doublons dans les images : Le script "compare_images.py" (run n'importe ou), et lui passer les deux dossier source(les images des autres) et to_add (les votres à ajouter)
Pour renommer toutes ses images en leur hash MD5 (A faire avant d'annoter) : le script "rename_dir_md5.py" (à déplacer dans le dossier JPEGImages pour run)

Extract MNIST handwritten digits dataset binary file into bmp images

MNIST-dataset-extractor Extract MNIST handwritten digits dataset binary file into bmp images More info at http://yann.lecun.com/exdb/mnist/ Dependenci

6 May 24, 2021

The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

Object-Placement-Assessment-Dataset-OPA Object-Placement-Assessment (OPA) is to verify whether a composite image is plausible in terms of the object p

53 Nov 15, 2022

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

CARscan- Approach 1 - Segmentation of images by detecting contours. It failed because in images with elements along with cars were also getting detect

5 Jul 29, 2021

Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels.

The Face Synthetics dataset Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels. It was introduced in ou

608 Jan 2, 2023

Script that receives an Image (original) and a set of images to be used as "pixels" in reconstruction of the Original image using the set of images as "pixels"

picinpics Script that receives an Image (original) and a set of images to be used as "pixels" in reconstruction of the Original image using the set of

1 Oct 24, 2021

[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

Comments

Adding our 200 pictures with their corresponding annotations
Collected 3 groups of pictures that depict humans in 3 different situations.

The first one, in which they wear their mask correctly

The second one, in which they don't wear a mask at all

The third and last one, in which they wear a mask incorrectly

When we talk about people wearing masks incorrectly, we talk about people wearing their masks without covering the nose and the chin at the same time, so if they cover just one of the two or none at all (for example by having it over their necks), that means that they are wearing the mask incorrectly.

We tried having 3 groups of photos of equivalent size.
opened by TebaiOsama 3
Problème nom de catégorie

Certaines annotations ont comme catégorie "mask_weared_incorrect" au lieu de "with_incorrect_mask".

Exemple: le fichier "annotations/0294a2bd4e2f4641e050176d83134542.xml"

opened by youssef-t 2

Le dataset des images du projet d'IA de 2021

Related tags

Overview

face-mask-dataset-ilc-2021

1. Répartition

2. Gestion des images

3. Pour commit

4. Les outils qui vont bien

You might also like...

Extract MNIST handwritten digits dataset binary file into bmp images

The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels.

Script that receives an Image (original) and a set of images to be used as "pixels" in reconstruction of the Original image using the set of images as "pixels"

[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021)

EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Comments

Adding our 200 pictures with their corresponding annotations

Problème nom de catégorie

Owner

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

This is the dataset and code release of the OpenRooms Dataset.

Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020

LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation (NeurIPS2021 Benchmark and Dataset Track)

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

The Habitat-Matterport 3D Research Dataset - the largest-ever dataset of 3D indoor spaces.

A public available dataset for road boundary detection in aerial images