Internship Assessment Task for BaggageAI.

Arya Shah

Last update: Nov 14, 2022

Related tags

Deep Learning BaggageAI

Overview

BaggageAI Internship Task

Problem Statement:

You are given two sets of images:- background and threat objects. Background images are the background x-ray images of baggage that gets generated after passing through a X-ray machine at airport. Threat images are the x-ray images of threats that are prohibited at airport while travelling.
Your task is to cut the threat objects, scale it down, rotate with 45 degree and paste it into the background images using image processing techniques in python.
Threat objects should be translucent, means it should not look like that it is cut pasted. It should look like that the threat was already there in the background images. Translucent means the threat objects should have shades of background where it is pasted.
Threat should not go outside the boundary of the baggage. ** difficult **
If there is any background of threat objects, then it should not be cut pasted into the background images, which means while cutting the threat objects, the boundary of a threat object should be tight-bound.

Solution:

Libraries Used :

OpenCV
numpy
glob
os
matplotlib
itertools

Methodology

To start with, we read the threat images, background images using the read_images function. For each threat image, it is first converted to grayscale and then dilated with 5x5 matrix of ones with iteration 2. Thi sis done to smooth out the image since the bright area around the threat image gets dilated around the background. Next, we create a mask for the threat object using a threshold value for white and the cv2 function inRange(). Then, the threat image is cropped to a square using a threshold value using the form_square() function. The images are padded dynamically so that when the threat is rotated 45 degrees, the whole threat image is covered and nothing is cut out. Loop through the background images and find the coordinates of the centre of the largest contour found in the background image using get_xy() function. Next, we fix the threat image according to the x, y position in background image. Finally we lace the threat in the background image using the place_threat() function.

The saved images are stored in the output folder for future reference.

Documentation:

read_images(path): This function reads the .jpg files from a specific location and returns a list of images as numpy array and the number of images read.
form_square(image): This function takes in a image(threat, with the background set to black using the inRange() OpenCV function) and finds the left, right, top, and bottom of the threat object, therby removing the extra background. NOTE: The threat object is not guaranteed to be a square. So this function also checks the image for the height and width of the cropped threat image and pad black portion in top-buttom of left-right making it a square image.
pad_image(image): This function calculates the diagonal length of the image and set the height and width of the image equal to diagonal length.
get_xy(background): This function craeates a binary image of the baggage using inRange() function and then inverts it. Next it finds the contours in the binary image and then the contour with maximum area is selected and the center of the countour is found using moments().
place_threat(background, threat, x=0, y=0): This function places the threat image in the background image in (x, y) location on the background. Defaults to x=0 and y=0.

Implementation of temporal pooling methods studied in [ICIP'20] A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment

5 Sep 16, 2022

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

DSN-IQA Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment" Requirements Python =3.8.0 Pytorch =1.7.1 Usage wit

7 Oct 13, 2022

A user-friendly research and development tool built to standardize RL competency assessment for custom agents and environments.

Built with ❤️ by Sam Showalter Contents Overview Installation Dependencies Usage Scripts Standard Execution Environment Development Environment Benchm

1 Nov 18, 2021

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

2 Jan 5, 2022

Internship Assessment Task for BaggageAI.

Related tags

Overview

BaggageAI Internship Task

Problem Statement:

Solution:

Libraries Used :

Methodology

Documentation:

You might also like...

Implementation of temporal pooling methods studied in [ICIP'20] A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

A user-friendly research and development tool built to standardize RL competency assessment for custom agents and environments.

Lightweight Face Image Quality Assessment

MRQy is a quality assurance and checking tool for quantitative assessment of magnetic resonance imaging (MRI) data.

ZEBRA: Zero Evidence Biometric Recognition Assessment

No-reference Image Quality Assessment(NIQA) Algorithms (BRISQUE, NIQE, PIQE, RankIQA, MetaIQA)

Pytorch implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

Owner

Arya Shah

Repo for 2021 SDD assessment task 2, by Felix, Anna, and James.

Easy and comprehensive assessment of predictive power, with support for neuroimaging features

MagFace: A Universal Representation for Face Recognition and Quality Assessment

Code for paper "A Critical Assessment of State-of-the-Art in Entity Alignment" (https://arxiv.org/abs/2010.16314)

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

NIMA: Neural IMage Assessment

A PyTorch Implementation of Neural IMage Assessment

Official PyTorch implementation of the paper "Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN", accepted to ACM MM 2021 BNI Track.

[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"