Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Bhavya Goyal

Last update: Nov 15, 2022

Related tags

Deep Learning pytorch

Overview

Photon-Starved Scene Inference using Single Photon Cameras

ICCV 2021
Arxiv Project Video

Bhavya Goyal, Mohit Gupta

University of Wisconsin-Madison

Abstract

Scene understanding under low-light conditions is a challenging problem. This is due to the small number of photons captured by the camera and the resulting low signal-to-noise ratio (SNR). Single-photon cameras (SPCs) are an emerging sensing modality that are capable of cap-turing images with high sensitivity. Despite having minimal read-noise, images captured by SPCs in photon-starved conditions still suffer from strong shot noise, preventing reliable scene inference. We propose photon scale-space, a collection of high-SNR images spanning a wide range of photons-per-pixel (PPP) levels (but same scene content) as guides to train inference model on low photon flux images. We develop training techniques that push images with different illumination levels closer to each other in feature representation space. The key idea is that having a spectrum of different brightness levels during training enables effective guidance, and increases robustness to shot noise even in extreme noise cases. Based on the proposed approach, we demonstrate, via simulations and real experiments with a SPAD camera, high-performance on various inference tasks such as image classification and monocular depth estimation under ultra low-light, down to < 1 PPP.

Code Structure

.
├── classification          # Code for image classification using Photon Net training
├── monodepth               # Code for monocular depth estimation using Photon Net training
├── simulation              # Scripts for simulating noisy SPAD images
├── figures                 # figures used for results
└── README.md

Requirements/Installation

Install PyTorch (pytorch.org)
pip install -r requirements.txt

How to Use

Download the datasets (CUB/CARS/NYUV2/others) from the official sources and use scripts in simulation to simulate noisy images from SPAD
Use classification and monodepth code for image classifiation and monocular depth estimation using Photon Net

Citation

@InProceedings{Goyal_2021_ICCV,
    author    = {Goyal, Bhavya and Gupta, Mohit},
    title     = {Photon-Starved Scene Inference Using Single Photon Cameras},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {2512-2521}
}

You might also like...

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

README.md shall be finished soon. WSSGG 0 Overview 1 Installation 1.1 Faster-RCNN 1.2 Language Parser 1.3 GloVe Embeddings 2 Settings 2.1 VG-GT-Graph

35 Nov 20, 2022

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

human-pose-estimation-3d-python-cpp RealSenseD435 (RGB) 480x640 + CPU Corei9 45 FPS (Depth is not used) 1. Run 1-1. RealSenseD435 (RGB) 480x640 + CPU

8 Oct 3, 2022

PyTorch-LIT is the Lite Inference Toolkit (LIT) for PyTorch which focuses on easy and fast inference of large models on end-devices.

PyTorch-LIT PyTorch-LIT is the Lite Inference Toolkit (LIT) for PyTorch which focuses on easy and fast inference of large models on end-devices. With

157 Dec 11, 2022

Data-depth-inference - Data depth inference with python

Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Related tags

Overview

Photon-Starved Scene Inference using Single Photon Cameras

Bhavya Goyal, Mohit Gupta

Abstract

Code Structure

Requirements/Installation

How to Use

Citation

You might also like...

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

PyTorch-LIT is the Lite Inference Toolkit (LIT) for PyTorch which focuses on easy and fast inference of large models on end-devices.

Data-depth-inference - Data depth inference with python

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Official Repository for the ICCV 2021 paper "PixelSynth: Generating a 3D-Consistent Experience from a Single Image"

Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery (ICCV 2021)

"3D Human Texture Estimation from a Single Image with Transformers", ICCV 2021

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images (ICCV 2021)

Owner

Bhavya Goyal

Official PyTorch implementation of N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras (ICCV 2021)

Code for CVPR 2021 oral paper "Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts"

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Torchserve server using a YoloV5 model running on docker with GPU and static batch inference to perform production ready inference.

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.

Sync2Gen Code for ICCV 2021 paper: Scene Synthesis via Uncertainty-Driven Attribute Synchronization

Official PyTorch code of Holistic 3D Scene Understanding from a Single Image with Implicit Representation (CVPR 2021)

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion