Dirty Pixels: Towards End-to-End Image Processing and Perception

Last update: Nov 18, 2022

Related tags

Deep Learning DirtyPixels

Overview

Dirty Pixels: Towards End-to-End Image Processing and Perception

This repository contains the code for the paper

Dirty Pixels: Towards End-to-End Image Processing and Perception
Steven Diamond, Vincent Sitzmann, Frank Julca-Aguilar, Stephen Boyd, Gordon Wetzstein, Felix Heide
Transactions on Graphics, 2021 | To be presented at SIGGRAPH, 2021

Installation

Clone this repository:

git clone [email protected]:princeton-computational-imaging/DirtyPixels.git

The project was developed using Python 3.6, Tensorflow (v1.12) and Slim. We provide an environment file to install all dependencies (creating an envirnoment called dirtypix):

conda env create -f environment.yml
conda activate dirtypix

Running Experiments

We provide code and data and trained models to reproduce the main results presented at the paper, and instructions on how to use this project for further research:

EVALUATION_INSTRUCTIONS.md provides instructions on how to evaluate our proposed models and reproduce results of the paper.
TRAINING_INSTRUCTIONS.md gives instructions on how to train new models following our proposed approach.
ADD_NOISE_INSTRUCTIONS.md explains how to simulate noisy raw images following the image formation model defined in the manuscript.

Citation

If you find our work useful in your research, please cite:

@article{steven:dirtypixels2021,
  title={Dirty Pixels: Towards End-to-End Image Processing and Perception},
  author={Diamond, Steven and Sitzmann, Vincent and Julca-Aguilar, Frank and Boyd, Stephen and Wetzstein, Gordon and Heide, Felix},
  journal={ACM Transactions on Graphics (SIGGRAPH)},
  year={2021},
  publisher={ACM}
}

License

This project is released under MIT License.

You might also like...

MNIST, but with Bezier curves instead of pixels

bezier-mnist This is a work-in-progress vector version of the MNIST dataset. Samples Here are some samples from the training set. Note that, while the

15 Jan 16, 2022

Code for our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes (CVPR 2021) Project page | Paper | Colab | Colab for Drawing App Rethinking Style

153 Jan 4, 2023

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

简体中文 | English PaddleRobotics paddleRobotics是基于paddle的机器人开源算法库集，包括人机交互、复杂运动控制、环境感知、slam定位导航等开源算法部分。人机交互主动多模交互技术TFVT-HRI 主动多模交互技术是通过视觉、语音、触摸传感器等输入机器人

185 Dec 26, 2022

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

This project is a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

147 Dec 3, 2022

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Perceiver - Pytorch Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch Install $ pip install perceiver-pytorch Usage

876 Dec 29, 2022

Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

Perceiver This Python package implements Perceiver: General Perception with Iterative Attention by Andrew Jaegle in TensorFlow. This model builds on t

84 Oct 15, 2022

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

When2com: Multi-Agent Perception via Communication Graph Grouping This is the PyTorch implementation of our paper: When2com: Multi-Agent Perception vi

34 Nov 9, 2022

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

EPSR (Enhanced Perceptual Super-resolution Network) paper This repo provides the test code, pretrained models, and results on benchmark datasets of ou

78 Nov 19, 2022

Certifiable Outlier-Robust Geometric Perception

Certifiable Outlier-Robust Geometric Perception About This repository holds the implementation for certifiably solving outlier-robust geometric percep

83 Dec 31, 2022

Comments

Are you sure the way you provide raw images is not a joke?

When I look at your sensor_model.py file, I find that the raw image is actually a rgb image generated by a mask. Honestly, this toy quality image leads me to think that the experimental results of the paper are not fair.

To save time, for people who plan to use this dataset to validate raw images, I suggest you look elsewhere.

opened by Baboom-l 0
Question about reproduction

I am sorry to bother you. When I retrain the network (isp + mobilenet) following your advice, I got loss=nan from the first step. I didn't modify your code at all.Do you have any advice to make this code work?

opened by miaoyuchun 0

Dirty Pixels: Towards End-to-End Image Processing and Perception

Related tags

Overview

Dirty Pixels: Towards End-to-End Image Processing and Perception

Installation

Running Experiments

Citation

License

You might also like...

MNIST, but with Bezier curves instead of pixels

Code for our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

Certifiable Outlier-Robust Geometric Perception

Comments

Are you sure the way you provide raw images is not a joke?

Question about reproduction

Owner

Code for Towards Streaming Perception (ECCV 2020) :car:

🐤 Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation

Towards End-to-end Video-based Eye Tracking

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Activating More Pixels in Image Super-Resolution Transformer

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Where2Act: From Pixels to Actions for Articulated 3D Objects

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

PixelPick This is an official implementation of the paper "All you need are a few pixels: semantic segmentation with PixelPick."