Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

Last update: Dec 15, 2022

Related tags

Deep Learning Abandon_Bayer-Filter_See_in_the_Dark

Overview

Abandoning the Bayer-Filter to See in the Dark (CVPR 2022)

Paper: https://arxiv.org/abs/2203.04042 (Arxiv version)

This code includes the training and testing procedures of our network on our Mono-colored raw Paired (MCR) dataset and SID dataset's Sony part.

Abstract: Low-light image enhancement - a pervasive but challenging problem, plays a central role in enhancing the visibility of an image captured in a poor illumination environment. Due to the fact that not all photons can pass the Bayer-Filter on the sensor of the color camera, in this work, we first present a De-Bayer-Filter simulator based on deep neural networks to generate a monochrome raw image from the colored raw image. Next, a fully convolutional network is proposed to achieve the low-light image enhancement by fusing colored raw data with synthesized monochrome raw data. Channel-wise attention is also introduced to the fusion process to establish a complementary interaction between features from colored and monochrome raw images. To train the convolutional networks, we propose a dataset with monochrome and color raw pairs named Mono-Colored Raw paired dataset (MCR) collected by using a monochrome camera without Bayer-Filter and a color camera with Bayer-Filter. The proposed pipeline take advantages of the fusion of the virtual monochrome and the color raw images and our extensive experiments indicate that significant improvement can be achieved by leveraging raw sensor data and data-driven learning.

Video demos:

Requirments

This is the Pytorch implementation of our work. The next requirments and some other frequently-used Library will be needed.

Python >= 3.7
Pytorch >= 1.7.1
scikit-image 0.18.1
imageio 2.9.0
rawpy 0.17.0

Dataset

We propose the MCR [Google Drive, Baidu Netdisk (Extraction code: 22cv)], a dataset of colored raw and monochrome raw image pairs, captured with the same exposure setting. Each image has a resolution of 1280×1024.

The zip file contain 3 parts:

Mono_Colored_RAW_Paired_DATASET
- RGB_GT (498 images)
- Mono_GT (498 images)
- Color_RAW_Input (498 × 8 images)

Totally 498 different scenes, each scene has 1 corresponding RGB and Monochrome ground truth and 8 different exposure color Raw inputs.

We assemble the color camera and the monochrome camera up and down, setting the same exposure times and gain in the same scene.

And we choose the monochrome GT and RGB GT under the longest exposure time.

The file name contains the image information. Take the image name:"C00001_48mp_0x8_0x1fff.tif" as an example.

"C" means it is color raw image;

"00001" is the image number;

"48mp" is the master clock frequency 48 MHz;

"0x8" is the hex number of global gain;

"0x1fff" indicate the shutter width of the camera which can calculate the exposure time.

For visual convenience, we convert '.raw' files to '.tif' files which have the same 'RGGB' Bayer-filter pattern as the raw files.

The relationship between shutter width and the exposure time is shown in the table below.

The first 2 rows represent the exposure time settings in the indoor scenes which image number is between 1 and 499, the last 2 rows represent outdoor scenes' exposure time settings.

Alignment

We propose our alignment code for color and monochrome images taken by 2 cameras under the same scene.

Alignment/raw folder contain samples of color and monochrome raw images in one scene.

Firstly,in convert_RAW.py, we reshape the flatten '.raw' files into '1280×1024' and save as '.tif' files, and then we use the 'exiftool.exe' to add the head info 'pbpx_exft_args.txt' and save the images as '.dng' and '.jpg' files.

Secondly, in alignment.py, we choose two '.jpg' images from color and monochrome, and use opencv to calculate the homograph to do the alignment for monochrome images.

Training & Testing

For fully loading our dataset, 36G RAM are needed.

The 'random_path_list' contain the split train&test path lists in our dataset.

We split 3984 pairs of our dataset into train set: 3600 pairs and test set: 384 pairs

We train and test our MCR dataset with train.py and test.py, and we also train our network on SID dataset's Sony part with train_on_SID.py and test_on_SID.py.

The pre-trained models on both datasets can be found at MCR_pretrained_model and SID_pretrained_model

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{Dong2022Abandoning,
	title={Abandoning the Bayer-Filter to See in the Dark},
	author={Dong, Xingbo and Xu, Wanyan and Miao, Zhihui and Ma, Lan and Zhang, Chao and Yang, Jiewen and Jin, Zhe and Teoh, Andrew Beng Jin and Shen, Jiajun},
	booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
	year={2022}
}

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

You might also like...

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Comments

How to find the `pbpx_exft_args.txt`

Hi,

Thanks for sharing the nice work. I'm trying to follow the readme to convert raw image to RGB, but having difficulties. Could you please share how you ran the scripts? Also, where can I find the pbpx_exft_args.txt? Thanks!

opened by Ir1d 0
How to find the `pbpx_exft_args.txt`

Hi,

Thanks for sharing the nice work. I'm trying to follow the readme to convert raw image to RGB, but having difficulties. Could you please share how you ran the scripts? Also, where can I find the pbpx_exft_args.txt? Thanks!

Never mind, I found it. Thanks!

opened by Ir1d 0

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

Related tags

Overview

Abandoning the Bayer-Filter to See in the Dark (CVPR 2022)

Requirments

Dataset

Alignment

Training & Testing

Citation

License

You might also like...

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding (CVPR2022)

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)

Towards Implicit Text-Guided 3D Shape Generation (CVPR2022)

Video Frame Interpolation with Transformer (CVPR2022)

[CVPR2022] Representation Compensation Networks for Continual Semantic Segmentation

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

Comments

How to find the `pbpx_exft_args.txt`

How to find the `pbpx_exft_args.txt`

Owner

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

This is an official implementation of the CVPR2022 paper "Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots".

CVPR2022 paper "Dense Learning based Semi-Supervised Object Detection"

The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

Official code for "Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes", CVPR2022

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

PSTR: End-to-End One-Step Person Search With Transformers (CVPR2022)

[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos

Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)