ICCV2021 - A New Journey from SDRTV to HDRTV.

HDRTVNet [Paper Link]

A New Journey from SDRTV to HDRTV

By Xiangyu Chen*, Zhengwen Zhang*, Jimmy S. Ren, Lynhoo Tian, Yu Qiao and Chao Dong

(* indicates equal contribution)

This paper was accepted to ICCV 2021.

Overview

Simplified SDRTV/HDRTV formation pipeline:

Overview of the method:

Getting Started

  1. Dataset
  2. Configuration
  3. How to test
  4. How to train
  5. Metrics
  6. Visualization

Dataset

We construct a dataset using 4K-resolution videos under the HDR10 standard (10-bit, Rec.2020, PQ) and their SDR counterparts from YouTube. The dataset consists of a training set with 1235 image pairs and a test set with 117 image pairs. Please refer to the paper for details on how the dataset was processed. The dataset can be downloaded from Baidu Netdisk (access code: 6qvu) or OneDrive (access code: HDRTVNet).
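
For reference, the SDR frames are 8-bit PNGs and the HDR frames are 16-bit PNGs holding the 10-bit PQ code values (see the bit-depth question in the comments below). A minimal loading sketch with OpenCV; the file paths are hypothetical, so adjust them to the downloaded folder layout:

import cv2
import numpy as np

# Hypothetical paths; the real layout follows the downloaded dataset.
sdr = cv2.imread("training_set/SDR/0001.png", cv2.IMREAD_UNCHANGED)  # 8-bit SDR frame
hdr = cv2.imread("training_set/HDR/0001.png", cv2.IMREAD_UNCHANGED)  # 16-bit PNG of 10-bit PQ values

# Normalize both to [0, 1] for training or evaluation.
sdr = sdr.astype(np.float32) / 255.0
hdr = hdr.astype(np.float32) / 65535.0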

We also provide the original YouTube links of these videos, which can be found in this file. Note that we cannot provide download links for the videos themselves, since we do not hold the copyright to distribute them. Please use this dataset for academic purposes only.

Configuration

Please refer to the requirements. MATLAB is also used to process the data, but it is not strictly necessary and can be replaced by OpenCV.
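
For example, a rough OpenCV stand-in for the bicubic downsampling in ./scripts/generate_mod_LR_bic.m could look like the sketch below. The x4 factor is an assumption based on the bicx4 naming that appears in the dataset folders, and note that cv2.INTER_CUBIC does not exactly reproduce MATLAB's imresize, so scores may differ slightly:

import cv2

scale = 4  # assumed downsampling factor for the condition-network input
img = cv2.imread("input_sdr/0001.png", cv2.IMREAD_UNCHANGED)  # hypothetical path
h, w = img.shape[:2]
img = img[:h - h % scale, :w - w % scale]  # "mod" crop so both sides divide evenly
lr = cv2.resize(img, (img.shape[1] // scale, img.shape[0] // scale),
                interpolation=cv2.INTER_CUBIC)
cv2.imwrite("condition_lr/0001.png", lr)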

How to test

We provide pretrained models for testing, which can be downloaded from Baidu Netdisk (access code: 2me9) or OneDrive (access code: HDRTVNet). Since our method is a cascade of three steps, the results also need to be inferred step by step.

  • Before testing, you can optionally generate the downsampled inputs of the condition network in advance. Make sure the input_folder and save_LR_folder in ./scripts/generate_mod_LR_bic.m are correct, then run the file in MATLAB (or use the OpenCV sketch in the Configuration section above). This produces bicubic-downsampled versions of the input SDR images, which are fed to the condition network. This step is optional, but it reproduces the reported performance more precisely.
  • For the first part, AGCM, make sure the paths of dataroot_LQ, dataroot_cond, dataroot_GT and pretrain_model_G in ./codes/options/test/test_AGCM.yml are correct, then run
cd codes
python test.py -opt options/test/test_AGCM.yml
  • Note that if the first step was not performed, the dataroot_cond line should be commented out. The test results will be saved to ./results/Adaptive_Global_Color_Mapping.
  • For the second part, LE, make sure dataroot_LQ points to the results obtained by AGCM, then run
python test.py -opt options/test/test_LE.yml
  • Note that the results generated by LE achieve the best quantitative performance. The HG part completes the solution and further improves the visual quality. To test the last part, HG, make sure dataroot_LQ points to the results obtained by LE, then run
python test.py -opt options/test/test_HG.yml
  • Note that the results of each step are 16-bit images, which can be converted into HDR10 video.
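
As a rough illustration of that last point, the 16-bit outputs map back to 10-bit HDR10 code values by simple rescaling (the result filename here is hypothetical):

import cv2
import numpy as np

img16 = cv2.imread("results/Adaptive_Global_Color_Mapping/0001.png", cv2.IMREAD_UNCHANGED)
assert img16.dtype == np.uint16
# Rescale 16-bit values to the 10-bit code range used by HDR10 video.
code10 = np.round(img16.astype(np.float32) / 65535.0 * 1023.0).astype(np.uint16)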

How to train

  • Prepare the data. Generate sub-images of a specific patch size using ./scripts/extract_subimgs_single.py (a minimal sketch appears after this list) and generate the downsampled inputs for the condition network (using ./scripts/generate_mod_LR_bic.m or any other method).
  • For AGCM, make sure that the paths and settings in ./options/train/train_AGCM.yml are correct, then run
cd codes
python train.py -opt options/train/train_AGCM.yml
  • For LE, the inputs are generated by the trained AGCM model. The original data should first be run through the AGCM step (refer to the testing instructions above) and then processed by extracting sub-images. After that, modify the corresponding settings in ./options/train/train_LE.yml and run
python train.py -opt options/train/train_LE.yml
  • For HG, the inputs are obtained from the previous LE step, so the training data needs to be processed by operations similar to the previous two parts. When the data is prepared, it is recommended to first pretrain the generator by running
python train.py -opt options/train/train_HG_Generator.yml
  • After that, choose a pretrained model and modify the pretrained-model path in ./options/train/train_HG_GAN.yml, then run
python train.py -opt options/train/train_HG_GAN.yml
  • All models and training states are stored in ./experiments.
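
As mentioned in the first step of this list, sub-image extraction slides a fixed-size crop over each frame. Below is a minimal sketch assuming the 480x480 patch / 240 stride setting described in the comments further down; the repository's own script is ./scripts/extract_subimgs_single.py, and the folder names here are hypothetical:

import os
import cv2

patch, stride = 480, 240  # assumed from the 480x480 / step-240 setting noted in the comments
src, dst = "training_set/SDR", "training_set/SDR_sub"  # hypothetical folders
os.makedirs(dst, exist_ok=True)
for name in sorted(os.listdir(src)):
    img = cv2.imread(os.path.join(src, name), cv2.IMREAD_UNCHANGED)
    h, w = img.shape[:2]
    idx = 0
    for y in range(0, h - patch + 1, stride):
        for x in range(0, w - patch + 1, stride):
            idx += 1
            out_name = f"{os.path.splitext(name)[0]}_s{idx:03d}.png"
            cv2.imwrite(os.path.join(dst, out_name), img[y:y + patch, x:x + patch])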

Metrics

Five metrics are used to evaluate the quantitative performance of different methods: PSNR, SSIM, SR_SIM, Delta E_ITP (ITU-R Rec. BT.2124) and HDR-VDP3. Since the latter three metrics are not yet common in recent papers, we provide reference code in ./metrics for convenient usage.
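
As one concrete example, PSNR on the 16-bit outputs is computed against a peak value of 65535. This is a minimal sketch, not the exact script from ./metrics:

import numpy as np

def psnr_16bit(pred: np.ndarray, gt: np.ndarray) -> float:
    """PSNR between two 16-bit images (peak value 65535)."""
    mse = np.mean((pred.astype(np.float64) - gt.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(65535.0 ** 2 / mse)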

Visualization

Since HDR10 is an HDR standard that uses the PQ transfer function for video, the correct way to visualize the results is to synthesize the image results into a video and display it on an HDR monitor or TV that supports HDR. The HDR images in our dataset are generated by directly extracting frames from the original HDR10 videos, so these images, consisting of PQ code values, look relatively dark compared to their true appearance. We provide reference commands for extracting frames and synthesizing videos in ./scripts. Please use MediaInfo to check the format and encoding information of the synthesized videos before visualization. If circumstances permit, we strongly recommend viewing the HDR results and the original HDR sources this way on an HDR display.
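
Purely as an illustration of that pipeline (the authoritative commands are the ones in ./scripts), an HDR10 video could be synthesized from the extracted frames with an ffmpeg build that includes libx265; the frame rate and paths below are assumptions:

ffmpeg -r 24 -i frames/%04d.png -c:v libx265 -pix_fmt yuv420p10le \
  -x265-params "colorprim=bt2020:transfer=smpte2084:colormatrix=bt2020nc" output_hdr10.mkv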

If an HDR display is not available, some media players with HDR rendering, such as PotPlayer, can play the HDR video and show a relatively realistic look. Note that this is only an approximate alternative; it still cannot fully restore the appearance of HDR content on an HDR monitor.

Citation

If our work is helpful to you, please cite our paper:

@inproceedings{chen2021new,
  title={A New Journey from SDRTV to HDRTV}, 
  author={Chen, Xiangyu and Zhang, Zhengwen and Ren, Jimmy S. and Tian, Lynhoo and Qiao, Yu and Dong, Chao},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2021}
}
Comments
  • Reproduced result of Deep-SRITM

    opened by tnt-ooo-tnt 7
  • Reproduced results of previous works

    Hi, I am really glad to see your beautiful works.

    I have got some questions here.

    In Figure 1, (b), (c) and (d) can be read as categories of methods, and the figure also shows how you reproduced the results in Table 1. Table 1 divides methods into 4 categories (LDR-to-HDR, image-to-image translation, photo retouching, and SDRTV-to-HDRTV).

    I am a little bit confused about which category image-to-image translation and photo retouching belong to.

    1. Are they all end-to-end methods?

    In data manipulation scripts, I could find a python script and a matlab script.

    First, the python script crops 480x480 patches from inputs with step size 240. Next, the matlab script produces mod, bicubic-downsampled, and bicubic-upsampled images.

    2. I want to know exactly how each processed image is used, i.e., which dataset is fed to which network.

    (I am confused by the paths named LE_training_set_res_sub, LE_test_set_res_bicx4, test_hdr_bicx4, etc.)

    Thank you for your reply in advance.

    opened by tnt-ooo-tnt 5
  • About mask calculation

    Hi, thank you for your great work! I have a question about the mask calculation in highlight generation. As described in Figure 2 of your paper, the mask is computed from the original SDR image, but I found that it is computed from the LE output in your code. Which one should we use? As far as I can tell, the non-linear RGB values in the HDR image are hardly ever above 0.95.

    https://github.com/chxy95/HDRTVNet/blob/41d63b58bb953a83fcd1666d77848945a261fb06/codes/data/LQGT_hallucination_dataset.py#L77

    opened by hahahaprince 3
  • Output image size issue

    The input is 4K (3840×2160). The AGCM and LE outputs match the input size, but the HG output is 3840×2176. I traced this to the following code in generate_mask.py:

    if H % 32 != 0 or W % 32 != 0:
        H_new = int(np.ceil(H / 32) * 32)
        W_new = int(np.ceil(W / 32) * 32)
        img_LQ = cv2.resize(img_LQ, (W_new, H_new))

    Why do the dimensions need to be divisible by 32 in the HG network, and how can the change in output size be fixed?

    opened by sunyclj 5
  • module 'data.util' has no attribute 'read_npy'

    In data/LQGT_hallucination_dataset.py, line 45, in __getitem__: mask = util.read_npy(mask_path)

    AttributeError: module 'data.util' has no attribute 'read_npy'

    opened by feimengjuan 1
  • Bit depth of the original video

    Hi, I have a question. As you say, the dataset is extracted from HDR10-standard video (10-bit, Rec.2020, PQ) and stored as 16-bit PNG. However, when I read an HDR image with MATLAB, I found that the image contains more than 60000 distinct pixel intensities instead of 1024, which suggests it is a true 16-bit image. Is the original video really only 10 bits?

    opened by gzsun1416 1
  • Possibility of real-time mpv shader version?

    Hi, HDRTVNet looks really great! I was wondering whether it would be possible to make a real-time mpv shader version of HDRTVNet, or of a simplified version of it. Looking at the list of user shaders for mpv (https://github.com/mpv-player/mpv/wiki/User-Scripts#user-shaders), all current deep-learning shaders are only for spatial upscaling; HDRTVNet would be the first shader for color-space up-mapping, which would make it more practical to use.

    Thank you very much

    opened by SephirothFFKH 2
  • multi-gpu training

    Hi, thanks for your good work!

    I would like to know how to train on multiple GPUs to speed up training.

    Looking forward to your reply! thanks a lot in advance!

    opened by haifangqin 3
  • man-made color card

    I want to reproduce the Color Transition Test, but after searching for a long time I still don't know how to do it. Could you tell me how, or give me a hint? Sorry to trouble you again. Thank you!

    opened by wanghu178 1