ACM Multimedia Oral

Learning Exposure Correction in Dynamic Scenes

Jin Liu 1,2,  Bo Wang 1,2Chuanming Wang 1,2Huiyuan Fu 1,2Huadong Ma 1,2
1 State Key Laboratory Of Networking And Switching Technology
2 Beijing University of Posts and Telecommunications

TL;DR: This paper constructs the first real-world paired video dataset DIME, including both underexposure and overexposure dynamic scenes, and proposes an end-to-end video exposure correction network, in which a dual-stream module is designed to deal with both underexposure and overexposure factors.


Abstract

Exposure correction aims to enhance visual data suffering from improper exposures, which can greatly improve satisfactory visual effects. However, previous methods mainly focus on the image modality, and the video counterpart is less explored in the literature. Directly applying prior image-based methods to videos results in temporal incoherence with low visual quality. Through thorough investigation, we find that the development of relevant communities is limited by the absence of a benchmark dataset. Therefore, in this paper, we construct the first real-world paired video dataset, including both underexposure and overexposure dynamic scenes. To achieve spatial alignment, we utilize two DSLR cameras and a beam splitter to simultaneously capture improper and normal exposure videos. Additionally, we propose an end-to-end video exposure correction network, in which a dual-stream module is designed to deal with both underexposure and overexposure factors, enhancing the illumination based on Retinex theory. The extensive experiments based on various metrics and user studies demonstrate the significance of our dataset and the effectiveness of our method. The code and dataset are available at https://github.com/kravrolens/VECNet.

Optical System to Collect Dataset

Optical system to collect dataset
We built the optical system to capture the normal and over-/under- exposure video pairs.

Network Architecture

RECNet architecture
Overview of our framework. It contains three modules, including Multi-frame Fourier Alignment (MFA), Dual-stream Illumination Construction (DIC), and Two-stage Synthesis Restoration (TSR).
BibTeX
@inproceedings{liuLearningExposure2024,
  title = {Learning Exposure Correction in Dynamic Scenes},
  author = {Liu, Jin and Wang, Bo and Wang, Chuanming and Fu, Huiyuan and Ma, Huadong},
  booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia},
  year = {2024}    
}