Multi-task deep CNN model for no-reference image quality assessment on
smartphone camera photos
- URL: http://arxiv.org/abs/2008.11961v1
- Date: Thu, 27 Aug 2020 07:33:05 GMT
- Title: Multi-task deep CNN model for no-reference image quality assessment on
smartphone camera photos
- Authors: Chen-Hsiu Huang, Ja-Ling Wu
- Abstract summary: We propose a multi-task deep CNN model with scene type detection as an auxiliary task.
With the shared model parameters in the convolution layer, the learned feature maps could become more scene-relevant.
The evaluation result shows improved SROCC performance compared to traditional NR-IQA methods and single task CNN-based models.
- Score: 2.0305676256390934
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Smartphone is the most successful consumer electronic product in today's
mobile social network era. The smartphone camera quality and its image
post-processing capability is the dominant factor that impacts consumer's
buying decision. However, the quality evaluation of photos taken from
smartphones remains a labor-intensive work and relies on professional
photographers and experts. As an extension of the prior CNN-based NR-IQA
approach, we propose a multi-task deep CNN model with scene type detection as
an auxiliary task. With the shared model parameters in the convolution layer,
the learned feature maps could become more scene-relevant and enhance the
performance. The evaluation result shows improved SROCC performance compared to
traditional NR-IQA methods and single task CNN-based models.
Related papers
- AIM 2024 Challenge on UHD Blind Photo Quality Assessment [49.54883484094671]
The AIM 2024 UHD-IQA Challenge aims to advance the No-Reference Image Quality Assessment (NR-IQA) task for modern, high-resolution photos.
The challenge is based on the recently released UHD-IQA Benchmark Database, which comprises 6,073 UHD-1 (4K) images annotated with perceptual quality ratings from expert raters.
arXiv Detail & Related papers (2024-09-24T17:44:24Z) - MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation [26.81879609001189]
No-Reference Image Quality Assessment (NR-IQA) can ecaluate image quality in real-time on mobile devices and enhance user experience.
Existing NR-IQA methods often resize or crop the HR images into small resolution, which leads to a loss of important details.
We propose MobileIQA, a novel approach that utilizes lightweight backbones to efficiently assess image quality while preserving image details.
arXiv Detail & Related papers (2024-09-02T12:42:50Z) - Dual-Branch Network for Portrait Image Quality Assessment [76.27716058987251]
We introduce a dual-branch network for portrait image quality assessment (PIQA)
We utilize two backbone networks (textiti.e., Swin Transformer-B) to extract the quality-aware features from the entire portrait image and the facial image cropped from it.
We leverage LIQE, an image scene classification and quality assessment model, to capture the quality-aware and scene-specific features as the auxiliary features.
arXiv Detail & Related papers (2024-05-14T12:43:43Z) - When No-Reference Image Quality Models Meet MAP Estimation in Diffusion Latents [92.45867913876691]
No-reference image quality assessment (NR-IQA) models can effectively quantify perceived image quality.
We show that NR-IQA models can be plugged into the maximum a posteriori (MAP) estimation framework for image enhancement.
arXiv Detail & Related papers (2024-03-11T03:35:41Z) - Real-Time Under-Display Cameras Image Restoration and HDR on Mobile
Devices [81.61356052916855]
The images captured by under-display cameras (UDCs) are degraded by the screen in front of them.
Deep learning methods for image restoration can significantly reduce the degradation of captured images.
We propose a lightweight model for blind UDC Image Restoration and HDR, and we also provide a benchmark comparing the performance and runtime of different methods on smartphones.
arXiv Detail & Related papers (2022-11-25T11:46:57Z) - Perceptual Image Enhancement for Smartphone Real-Time Applications [60.45737626529091]
We propose LPIENet, a lightweight network for perceptual image enhancement.
Our model can deal with noise artifacts, diffraction artifacts, blur, and HDR overexposure.
Our model can process 2K resolution images under 1 second in mid-level commercial smartphones.
arXiv Detail & Related papers (2022-10-24T19:16:33Z) - Controllable Image Enhancement [66.18525728881711]
We present a semiautomatic image enhancement algorithm that can generate high-quality images with multiple styles by controlling a few parameters.
An encoder-decoder framework encodes the retouching skills into latent codes and decodes them into the parameters of image signal processing functions.
arXiv Detail & Related papers (2022-06-16T23:54:53Z) - Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021
Challenge: Report [49.643297263102845]
This challenge aims to develop an end-to-end deep learning-based image signal processing pipeline.
The proposed solutions are capable of processing Full HD photos under 60-100 milliseconds while achieving high fidelity results.
arXiv Detail & Related papers (2021-05-17T13:20:35Z) - AWNet: Attentive Wavelet Network for Image ISP [14.58067200317891]
We introduce a novel network that utilizes the attention mechanism and wavelet transform, dubbed AWNet, to tackle this learnable image ISP problem.
Our proposed method enables us to restore favorable image details from RAW information and achieve a larger receptive field.
Experimental results indicate the advances of our design in both qualitative and quantitative measurements.
arXiv Detail & Related papers (2020-08-20T23:28:41Z) - No-Reference Image Quality Assessment via Feature Fusion and Multi-Task
Learning [29.19484863898778]
Blind or no-reference image quality assessment (NR-IQA) is a fundamental, unsolved, and yet challenging problem.
We propose a simple and yet effective general-purpose no-reference (NR) image quality assessment framework based on multi-task learning.
Our model employs distortion types as well as subjective human scores to predict image quality.
arXiv Detail & Related papers (2020-06-06T05:04:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.