Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021
Challenge: Report
- URL: http://arxiv.org/abs/2105.07809v1
- Date: Mon, 17 May 2021 13:20:35 GMT
- Title: Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021
Challenge: Report
- Authors: Andrey Ignatov, Cheng-Ming Chiang, Hsien-Kai Kuo, Anastasia Sycheva,
Radu Timofte, Min-Hung Chen, Man-Yu Lee, Yu-Syuan Xu, Yu Tseng, Shusong Xu,
Jin Guo, Chao-Hung Chen, Ming-Chun Hsyu, Wen-Chia Tsai, Chao-Wei Chen,
Grigory Malivenko, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang,
Shinjo Wang, Zheng Shaolong, Hao Dejun, Xie Fen, Feng Zhuang, Yipeng Ma,
Jingyang Peng, Tao Wang, Fenglong Song, Chih-Chung Hsu, Kwan-Lin Chen,
Mei-Hsuang Wu, Vishal Chudasama, Kalpesh Prajapati, Heena Patel, Anjali
Sarvaiya, Kishor Upla, Kiran Raja, Raghavendra Ramachandra, Christoph Busch,
Etienne de Stoutz
- Abstract summary: This challenge aims to develop an end-to-end deep learning-based image signal processing pipeline.
The proposed solutions are capable of processing Full HD photos under 60-100 milliseconds while achieving high fidelity results.
- Score: 49.643297263102845
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As the quality of mobile cameras starts to play a crucial role in modern
smartphones, more and more attention is now being paid to ISP algorithms used
to improve various perceptual aspects of mobile photos. In this Mobile AI
challenge, the target was to develop an end-to-end deep learning-based image
signal processing (ISP) pipeline that can replace classical hand-crafted ISPs
and achieve nearly real-time performance on smartphone NPUs. For this, the
participants were provided with a novel learned ISP dataset consisting of
RAW-RGB image pairs captured with the Sony IMX586 Quad Bayer mobile sensor and
a professional 102-megapixel medium format camera. The runtime of all models
was evaluated on the MediaTek Dimensity 1000+ platform with a dedicated AI
processing unit capable of accelerating both floating-point and quantized
neural networks. The proposed solutions are fully compatible with the above NPU
and are capable of processing Full HD photos under 60-100 milliseconds while
achieving high fidelity results. A detailed description of all models developed
in this challenge is provided in this paper.
Related papers
- MicroISP: Processing 32MP Photos on Mobile Devices with Deep Learning [114.66037224769005]
We present a novel MicroISP model designed specifically for edge devices.
The proposed solution is capable of processing up to 32MP photos on recent smartphones using the standard mobile ML libraries.
The architecture of the model is flexible, allowing to adjust its complexity to devices of different computational power.
arXiv Detail & Related papers (2022-11-08T17:40:50Z) - Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI &
AIM 2022 Challenge: Report [59.831324427712815]
This challenge aims to develop an efficient end-to-end AI-based image signal processing pipeline.
The models were evaluated on the Snapdragon's 8 Gen 1 GPU that provides excellent acceleration results for the majority of common deep learning ops.
The proposed solutions are compatible with all recent mobile GPUs, being able to process Full HD photos in less than 20-50 milliseconds while achieving high fidelity results.
arXiv Detail & Related papers (2022-11-07T22:13:10Z) - Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI
2021 Challenge: Report [67.86837649834636]
We introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image super-resolution solution.
The proposed solutions are fully compatible with all major mobile AI accelerators and are capable of reconstructing Full HD images under 40-60 ms.
arXiv Detail & Related papers (2021-05-17T13:34:15Z) - Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI
2021 Challenge: Report [64.09439666916465]
We introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solution.
The proposed solutions are fully compatible with any mobile GPU and are capable of processing 480p resolution images under 40-80 ms.
arXiv Detail & Related papers (2021-05-17T13:27:56Z) - Replacing Mobile Camera ISP with a Single Deep Learning Model [171.49776472948957]
PyNET is a novel pyramidal CNN architecture designed for fine-grained image restoration.
The model is trained to convert RAW Bayer data obtained directly from mobile camera sensor into photos captured with a professional high-end DSLR camera.
arXiv Detail & Related papers (2020-02-13T14:22:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.