Evaluating Deep Neural Networks for Image Document Enhancement
- URL: http://arxiv.org/abs/2106.15286v1
- Date: Fri, 11 Jun 2021 19:48:28 GMT
- Title: Evaluating Deep Neural Networks for Image Document Enhancement
- Authors: Lucas N. Kirsten, Ricardo Piccoli and Ricardo Ribani
- Abstract summary: This work evaluates six state-of-the-art deep neural network (DNN) architectures applied to the problem of enhancing document images.
The best performing architectures generally produced good enhancement compared to the existing algorithm.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This work evaluates six state-of-the-art deep neural network (DNN)
architectures applied to the problem of enhancing camera-captured document
images. The results from each network were evaluated both qualitatively and
quantitatively using Image Quality Assessment (IQA) metrics, and also compared
with an existing approach based on traditional computer vision techniques. The
best performing architectures generally produced good enhancement compared to
the existing algorithm, showing that it is possible to use DNNs for document
image enhancement. Furthermore, the best performing architectures could work as
a baseline for future investigations on document enhancement using deep
learning techniques. The main contributions of this paper are: a baseline of
deep learning techniques that can be further improved to provide better
results, and a evaluation methodology using IQA metrics for quantitatively
comparing the produced images from the neural networks to a ground truth.
Related papers
- MGAN-CRCM: A Novel Multiple Generative Adversarial Network and Coarse-Refinement Based Cognizant Method for Image Inpainting [3.560962705392617]
This paper introduces a novel architecture combining GAN and ResNet models to improve image inpainting outcomes.
Our framework integrates three components: Transpose Convolution-based GAN for guided and blind inpainting, Fast ResNet-Convolutional Neural Network (FR-CNN) for object removal, and Co-Modulation GAN (Co-Mod GAN) for refinement.
arXiv Detail & Related papers (2024-12-25T22:54:28Z) - MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency [3.3748750222488657]
This article introduces an innovative image classification model that employs three consecutive inception blocks within a convolutional neural networks framework.
We compare our model with well-established architectures such as Visual Geometry Group, Residual Network, and MobileNet.
The outcomes reveal that our novel model consistently outperforms its counterparts across diverse datasets.
arXiv Detail & Related papers (2024-12-18T10:46:04Z) - Comparison Analysis of Traditional Machine Learning and Deep Learning
Techniques for Data and Image Classification [62.997667081978825]
The purpose of the study is to analyse and compare the most common machine learning and deep learning techniques used for computer vision 2D object classification tasks.
Firstly, we will present the theoretical background of the Bag of Visual words model and Deep Convolutional Neural Networks (DCNN)
Secondly, we will implement a Bag of Visual Words model, the VGG16 CNN Architecture.
arXiv Detail & Related papers (2022-04-11T11:34:43Z) - Deep Image Deblurring: A Survey [165.32391279761006]
Deblurring is a classic problem in low-level computer vision, which aims to recover a sharp image from a blurred input image.
Recent advances in deep learning have led to significant progress in solving this problem.
arXiv Detail & Related papers (2022-01-26T01:31:30Z) - SDT-DCSCN for Simultaneous Super-Resolution and Deblurring of Text
Images [3.5590597557917363]
We propose an approach called SDT-DCSCN that jointly performs super-resolution and deblurring of low-resolution blurry text images based on DCSCN.
Our approach uses subsampled blurry images in the input and original sharp images as ground truth.
arXiv Detail & Related papers (2022-01-15T14:51:50Z) - Image Quality Assessment using Contrastive Learning [50.265638572116984]
We train a deep Convolutional Neural Network (CNN) using a contrastive pairwise objective to solve the auxiliary problem.
We show through extensive experiments that CONTRIQUE achieves competitive performance when compared to state-of-the-art NR image quality models.
Our results suggest that powerful quality representations with perceptual relevance can be obtained without requiring large labeled subjective image quality datasets.
arXiv Detail & Related papers (2021-10-25T21:01:00Z) - Image Quality Assessment in the Modern Age [53.19271326110551]
This tutorial provides the audience with the basic theories, methodologies, and current progresses of image quality assessment (IQA)
We will first revisit several subjective quality assessment methodologies, with emphasis on how to properly select visual stimuli.
Both hand-engineered and (deep) learning-based methods will be covered.
arXiv Detail & Related papers (2021-10-19T02:38:46Z) - Joint Learning of Neural Transfer and Architecture Adaptation for Image
Recognition [77.95361323613147]
Current state-of-the-art visual recognition systems rely on pretraining a neural network on a large-scale dataset and finetuning the network weights on a smaller dataset.
In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness.
Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks.
arXiv Detail & Related papers (2021-03-31T08:15:17Z) - Deep Multi-Scale Features Learning for Distorted Image Quality
Assessment [20.7146855562825]
Existing deep neural networks (DNNs) have shown significant effectiveness for tackling the IQA problem.
We propose to use pyramid features learning to build a DNN with hierarchical multi-scale features for distorted image quality prediction.
Our proposed network is optimized in a deep end-to-end supervision manner.
arXiv Detail & Related papers (2020-12-01T23:39:01Z) - NAS-DIP: Learning Deep Image Prior with Neural Architecture Search [65.79109790446257]
Recent work has shown that the structure of deep convolutional neural networks can be used as a structured image prior.
We propose to search for neural architectures that capture stronger image priors.
We search for an improved network by leveraging an existing neural architecture search algorithm.
arXiv Detail & Related papers (2020-08-26T17:59:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.