Related papers: Evaluating Deep Neural Networks for Image Document Enhancement

Evaluating Deep Neural Networks for Image Document Enhancement

URL: http://arxiv.org/abs/2106.15286v1
Date: Fri, 11 Jun 2021 19:48:28 GMT
Title: Evaluating Deep Neural Networks for Image Document Enhancement
Authors: Lucas N. Kirsten, Ricardo Piccoli and Ricardo Ribani
Abstract summary: This work evaluates six state-of-the-art deep neural network (DNN) architectures applied to the problem of enhancing document images. The best performing architectures generally produced good enhancement compared to the existing algorithm.
Score: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: This work evaluates six state-of-the-art deep neural network (DNN) architectures applied to the problem of enhancing camera-captured document images. The results from each network were evaluated both qualitatively and quantitatively using Image Quality Assessment (IQA) metrics, and also compared with an existing approach based on traditional computer vision techniques. The best performing architectures generally produced good enhancement compared to the existing algorithm, showing that it is possible to use DNNs for document image enhancement. Furthermore, the best performing architectures could work as a baseline for future investigations on document enhancement using deep learning techniques. The main contributions of this paper are: a baseline of deep learning techniques that can be further improved to provide better results, and a evaluation methodology using IQA metrics for quantitatively comparing the produced images from the neural networks to a ground truth.

Related papers

Enhancing Underwater Images Using Deep Learning with Subjective Image Quality Integration [0.8287206589886879]
This paper presents a deep learning-based approach to improving underwater image quality.<n>We use publicly available datasets containing underwater images labeled by experts as either high or low quality.<n>Results demonstrate that the proposed model achieves substantial improvements in both perceived and measured image quality.
arXiv Detail & Related papers (2025-07-07T18:25:13Z)
A Tree-guided CNN for image super-resolution [50.30242741813306]
We design a tree-guided CNN for image super-resolution (TSRNet)<n>It uses a tree architecture to guide a deep network to enhance effect of key nodes to amplify the relation of hierarchical information.<n>To prevent insufficiency of the obtained structural information, cosine transform techniques in the TSRNet are used to improve performance of image super-resolution.
arXiv Detail & Related papers (2025-06-03T08:05:11Z)
Scene Perceived Image Perceptual Score (SPIPS): combining global and local perception for image quality assessment [0.0]
We propose a novel IQA approach that bridges the gap between deep learning methods and human perception. Our model disentangles deep features into high-level semantic information and low-level perceptual details, treating each stream separately. This hybrid design enables the model to assess both global context and intricate image details, better reflecting the human visual process.
arXiv Detail & Related papers (2025-04-24T04:06:07Z)
MGAN-CRCM: A Novel Multiple Generative Adversarial Network and Coarse-Refinement Based Cognizant Method for Image Inpainting [3.560962705392617]
This paper introduces a novel architecture combining GAN and ResNet models to improve image inpainting outcomes. Our framework integrates three components: Transpose Convolution-based GAN for guided and blind inpainting, Fast ResNet-Convolutional Neural Network (FR-CNN) for object removal, and Co-Modulation GAN (Co-Mod GAN) for refinement.
arXiv Detail & Related papers (2024-12-25T22:54:28Z)
MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency [3.3748750222488657]
This article introduces an innovative image classification model that employs three consecutive inception blocks within a convolutional neural networks framework. We compare our model with well-established architectures such as Visual Geometry Group, Residual Network, and MobileNet. The outcomes reveal that our novel model consistently outperforms its counterparts across diverse datasets.
arXiv Detail & Related papers (2024-12-18T10:46:04Z)
A Comprehensive Survey on Deep Neural Image Deblurring [0.76146285961466]
Image deblurring tries to eliminate degradation elements of an image causing blurriness and improve the quality of an image for better texture and object visualization. Traditionally, prior-based optimization approaches predominated in image deblurring, but deep neural networks recently brought a major breakthrough in the field. We outline the most popular deep neural network structures used in deblurring applications, describe their strengths and novelties, summarize performance metrics, and introduce broadly used datasets.
arXiv Detail & Related papers (2023-10-07T07:29:42Z)
Comparison Analysis of Traditional Machine Learning and Deep Learning Techniques for Data and Image Classification [62.997667081978825]
The purpose of the study is to analyse and compare the most common machine learning and deep learning techniques used for computer vision 2D object classification tasks. Firstly, we will present the theoretical background of the Bag of Visual words model and Deep Convolutional Neural Networks (DCNN) Secondly, we will implement a Bag of Visual Words model, the VGG16 CNN Architecture.
arXiv Detail & Related papers (2022-04-11T11:34:43Z)
Deep Image Deblurring: A Survey [165.32391279761006]
Deblurring is a classic problem in low-level computer vision, which aims to recover a sharp image from a blurred input image. Recent advances in deep learning have led to significant progress in solving this problem.
arXiv Detail & Related papers (2022-01-26T01:31:30Z)
SDT-DCSCN for Simultaneous Super-Resolution and Deblurring of Text Images [3.5590597557917363]
We propose an approach called SDT-DCSCN that jointly performs super-resolution and deblurring of low-resolution blurry text images based on DCSCN. Our approach uses subsampled blurry images in the input and original sharp images as ground truth.
arXiv Detail & Related papers (2022-01-15T14:51:50Z)
Image Quality Assessment using Contrastive Learning [50.265638572116984]
We train a deep Convolutional Neural Network (CNN) using a contrastive pairwise objective to solve the auxiliary problem. We show through extensive experiments that CONTRIQUE achieves competitive performance when compared to state-of-the-art NR image quality models. Our results suggest that powerful quality representations with perceptual relevance can be obtained without requiring large labeled subjective image quality datasets.
arXiv Detail & Related papers (2021-10-25T21:01:00Z)
Image Quality Assessment in the Modern Age [53.19271326110551]
This tutorial provides the audience with the basic theories, methodologies, and current progresses of image quality assessment (IQA) We will first revisit several subjective quality assessment methodologies, with emphasis on how to properly select visual stimuli. Both hand-engineered and (deep) learning-based methods will be covered.
arXiv Detail & Related papers (2021-10-19T02:38:46Z)
Joint Learning of Neural Transfer and Architecture Adaptation for Image Recognition [77.95361323613147]
Current state-of-the-art visual recognition systems rely on pretraining a neural network on a large-scale dataset and finetuning the network weights on a smaller dataset. In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness. Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks.
arXiv Detail & Related papers (2021-03-31T08:15:17Z)
Deep Multi-Scale Features Learning for Distorted Image Quality Assessment [20.7146855562825]
Existing deep neural networks (DNNs) have shown significant effectiveness for tackling the IQA problem. We propose to use pyramid features learning to build a DNN with hierarchical multi-scale features for distorted image quality prediction. Our proposed network is optimized in a deep end-to-end supervision manner.
arXiv Detail & Related papers (2020-12-01T23:39:01Z)
NAS-DIP: Learning Deep Image Prior with Neural Architecture Search [65.79109790446257]
Recent work has shown that the structure of deep convolutional neural networks can be used as a structured image prior. We propose to search for neural architectures that capture stronger image priors. We search for an improved network by leveraging an existing neural architecture search algorithm.
arXiv Detail & Related papers (2020-08-26T17:59:36Z)
Learning Local Complex Features using Randomized Neural Networks for Texture Analysis [0.1474723404975345]
We present a new approach that combines a learning technique and the Complex Network (CN) theory for texture analysis. This method takes advantage of the representation capacity of CN to model a texture image as a directed network. This neural network has a single hidden layer and uses a fast learning algorithm, which is able to learn local CN patterns for texture characterization.
arXiv Detail & Related papers (2020-07-10T23:18:01Z)

This list is automatically generated from the titles and abstracts of the papers in this site.