The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text
Recognition
- URL: http://arxiv.org/abs/2208.07682v1
- Date: Tue, 16 Aug 2022 11:44:16 GMT
- Title: The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text
Recognition
- Authors: Silvia Cascianelli, Vittorio Pippi, Martin Maarand, Marcella Cornia,
Lorenzo Baraldi, Christopher Kermorvant, Rita Cucchiara
- Abstract summary: Handwritten Text Recognition (HTR) is an open problem at the intersection of Computer Vision and Natural Language Processing.
We present the Ludovico Antonio Muratori dataset, a large line-level HTR dataset of Italian ancient manuscripts edited by a single author over 60 years.
- Score: 40.20527158935902
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Handwritten Text Recognition (HTR) is an open problem at the intersection of
Computer Vision and Natural Language Processing. The main challenges, when
dealing with historical manuscripts, are due to the preservation of the paper
support, the variability of the handwriting -- even of the same author over a
wide time-span -- and the scarcity of data from ancient, poorly represented
languages. With the aim of fostering the research on this topic, in this paper
we present the Ludovico Antonio Muratori (LAM) dataset, a large line-level HTR
dataset of Italian ancient manuscripts edited by a single author over 60 years.
The dataset comes in two configurations: a basic splitting and a date-based
splitting which takes into account the age of the author. The first setting is
intended to study HTR on ancient documents in Italian, while the second focuses
on the ability of HTR systems to recognize text written by the same writer in
time periods for which training data are not available. For both
configurations, we analyze quantitative and qualitative characteristics, also
with respect to other line-level HTR benchmarks, and present the recognition
performance of state-of-the-art HTR architectures. The dataset is available for
download at \url{https://aimagelab.ing.unimore.it/go/lam}.
Related papers
- Boosting Punctuation Restoration with Data Generation and Reinforcement
Learning [70.26450819702728]
Punctuation restoration is an important task in automatic speech recognition (ASR)
The discrepancy between written punctuated texts and ASR texts limits the usability of written texts in training punctuation restoration systems for ASR texts.
This paper proposes a reinforcement learning method to exploit in-topic written texts and recent advances in large pre-trained generative language models to bridge this gap.
arXiv Detail & Related papers (2023-07-24T17:22:04Z) - How to Choose Pretrained Handwriting Recognition Models for Single
Writer Fine-Tuning [23.274139396706264]
Recent advancements in Deep Learning-based Handwritten Text Recognition (HTR) have led to models with remarkable performance on modern and historical manuscripts.
Those models struggle to obtain the same performance when applied to manuscripts with peculiar characteristics, such as language, paper support, ink, and author handwriting.
In this paper, we take into account large, real benchmark datasets and synthetic ones obtained with a styled Handwritten Text Generation model.
We give a quantitative indication of the most relevant characteristics of such data for obtaining an HTR model able to effectively transcribe manuscripts in small collections with as little as five real fine-tuning lines
arXiv Detail & Related papers (2023-05-04T07:00:28Z) - CiteBench: A benchmark for Scientific Citation Text Generation [69.37571393032026]
CiteBench is a benchmark for citation text generation.
We make the code for CiteBench publicly available at https://github.com/UKPLab/citebench.
arXiv Detail & Related papers (2022-12-19T16:10:56Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - Boosting Modern and Historical Handwritten Text Recognition with
Deformable Convolutions [52.250269529057014]
Handwritten Text Recognition (HTR) in free-volution pages is a challenging image understanding task.
We propose to adopt deformable convolutions, which can deform depending on the input at hand and better adapt to the geometric variations of the text.
arXiv Detail & Related papers (2022-08-17T06:55:54Z) - Continuous Offline Handwriting Recognition using Deep Learning Models [0.0]
Handwritten text recognition is an open problem of great interest in the area of automatic document image analysis.
We have proposed a new recognition model based on integrating two types of deep learning architectures: convolutional neural networks (CNN) and sequence-to-sequence (seq2seq)
The new proposed model provides competitive results with those obtained with other well-established methodologies.
arXiv Detail & Related papers (2021-12-26T07:31:03Z) - StackMix and Blot Augmentations for Handwritten Text Recognition [0.0]
The paper describes the architecture of the neural net-work and two ways of increasing the volume of train-ing data.
StackMix can also be applied to the standalone task of gen-erating handwritten text based on printed text.
arXiv Detail & Related papers (2021-08-26T09:28:22Z) - One-shot Compositional Data Generation for Low Resource Handwritten Text
Recognition [10.473427493876422]
Low resource Handwritten Text Recognition is a hard problem due to the scarce annotated data and the very limited linguistic information.
In this paper we address this problem through a data generation technique based on Bayesian Program Learning.
Contrary to traditional generation approaches, which require a huge amount of annotated images, our method is able to generate human-like handwriting using only one sample of each symbol from the desired alphabet.
arXiv Detail & Related papers (2021-05-11T18:53:01Z) - Handwriting Classification for the Analysis of Art-Historical Documents [6.918282834668529]
We focus on the analysis of handwriting in scanned documents from the art-historic archive of the WPI.
We propose a handwriting classification model that labels extracted text fragments based on their visual structure.
arXiv Detail & Related papers (2020-11-04T13:06:46Z) - Learning to Select Bi-Aspect Information for Document-Scale Text Content
Manipulation [50.01708049531156]
We focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer.
In detail, the input is a set of structured records and a reference text for describing another recordset.
The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference.
arXiv Detail & Related papers (2020-02-24T12:52:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.