Handwriting Recognition with Novelty
- URL: http://arxiv.org/abs/2105.06582v1
- Date: Thu, 13 May 2021 23:01:07 GMT
- Title: Handwriting Recognition with Novelty
- Authors: Derek S. Prijatelj (1), Samuel Grieggs (1), Futoshi Yumoto (2), Eric
Robertson (2), Walter J. Scheirer (1) ((1) University of Notre Dame, (2) PAR
Government)
- Abstract summary: In handwritten documents, novelty can be a change in writer, character attributes, writing attributes, or overall document appearance.
This paper formalizes the domain of handwriting recognition with novelty, describes a baseline agent, introduces an evaluation protocol with benchmark data, and provides experimentation to set the state-of-the-art.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces an agent-centric approach to handle novelty in the
visual recognition domain of handwriting recognition (HWR). An ideal
transcription agent would rival or surpass human perception, being able to
recognize known and new characters in an image, and detect any stylistic
changes that may occur within or across documents. A key confound is the
presence of novelty, which has continued to stymie even the best machine
learning-based algorithms for these tasks. In handwritten documents, novelty
can be a change in writer, character attributes, writing attributes, or overall
document appearance, among other things. Instead of looking at each aspect
independently, we suggest that an integrated agent that can process known
characters and novelties simultaneously is a better strategy. This paper
formalizes the domain of handwriting recognition with novelty, describes a
baseline agent, introduces an evaluation protocol with benchmark data, and
provides experimentation to set the state-of-the-art. Results show feasibility
for the agent-centric approach, but more work is needed to approach
human-levels of reading ability, giving the HWR community a formal basis to
build upon as they solve this challenging problem.
Related papers
- Contrastive Masked Autoencoders for Character-Level Open-Set Writer Identification [25.996617568144675]
This paper introduces Contrastive Masked Auto-Encoders (CMAE) for Character-level Open-Set Writer Identification.
We merge Masked Auto-Encoders (MAE) with Contrastive Learning (CL) to simultaneously and respectively capture sequential information and distinguish diverse handwriting styles.
Our model achieves state-of-the-art results on the CASIA online handwriting dataset, reaching an impressive precision rate of 89.7%.
arXiv Detail & Related papers (2025-01-21T05:15:10Z) - UNIT: Unifying Image and Text Recognition in One Vision Encoder [51.140564856352825]
UNIT is a novel training framework aimed at UNifying Image and Text recognition within a single model.
We show that UNIT significantly outperforms existing methods on document-related tasks.
Notably, UNIT retains the original vision encoder architecture, making it cost-free in terms of inference and deployment.
arXiv Detail & Related papers (2024-09-06T08:02:43Z) - Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation [67.89838237013078]
Named entity recognition (NER) models often struggle with noisy inputs.
We propose a more realistic setting in which only noisy text and its NER labels are available.
We employ a multi-view training framework that improves robust NER without retrieving text during inference.
arXiv Detail & Related papers (2024-07-26T07:30:41Z) - Representing Online Handwriting for Recognition in Large Vision-Language
Models [8.344510330567495]
We propose a novel tokenized representation of digital ink (online handwriting) that includes both a time-ordered sequence of strokes as text, and as image.
We show that this representation yields results comparable to or better than state-of-the-art online handwriting recognizers.
arXiv Detail & Related papers (2024-02-23T13:11:10Z) - UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for
Evaluating Unconstrained Handwriting Image Recognition in Vietnamese [2.8360662552057323]
In Vietnamese, besides the modern Latin characters, there are accent and letter marks together with characters that draw confusion to state-of-the-art handwriting recognition methods.
As a low-resource language, there are not many datasets for researching handwriting recognition in Vietnamese.
Recent works evaluated offline handwriting recognition methods in Vietnamese using images from an online handwriting dataset constructed by connecting pen stroke coordinates without further processing.
This paper proposes the Transferring method to construct a handwriting image dataset that associates crucial natural attributes required for offline handwriting images.
arXiv Detail & Related papers (2022-11-10T08:23:54Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - Boosting Modern and Historical Handwritten Text Recognition with
Deformable Convolutions [52.250269529057014]
Handwritten Text Recognition (HTR) in free-volution pages is a challenging image understanding task.
We propose to adopt deformable convolutions, which can deform depending on the input at hand and better adapt to the geometric variations of the text.
arXiv Detail & Related papers (2022-08-17T06:55:54Z) - Continuous Offline Handwriting Recognition using Deep Learning Models [0.0]
Handwritten text recognition is an open problem of great interest in the area of automatic document image analysis.
We have proposed a new recognition model based on integrating two types of deep learning architectures: convolutional neural networks (CNN) and sequence-to-sequence (seq2seq)
The new proposed model provides competitive results with those obtained with other well-established methodologies.
arXiv Detail & Related papers (2021-12-26T07:31:03Z) - Letter-level Online Writer Identification [86.13203975836556]
We focus on a novel problem, letter-level online writer-id, which requires only a few trajectories of written letters as identification cues.
A main challenge is that a person often writes a letter in different styles from time to time.
We refer to this problem as the variance of online writing styles (Var-O-Styles)
arXiv Detail & Related papers (2021-12-06T07:21:53Z) - SmartPatch: Improving Handwritten Word Imitation with Patch
Discriminators [67.54204685189255]
We propose SmartPatch, a new technique increasing the performance of current state-of-the-art methods.
We combine the well-known patch loss with information gathered from the parallel trained handwritten text recognition system.
This leads to a more enhanced local discriminator and results in more realistic and higher-quality generated handwritten words.
arXiv Detail & Related papers (2021-05-21T18:34:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.