iOCR: Informed Optical Character Recognition for Election Ballot Tallies
- URL: http://arxiv.org/abs/2208.00865v1
- Date: Mon, 1 Aug 2022 13:50:13 GMT
- Title: iOCR: Informed Optical Character Recognition for Election Ballot Tallies
- Authors: Kenneth U. Oyibo, Jean D. Louis, and Juan E. Gilbert
- Abstract summary: iOCR was developed with a spell correction algorithm to fix errors introduced by conventional OCR for vote tabulation.
The results found that the iOCR system outperforms conventional OCR techniques.
- Score: 13.343515845758398
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: The purpose of this study is to explore the performance of Informed OCR or
iOCR. iOCR was developed with a spell correction algorithm to fix errors
introduced by conventional OCR for vote tabulation. The results found that the
iOCR system outperforms conventional OCR techniques.
Related papers
- Confidence-Aware Document OCR Error Detection [1.003485566379789]
We analyze the correlation between confidence scores and error rates across different OCR systems.
We develop ConfBERT, a BERT-based model that incorporates OCR confidence scores into token embeddings.
arXiv Detail & Related papers (2024-09-06T08:35:28Z) - Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter [57.64003871384959]
This work presents a new approach to fast context-biasing with CTC-based Word Spotter.
The proposed method matches CTC log-probabilities against a compact context graph to detect potential context-biasing candidates.
The results demonstrate a significant acceleration of the context-biasing recognition with a simultaneous improvement in F-score and WER.
arXiv Detail & Related papers (2024-06-11T09:37:52Z) - Oracle Character Recognition using Unsupervised Discriminative
Consistency Network [65.64172835624206]
We propose a novel unsupervised domain adaptation method for oracle character recognition (OrCR)
We leverage pseudo-labeling to incorporate the semantic information into adaptation and constrain augmentation consistency.
Our approach achieves state-of-the-art result on Oracle-241 dataset and substantially outperforms the recently proposed structure-texture separation network by 15.1%.
arXiv Detail & Related papers (2023-12-11T02:52:27Z) - Cross-modal Active Complementary Learning with Self-refining
Correspondence [54.61307946222386]
We propose a Cross-modal Robust Complementary Learning framework (CRCL) to improve the robustness of existing methods.
ACL exploits active and complementary learning losses to reduce the risk of providing erroneous supervision.
SCC utilizes multiple self-refining processes with momentum correction to enlarge the receptive field for correcting correspondences.
arXiv Detail & Related papers (2023-10-26T15:15:11Z) - Enhancing OCR Performance through Post-OCR Models: Adopting Glyph
Embedding for Improved Correction [0.0]
The novelty of our approach lies in embedding the OCR output using CharBERT and our unique embedding technique, capturing the visual characteristics of characters.
Our findings show that post-OCR correction effectively addresses deficiencies in inferior OCR models, and glyph embedding enables the model to achieve superior results.
arXiv Detail & Related papers (2023-08-29T12:41:50Z) - Bayesian Inverse Contextual Reasoning for Heterogeneous Semantics-Native
Communication [47.9462619619438]
When agents do not share the same communication context, the effectiveness of contextual reasoning is compromised.
This article proposes a novel framework for solving the inverse problem of CR in SNC using two Bayesian inference methods.
arXiv Detail & Related papers (2023-06-10T10:10:55Z) - User-Centric Evaluation of OCR Systems for Kwak'wala [92.73847703011353]
We show that utilizing OCR reduces the time spent in the manual transcription of culturally valuable documents by over 50%.
Our results demonstrate the potential benefits that OCR tools can have on downstream language documentation and revitalization efforts.
arXiv Detail & Related papers (2023-02-26T21:41:15Z) - An Evaluation of OCR on Egocentric Data [30.637021477342035]
In this paper, we evaluate state-of-the-art OCR methods on Egocentric data.
We demonstrate that existing OCR methods struggle with rotated text, which is frequently observed on objects being handled.
We introduce a simple rotate-and-merge procedure which can be applied to pre-trained OCR models that halves the normalized edit distance error.
arXiv Detail & Related papers (2022-06-11T10:37:20Z) - Neural Model Reprogramming with Similarity Based Mapping for
Low-Resource Spoken Command Recognition [71.96870151495536]
We propose a novel adversarial reprogramming (AR) approach for low-resource spoken command recognition (SCR)
The AR procedure aims to modify the acoustic signals (from the target domain) to repurpose a pretrained SCR model.
We evaluate the proposed AR-SCR system on three low-resource SCR datasets, including Arabic, Lithuanian, and dysarthric Mandarin speech.
arXiv Detail & Related papers (2021-10-08T05:07:35Z) - Rerunning OCR: A Machine Learning Approach to Quality Assessment and
Enhancement Prediction [0.0]
Iterating with new and improved OCR solutions enforces decisions to be taken when it comes to targeting the right reprocessing candidates.
This article captures the efforts of the National Library of Luxembourg to support those exact decisions.
arXiv Detail & Related papers (2021-10-04T18:52:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.