OCR quality affects perceived usefulness of historical newspaper
clippings -- a user study
- URL: http://arxiv.org/abs/2203.03557v1
- Date: Fri, 4 Mar 2022 11:49:54 GMT
- Title: OCR quality affects perceived usefulness of historical newspaper
clippings -- a user study
- Authors: Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula
P\"a\"akk\"onen and Juha Rautiainen
- Abstract summary: The effects of Optical Character Recognition (OCR) quality are studied in a user-oriented information retrieval setting.
The main result of the study is that improved optical character recognition quality affects perceived usefulness of historical newspaper articles significantly.
- Score: 0.6299766708197884
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Effects of Optical Character Recognition (OCR) quality on historical
information retrieval have so far been studied in data-oriented scenarios
regarding the effectiveness of retrieval results. Such studies have either
focused on the effects of artificially degraded OCR quality (see, e.g., [1-2])
or utilized test collections containing texts based on authentic low quality
OCR data (see, e.g., [3]). In this paper the effects of OCR quality are studied
in a user-oriented information retrieval setting. Thirty-two users evaluated
subjectively query results of six topics each (out of 30 topics) based on
pre-formulated queries using a simulated work task setting. To the best of our
knowledge our simulated work task experiment is the first one showing
empirically that users' subjective relevance assessments of retrieved documents
are affected by a change in the quality of optically read text. Users of
historical newspaper collections have so far commented effects of OCR'ed data
quality mainly in impressionistic ways, and controlled user environments for
studying effects of OCR quality on users' relevance assessments of the
retrieval results have so far been missing. To remedy this The National Library
of Finland (NLF) set up an experimental query environment for the contents of
one Finnish historical newspaper, Uusi Suometar 1869-1918, to be able to
compare users' evaluation of search results of two different OCR qualities for
digitized newspaper articles. The query interface was able to present the same
underlying document for the user based on two alternatives: either based on the
lower OCR quality, or based on the higher OCR quality, and the choice was
randomized. The users did not know about quality differences in the article
texts they evaluated. The main result of the study is that improved optical
character recognition quality affects perceived usefulness of historical
newspaper articles significantly. The mean average evaluation score for the
improved OCR results was 7.94% higher than the mean average evaluation score of
the old OCR results.
Related papers
- Multi-Facet Counterfactual Learning for Content Quality Evaluation [48.73583736357489]
We propose a framework for efficiently constructing evaluators that perceive multiple facets of content quality evaluation.
We leverage a joint training strategy based on contrastive learning and supervised learning to enable the evaluator to distinguish between different quality facets.
arXiv Detail & Related papers (2024-10-10T08:04:10Z) - CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models [0.0]
This paper introduces Context Leveraging OCR Correction (CLOCR-C)
It uses the infilling and context-adaptive abilities of transformer-based language models (LMs) to improve OCR quality.
The study aims to determine if LMs can perform post-OCR correction, improve downstream NLP tasks, and the value of providing socio-cultural context as part of the correction process.
arXiv Detail & Related papers (2024-08-30T17:26:05Z) - Mashee at SemEval-2024 Task 8: The Impact of Samples Quality on the Performance of In-Context Learning for Machine Text Classification [0.0]
We employ the chi-square test to identify high-quality samples and compare the results with those obtained using low-quality samples.
Our findings demonstrate that utilizing high-quality samples leads to improved performance with respect to all evaluated metrics.
arXiv Detail & Related papers (2024-05-28T12:47:43Z) - CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation [87.44350003888646]
Eval-Instruct can acquire pointwise grading critiques with pseudo references and revise these critiques via multi-path prompting.
CritiqueLLM is empirically shown to outperform ChatGPT and all the open-source baselines.
arXiv Detail & Related papers (2023-11-30T16:52:42Z) - A Pretrainer's Guide to Training Data: Measuring the Effects of Data
Age, Domain Coverage, Quality, & Toxicity [84.6421260559093]
This study is the largest set of experiments to validate, quantify, and expose undocumented intuitions about text pretraining.
Our findings indicate there does not exist a one-size-fits-all solution to filtering training data.
arXiv Detail & Related papers (2023-05-22T15:57:53Z) - Exploring the Use of Large Language Models for Reference-Free Text
Quality Evaluation: An Empirical Study [63.27346930921658]
ChatGPT is capable of evaluating text quality effectively from various perspectives without reference.
The Explicit Score, which utilizes ChatGPT to generate a numeric score measuring text quality, is the most effective and reliable method among the three exploited approaches.
arXiv Detail & Related papers (2023-04-03T05:29:58Z) - User-Centric Evaluation of OCR Systems for Kwak'wala [92.73847703011353]
We show that utilizing OCR reduces the time spent in the manual transcription of culturally valuable documents by over 50%.
Our results demonstrate the potential benefits that OCR tools can have on downstream language documentation and revitalization efforts.
arXiv Detail & Related papers (2023-02-26T21:41:15Z) - Optical character recognition quality affects perceived usefulness of
historical newspaper clippings [0.6299766708197884]
Thirty-two users made searches to an article collection of Finnish newspaper Uusi Suometar 1869-1918.
Our article search database had two versions of each article with different quality optical character recognition.
arXiv Detail & Related papers (2022-06-01T10:07:50Z) - Ranking Scientific Papers Using Preference Learning [48.78161994501516]
We cast it as a paper ranking problem based on peer review texts and reviewer scores.
We introduce a novel, multi-faceted generic evaluation framework for making final decisions based on peer reviews.
arXiv Detail & Related papers (2021-09-02T19:41:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.