Advancements and Challenges in Arabic Optical Character Recognition: A
Comprehensive Survey
- URL: http://arxiv.org/abs/2312.11812v1
- Date: Tue, 19 Dec 2023 03:01:31 GMT
- Title: Advancements and Challenges in Arabic Optical Character Recognition: A
Comprehensive Survey
- Authors: Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Hyun-Soo Kang
- Abstract summary: This paper seeks to offer an exhaustive review of contemporary applications, methodologies, and challenges associated with Arabic Optical Character Recognition (OCR)
A thorough analysis is conducted on prevailing techniques utilized throughout the OCR process, with a dedicated effort to discern the most efficacious approaches that demonstrate enhanced outcomes.
In addition to presenting cutting-edge techniques and methods, this paper critically identifies research gaps within the realm of Arabic OCR.
- Score: 0.6629765271909505
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Optical character recognition (OCR) is a vital process that involves the
extraction of handwritten or printed text from scanned or printed images,
converting it into a format that can be understood and processed by machines.
This enables further data processing activities such as searching and editing.
The automatic extraction of text through OCR plays a crucial role in digitizing
documents, enhancing productivity, improving accessibility, and preserving
historical records. This paper seeks to offer an exhaustive review of
contemporary applications, methodologies, and challenges associated with Arabic
Optical Character Recognition (OCR). A thorough analysis is conducted on
prevailing techniques utilized throughout the OCR process, with a dedicated
effort to discern the most efficacious approaches that demonstrate enhanced
outcomes. To ensure a thorough evaluation, a meticulous keyword-search
methodology is adopted, encompassing a comprehensive analysis of articles
relevant to Arabic OCR, including both backward and forward citation reviews.
In addition to presenting cutting-edge techniques and methods, this paper
critically identifies research gaps within the realm of Arabic OCR. By
highlighting these gaps, we shed light on potential areas for future
exploration and development, thereby guiding researchers toward promising
avenues in the field of Arabic OCR. The outcomes of this study provide valuable
insights for researchers, practitioners, and stakeholders involved in Arabic
OCR, ultimately fostering advancements in the field and facilitating the
creation of more accurate and efficient OCR systems for the Arabic language.
Related papers
- A comprehensive survey of oracle character recognition: challenges, benchmarks, and beyond [39.542007188857376]
Oracle character recognition has become a pivotal field intersecting archaeology, paleography, and historical cultural studies.
Traditional methods of oracle character recognition have relied heavily on manual interpretation by experts.
With recent breakthroughs in pattern recognition and deep learning, there is a growing movement towards the automation of oracle character recognition.
arXiv Detail & Related papers (2024-11-18T07:50:22Z) - Image Based Character Recognition, Documentation System To Decode Inscription From Temple [0.0]
This project undertakes the training and analysis of optical character recognition OCR methods applied to 10th century ancient Tamil inscriptions discovered on the walls of the Brihadeeswarar Temple.
The chosen OCR methods include Tesseract,a widely used OCR engine,using modern ICR techniques to pre process the raw data and a box editing software to finetune our model.
The analysis with Tesseract aims to evaluate their effectiveness in accurately deciphering the nuances of the ancient Tamil characters.
arXiv Detail & Related papers (2024-05-21T17:20:35Z) - Towards Possibilities & Impossibilities of AI-generated Text Detection:
A Survey [97.33926242130732]
Large Language Models (LLMs) have revolutionized the domain of natural language processing (NLP) with remarkable capabilities of generating human-like text responses.
Despite these advancements, several works in the existing literature have raised serious concerns about the potential misuse of LLMs.
To address these concerns, a consensus among the research community is to develop algorithmic solutions to detect AI-generated text.
arXiv Detail & Related papers (2023-10-23T18:11:32Z) - OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models [122.27878464009181]
We conducted a comprehensive evaluation of Large Multimodal Models, such as GPT4V and Gemini, in various text-related visual tasks.
OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available.
arXiv Detail & Related papers (2023-05-13T11:28:37Z) - User-Centric Evaluation of OCR Systems for Kwak'wala [92.73847703011353]
We show that utilizing OCR reduces the time spent in the manual transcription of culturally valuable documents by over 50%.
Our results demonstrate the potential benefits that OCR tools can have on downstream language documentation and revitalization efforts.
arXiv Detail & Related papers (2023-02-26T21:41:15Z) - Review of coreference resolution in English and Persian [8.604145658574689]
Coreference resolution (CR) identifies expressions referring to the same real-world entity.
This paper explores the latest advancements in CR, spanning coreference and anaphora resolution.
Recognizing the unique challenges of Persian CR, we dedicate a focused analysis to this under-resourced language.
arXiv Detail & Related papers (2022-11-08T18:14:09Z) - MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and
Understanding [70.16678926775475]
MMOCR is an open-source toolbox for text detection and recognition.
It implements 14 state-of-the-art algorithms, which is more than all the existing open-source OCR projects we are aware of to date.
arXiv Detail & Related papers (2021-08-14T14:10:23Z) - ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction [70.71240097723745]
In recognition of the technical challenges, importance and huge commercial potentials of SROIE, we organized the ICDAR 2019 competition on SROIE.
A new dataset with 1000 whole scanned receipt images and annotations is created for the competition.
In this report we will presents the motivation, competition datasets, task definition, evaluation protocol, submission statistics, performance of submitted methods and results analysis.
arXiv Detail & Related papers (2021-03-18T12:33:41Z) - A Survey of Deep Learning Approaches for OCR and Document Understanding [68.65995739708525]
We review different techniques for document understanding for documents written in English.
We consolidate methodologies present in literature to act as a jumping-off point for researchers exploring this area.
arXiv Detail & Related papers (2020-11-27T03:05:59Z) - Handwritten Optical Character Recognition (OCR): A Comprehensive
Systematic Literature Review (SLR) [0.0]
This review article serves the purpose of presenting state of the art results and techniques on OCR.
Optical character recognition is a science that enables to translate various types of documents or images into analyzable, editable and searchable data.
arXiv Detail & Related papers (2020-01-01T04:55:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.