Optical Script Identification for multi-lingual Indic-script
- URL: http://arxiv.org/abs/2308.05780v1
- Date: Thu, 10 Aug 2023 14:02:05 GMT
- Title: Optical Script Identification for multi-lingual Indic-script
- Authors: Sidhantha Poddar and Rohan Gupta
- Abstract summary: The aim of this article is to discuss the advancement in the techniques for script pre-processing and text recognition.
In India there are twelve prominent Indic scripts, unlike the English language, these scripts have layers of characteristics.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Script identification and text recognition are some of the major domains in
the application of Artificial Intelligence. In this era of digitalization, the
use of digital note-taking has become a common practice. Still, conventional
methods of using pen and paper is a prominent way of writing. This leads to the
classification of scripts based on the method they are obtained. A survey on
the current methodologies and state-of-art methods used for processing and
identification would prove beneficial for researchers. The aim of this article
is to discuss the advancement in the techniques for script pre-processing and
text recognition. In India there are twelve prominent Indic scripts, unlike the
English language, these scripts have layers of characteristics. Complex
characteristics such as similarity in text shape make them difficult to
recognize and analyze, thus this requires advance preprocessing methods for
their accurate recognition. A sincere attempt is made in this survey to provide
a comparison between all algorithms. We hope that this survey would provide
insight to a researcher working not only on Indic scripts but also other
languages.
Related papers
- Efficiently Leveraging Linguistic Priors for Scene Text Spotting [63.22351047545888]
This paper proposes a method that leverages linguistic knowledge from a large text corpus to replace the traditional one-hot encoding used in auto-regressive scene text spotting and recognition models.
We generate text distributions that align well with scene text datasets, removing the need for in-domain fine-tuning.
Experimental results show that our method not only improves recognition accuracy but also enables more accurate localization of words.
arXiv Detail & Related papers (2024-02-27T01:57:09Z) - Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through
Image-IDS Aligning [61.34060587461462]
We propose a two-stage framework for Chinese Text Recognition (CTR)
We pre-train a CLIP-like model through aligning printed character images and Ideographic Description Sequences (IDS)
This pre-training stage simulates humans recognizing Chinese characters and obtains the canonical representation of each character.
The learned representations are employed to supervise the CTR model, such that traditional single-character recognition can be improved to text-line recognition.
arXiv Detail & Related papers (2023-09-03T05:33:16Z) - Separate Scene Text Detector for Unseen Scripts is Not All You Need [12.848024214330234]
In the last decade, some scripts have gained the attention of the research community and achieved good detection performance.
Many scripts are low-resourced for training deep learning-based scene text detectors.
It raises a critical question: Is there a need for separate training for new scripts?
This paper acknowledges this problem and proposes a solution to detect scripts not present during training.
arXiv Detail & Related papers (2023-07-29T14:03:05Z) - DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting [112.45423990924283]
DeepSolo++ is a simple DETR-like baseline that lets a single decoder with explicit points solo for text detection, recognition, and script identification simultaneously.
Our method not only performs well in English scenes but also masters the transcription with complex font structure and a thousand-level character classes, such as Chinese.
arXiv Detail & Related papers (2023-05-31T15:44:00Z) - CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition [16.987008461171065]
We explore the potential of continual self-supervised learning to alleviate the catastrophic forgetting problem in handwritten text recognition.
Our method consists in adding intermediate layers called adapters for each task, and efficiently distilling knowledge from the previous model while learning the current task.
We attain state-of-the-art performance on English, Italian and Russian scripts, whilst adding only a few parameters per task.
arXiv Detail & Related papers (2023-03-16T14:27:45Z) - Writer Retrieval and Writer Identification in Greek Papyri [4.44566870214758]
Writer identification refers to the classification of known writers while writer retrieval seeks to find the writer by means of image similarity in a dataset of images.
While automatic writer identification/retrieval methods already provide promising results for many historical document types, papyri data is very challenging due to the fiber structures and severe artifacts.
We investigate several methods and show that a good binarization is key to an improved writer identification in papyri writings.
arXiv Detail & Related papers (2022-12-15T08:42:25Z) - Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or
Something Else? [93.91375268580806]
Both politics and pandemics have recently provided ample motivation for the development of machine learning-enabled disinformation (a.k.a. fake news) detection algorithms.
Existing literature has focused primarily on the fully-automated case, but the resulting techniques cannot reliably detect disinformation on the varied topics, sources, and time scales required for military applications.
By leveraging an already-available analyst as a human-in-the-loop, canonical machine learning techniques of sentiment analysis, aspect-based sentiment analysis, and stance detection become plausible methods to use for a partially-automated disinformation detection system.
arXiv Detail & Related papers (2021-11-09T13:30:34Z) - Handwritten Script Identification from Text Lines [38.1188690493442]
We propose a robust method towards identifying scripts from handwritten documents at text line-level.
The recognition is based upon features extracted using Chain Code Histogram (CCH) and Discrete Fourier Transform (DFT)
The proposed method is experimented on 800 handwritten text lines written in seven Indic scripts namely, Gujarati, Kannada, Malayalam, Oriya, Tamil, Telugu, Urdu along with Roman script.
arXiv Detail & Related papers (2020-09-16T02:43:24Z) - Screenplay Quality Assessment: Can We Predict Who Gets Nominated? [53.9153892362629]
We present a method to evaluate the quality of a screenplay based on linguistic cues.
Based on industry opinions and narratology, we extract and integrate domain-specific features into common classification techniques.
arXiv Detail & Related papers (2020-05-13T02:39:56Z) - TextScanner: Reading Characters in Order for Robust Scene Text
Recognition [60.04267660533966]
TextScanner is an alternative approach for scene text recognition.
It generates pixel-wise, multi-channel segmentation maps for character class, position and order.
It also adopts RNN for context modeling and performs paralleled prediction for character position and class.
arXiv Detail & Related papers (2019-12-28T07:52:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.