HCR-Net: A deep learning based script independent handwritten character
recognition network
- URL: http://arxiv.org/abs/2108.06663v4
- Date: Sat, 17 Feb 2024 15:35:50 GMT
- Title: HCR-Net: A deep learning based script independent handwritten character
recognition network
- Authors: Vinod Kumar Chauhan, Sukhdeep Singh and Anuj Sharma
- Abstract summary: Handwritten character recognition (HCR) remains a challenging pattern recognition problem despite decades of research.
We have proposed a script independent deep learning network for HCR research, called HCR-Net, that sets a new research direction for the field.
- Score: 5.8067395321424975
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Handwritten character recognition (HCR) remains a challenging pattern
recognition problem despite decades of research, and lacks research on script
independent recognition techniques. {\color{black}This is mainly because of
similar character structures, different handwriting styles, diverse scripts,
handcrafted feature extraction techniques, unavailability of data and code, and
the development of script-specific deep learning techniques. To address these
limitations, we have proposed a script independent deep learning network for
HCR research, called HCR-Net, that sets a new research direction for the field.
HCR-Net is based on a novel transfer learning approach for HCR, which
\textit{partly utilizes} feature extraction layers of a pre-trained network.}
Due to transfer learning and image augmentation, HCR-Net provides faster and
computationally efficient training, better performance and generalizations, and
can work with small datasets. HCR-Net is extensively evaluated on 40 publicly
available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi,
Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages, and
established 26 new benchmark results while performed close to the best results
in the rest cases. HCR-Net showed performance improvements up to 11\% against
the existing results and achieved a fast convergence rate showing up to 99\% of
final performance in the very first epoch. HCR-Net significantly outperformed
the state-of-the-art transfer learning techniques and also reduced the number
of trainable parameters by 34\% as compared with the corresponding pre-trained
network. To facilitate reproducibility and further advancements of HCR
research, the complete code is publicly released at
\url{https://github.com/jmdvinodjmd/HCR-Net}.
Related papers
- Classification of Non-native Handwritten Characters Using Convolutional Neural Network [0.0]
The classification of English characters written by non-native users is performed by proposing a custom-tailored CNN model.
We train this CNN with a new dataset called the handwritten isolated English character dataset.
The proposed model with five convolutional layers and one hidden layer outperforms state-of-the-art models in terms of character recognition accuracy.
arXiv Detail & Related papers (2024-06-06T21:08:07Z) - Retrieval is Accurate Generation [99.24267226311157]
We introduce a novel method that selects context-aware phrases from a collection of supporting documents.
Our model achieves the best performance and the lowest latency among several retrieval-augmented baselines.
arXiv Detail & Related papers (2024-02-27T14:16:19Z) - Optimizing the Neural Network Training for OCR Error Correction of
Historical Hebrew Texts [0.934612743192798]
This paper proposes an innovative method for training a light-weight neural network for Hebrew OCR post-correction using significantly less manually created data.
An analysis of historical OCRed newspapers was done to learn common language and corpus-specific OCR errors.
arXiv Detail & Related papers (2023-07-30T12:59:06Z) - OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models [122.27878464009181]
We conducted a comprehensive evaluation of Large Multimodal Models, such as GPT4V and Gemini, in various text-related visual tasks.
OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available.
arXiv Detail & Related papers (2023-05-13T11:28:37Z) - Uncovering the Handwritten Text in the Margins: End-to-end Handwritten
Text Detection and Recognition [0.840835093659811]
This work presents an end-to-end framework for automatic detection and recognition of handwritten marginalia.
It uses data augmentation and transfer learning to overcome training data scarcity.
The effectiveness of the proposed framework has been empirically evaluated on the data from early book collections found in the Uppsala University Library in Sweden.
arXiv Detail & Related papers (2023-03-10T14:00:53Z) - Lexically Aware Semi-Supervised Learning for OCR Post-Correction [90.54336622024299]
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents.
Previous work has demonstrated the utility of neural post-correction methods on recognition of less-well-resourced languages.
We present a semi-supervised learning method that makes it possible to utilize raw images to improve performance.
arXiv Detail & Related papers (2021-11-04T04:39:02Z) - PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering
Network [54.03560668182197]
We propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time.
With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations.
Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed.
arXiv Detail & Related papers (2021-04-12T13:27:34Z) - Be More with Less: Hypergraph Attention Networks for Inductive Text
Classification [56.98218530073927]
Graph neural networks (GNNs) have received increasing attention in the research community and demonstrated their promising results on this canonical task.
Despite the success, their performance could be largely jeopardized in practice since they are unable to capture high-order interaction between words.
We propose a principled model -- hypergraph attention networks (HyperGAT) which can obtain more expressive power with less computational consumption for text representation learning.
arXiv Detail & Related papers (2020-11-01T00:21:59Z) - Offline Handwritten Chinese Text Recognition with Convolutional Neural
Networks [5.984124397831814]
In this paper, we build the models using only the convolutional neural networks and use CTC as the loss function.
We achieve 6.81% character error rate (CER) on the ICDAR 2013 competition set, which is the best published result without language model correction.
arXiv Detail & Related papers (2020-06-28T14:34:38Z) - Towards Accurate Scene Text Recognition with Semantic Reasoning Networks [52.86058031919856]
We propose a novel end-to-end trainable framework named semantic reasoning network (SRN) for accurate scene text recognition.
GSRM is introduced to capture global semantic context through multi-way parallel transmission.
Results on 7 public benchmarks, including regular text, irregular text and non-Latin long text, verify the effectiveness and robustness of the proposed method.
arXiv Detail & Related papers (2020-03-27T09:19:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.