A Hybrid Swarm and Gravitation based feature selection algorithm for
Handwritten Indic Script Classification problem
- URL: http://arxiv.org/abs/2005.04596v1
- Date: Sun, 10 May 2020 07:27:55 GMT
- Title: A Hybrid Swarm and Gravitation based feature selection algorithm for
Handwritten Indic Script Classification problem
- Authors: Ritam Guha, Manosij Ghosh, Pawan Kumar Singh, Ram Sarkar, Mita
Nasipuri
- Abstract summary: We introduce a new FS algorithm, called Hybrid Swarm and Gravitation based FS (HSGFS)
This algorithm is made to run on 3 feature vectors introduced in the literature recently - Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG) and Modified log-Gabor (MLG) filter Transform.
Handwritten datasets, prepared at block, text-line and word level, consisting of officially recognized 12 Indic scripts are used for the evaluation of our method.
- Score: 39.11055166524374
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In any multi-script environment, handwritten script classification is of
paramount importance before the document images are fed to their respective
Optical Character Recognition (OCR) engines. Over the years, this complex
pattern classification problem has been solved by researchers proposing various
feature vectors mostly having large dimension, thereby increasing the
computation complexity of the whole classification model. Feature Selection
(FS) can serve as an intermediate step to reduce the size of the feature
vectors by restricting them only to the essential and relevant features. In our
paper, we have addressed this issue by introducing a new FS algorithm, called
Hybrid Swarm and Gravitation based FS (HSGFS). This algorithm is made to run on
3 feature vectors introduced in the literature recently - Distance-Hough
Transform (DHT), Histogram of Oriented Gradients (HOG) and Modified log-Gabor
(MLG) filter Transform. Three state-of-the-art classifiers namely, Multi-Layer
Perceptron (MLP), K-Nearest Neighbour (KNN) and Support Vector Machine (SVM)
are used for the handwritten script classification. Handwritten datasets,
prepared at block, text-line and word level, consisting of officially
recognized 12 Indic scripts are used for the evaluation of our method. An
average improvement in the range of 2-5 % is achieved in the classification
accuracies by utilizing only about 75-80 % of the original feature vectors on
all three datasets. The proposed methodology also shows better performance when
compared to some popularly used FS models.
Related papers
- Multi-Vector Index Compression in Any Modality [73.7330345057813]
Late interaction has emerged as a dominant paradigm for information retrieval in text, images, visual documents, and videos.<n>We introduce four approaches for index compression: sequence resizing, memory tokens, hierarchical pooling, and a novel attention-guided clustering (AGC)<n>AGC uses an attention-guided mechanism to identify the most semantically salient regions of a document as cluster centroids and to weight token aggregation.
arXiv Detail & Related papers (2026-02-24T18:57:33Z) - Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors [18.796661826646616]
Filtered Approximate Nearest Neighbor Search is a problem known as Filtered Approximate Nearest Neighbor Search (FANNS)<n>We present a comprehensive survey and taxonomy of FANNS methods and analyze how they are benchmarked in the literature.<n>We introduce a novel dataset consisting of embedding vectors for the abstracts of over 2.7 million research articles from the arXiv repository, accompanied by 11 real-world attributes such as authors and categories.
arXiv Detail & Related papers (2025-07-29T16:39:54Z) - An Enhanced Model-based Approach for Short Text Clustering [58.60681789677676]
Short text clustering has become increasingly important with the popularity of social media like Twitter, Google+, and Facebook.<n>Existing methods can be broadly categorized into two paradigms: topic model-based approaches and deep representation learning-based approaches.<n>We propose a collapsed Gibbs Sampling algorithm for the Dirichlet Multinomial Mixture model (GSDMM), which effectively handles the sparsity and high dimensionality of short texts.<n>Based on several aspects of GSDMM that warrant further refinement, we propose an improved approach, GSDMM+, designed to further optimize its performance.
arXiv Detail & Related papers (2025-07-18T10:07:42Z) - Improving Handwritten Text Recognition via 3D Attention and Multi-Scale Training [19.05500901000957]
We propose a new recognition network by using a novel three-dimensional (3D) attention module and global-local context information.<n>Main canonical neural units including attention mechanisms, fully-connected layers, recurrent units and convolutional layers are efficiently organized into a network.
arXiv Detail & Related papers (2024-10-24T02:33:12Z) - Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification [0.0]
Efficient categorization of historical documents is crucial for fields such as genealogy, legal research and historical scholarship.
We propose a representational learning strategy that integrates deep learning models such as ResNet, masked Image Transformer (Di), and embedding segmentation.
arXiv Detail & Related papers (2024-05-23T04:28:50Z) - Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z) - Tuning Traditional Language Processing Approaches for Pashto Text
Classification [0.0]
The main aim of this study is to establish a Pashto automatic text classification system.
This study compares several models containing both statistical and neural network machine learning techniques.
This research obtained average testing accuracy rate 94% using classification algorithm and TFIDF feature extraction method.
arXiv Detail & Related papers (2023-05-04T22:57:45Z) - Texture image analysis based on joint of multi directions GLCM and local
ternary patterns [0.0]
Texture features can be used in many different applications in commuter vision or machine learning problems.
New approach is proposed based on combination of two texture descriptor, co-occurrence matrix and local ternary patterns.
Experimental results show that proposed approach provide higher classification rate in comparison with some state-of-the-art approaches.
arXiv Detail & Related papers (2022-09-05T09:53:00Z) - Learning Implicit Feature Alignment Function for Semantic Segmentation [51.36809814890326]
Implicit Feature Alignment function (IFA) is inspired by the rapidly expanding topic of implicit neural representations.
We show that IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.
Our method can be combined with improvement on various architectures, and it achieves state-of-the-art accuracy trade-off on common benchmarks.
arXiv Detail & Related papers (2022-06-17T09:40:14Z) - Autoregressive Search Engines: Generating Substrings as Document
Identifiers [53.0729058170278]
Autoregressive language models are emerging as the de-facto standard for generating answers.
Previous work has explored ways to partition the search space into hierarchical structures.
In this work we propose an alternative that doesn't force any structure in the search space: using all ngrams in a passage as its possible identifiers.
arXiv Detail & Related papers (2022-04-22T10:45:01Z) - Exploiting Local and Global Features in Transformer-based Extreme
Multi-label Text Classification [28.28186933768281]
We propose an approach that combines both the local and global features produced by Transformer models to improve the prediction power of the classifier.
Our experiments show that the proposed model either outperforms or is comparable to the state-of-the-art methods on benchmark datasets.
arXiv Detail & Related papers (2022-04-02T19:55:23Z) - Generalized Funnelling: Ensemble Learning and Heterogeneous Document
Embeddings for Cross-Lingual Text Classification [78.83284164605473]
emphFunnelling (Fun) is a recently proposed method for cross-lingual text classification.
We describe emphGeneralized Funnelling (gFun) as a generalization of Fun.
We show that gFun substantially improves over Fun and over state-of-the-art baselines.
arXiv Detail & Related papers (2021-09-17T23:33:04Z) - MultiGBS: A multi-layer graph approach to biomedical summarization [6.11737116137921]
We propose a domain-specific method that models a document as a multi-layer graph to enable multiple features of the text to be processed at the same time.
The unsupervised method selects sentences from the multi-layer graph based on the MultiRank algorithm and the number of concepts.
The proposed MultiGBS algorithm employs UMLS and extracts the concepts and relationships using different tools such as SemRep, MetaMap, and OGER.
arXiv Detail & Related papers (2020-08-27T04:22:37Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.