Classifier Combination Approach for Question Classification for Bengali
Question Answering System
- URL: http://arxiv.org/abs/2008.13597v2
- Date: Sun, 6 Sep 2020 14:47:12 GMT
- Title: Classifier Combination Approach for Question Classification for Bengali
Question Answering System
- Authors: Somnath Banerjee, Sudip Kumar Naskar, Paolo Rosso and Sivaji
Bandyopadhyay
- Abstract summary: The work presented here demonstrates that the combination of multiple models achieve better classification performance than those obtained with existing individual models for the question classification task in Bengali.
We have exploited state-of-the-art multiple model combination techniques, i.e., ensemble, stacking and voting, to increase QC accuracy.
The approach presented here could be used in other Indo-Aryan or Indic languages to develop a question answering system.
- Score: 17.567099458403707
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Question classification (QC) is a prime constituent of automated question
answering system. The work presented here demonstrates that the combination of
multiple models achieve better classification performance than those obtained
with existing individual models for the question classification task in
Bengali. We have exploited state-of-the-art multiple model combination
techniques, i.e., ensemble, stacking and voting, to increase QC accuracy.
Lexical, syntactic and semantic features of Bengali questions are used for four
well-known classifiers, namely Na\"{\i}ve Bayes, kernel Na\"{\i}ve Bayes, Rule
Induction, and Decision Tree, which serve as our base learners. Single-layer
question-class taxonomy with 8 coarse-grained classes is extended to two-layer
taxonomy by adding 69 fine-grained classes. We carried out the experiments both
on single-layer and two-layer taxonomies. Experimental results confirmed that
classifier combination approaches outperform single classifier classification
approaches by 4.02% for coarse-grained question classes. Overall, the stacking
approach produces the best results for fine-grained classification and achieves
87.79% of accuracy. The approach presented here could be used in other
Indo-Aryan or Indic languages to develop a question answering system.
Related papers
- Multi-Label Requirements Classification with Large Taxonomies [40.588683959176116]
Multi-label requirements classification with large labels could aid requirements traceability but is prohibitively costly with supervised training.
We associated 129 requirements with 769 labels from ranging between 250 and 1183 classes.
The sentence-based classification had a significantly higher recall compared to the word-based classification.
The hierarchical classification strategy did not always improve the performance of requirements classification.
arXiv Detail & Related papers (2024-06-07T09:53:55Z) - Prompt Algebra for Task Composition [131.97623832435812]
We consider Visual Language Models with prompt tuning as our base classifier.
We propose constrained prompt tuning to improve performance of the composite classifier.
On UTZappos it improves classification accuracy over the best base model by 8.45% on average.
arXiv Detail & Related papers (2023-06-01T03:20:54Z) - Anomaly Detection using Ensemble Classification and Evidence Theory [62.997667081978825]
We present a novel approach for novel detection using ensemble classification and evidence theory.
A pool selection strategy is presented to build a solid ensemble classifier.
We use uncertainty for the anomaly detection approach.
arXiv Detail & Related papers (2022-12-23T00:50:41Z) - Parametric Classification for Generalized Category Discovery: A Baseline
Study [70.73212959385387]
Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.
We investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem.
We propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers.
arXiv Detail & Related papers (2022-11-21T18:47:11Z) - Two is Better than Many? Binary Classification as an Effective Approach
to Multi-Choice Question Answering [43.35258958775454]
We show that classifying (question, true answer) as positive instances and (question, false answer) as negative instances is significantly more effective across various models and datasets.
Our DeBERTa binary classification model reaches the top or close to the top performance on public leaderboards for different tasks.
arXiv Detail & Related papers (2022-10-29T05:11:45Z) - A Top-down Supervised Learning Approach to Hierarchical Multi-label
Classification in Networks [0.21485350418225244]
This paper presents a general prediction model to hierarchical multi-label classification (HMC), where the attributes to be inferred can be specified as a strict poset.
It is based on a top-down classification approach that addresses hierarchical multi-label classification with supervised learning by building a local classifier per class.
The proposed model is showcased with a case study on the prediction of gene functions for Oryza sativa Japonica, a variety of rice.
arXiv Detail & Related papers (2022-03-23T17:29:17Z) - Set-valued prediction in hierarchical classification with constrained
representation complexity [4.258263831866309]
We focus on hierarchical multi-class classification problems, where valid sets correspond to internal nodes of the hierarchy.
We propose three methods and evaluate them on benchmark datasets.
arXiv Detail & Related papers (2022-03-13T15:13:19Z) - Can Taxonomy Help? Improving Semantic Question Matching using Question
Taxonomy [37.57300969050908]
We propose a hybrid technique for semantic question matching.
It uses our proposed two-layered taxonomy for English questions by augmenting state-of-the-art deep learning models with question classes obtained from a deep learning based question.
arXiv Detail & Related papers (2021-01-20T16:23:04Z) - Learning and Evaluating Representations for Deep One-class
Classification [59.095144932794646]
We present a two-stage framework for deep one-class classification.
We first learn self-supervised representations from one-class data, and then build one-class classifiers on learned representations.
In experiments, we demonstrate state-of-the-art performance on visual domain one-class classification benchmarks.
arXiv Detail & Related papers (2020-11-04T23:33:41Z) - Cooperative Bi-path Metric for Few-shot Learning [50.98891758059389]
We make two contributions to investigate the few-shot classification problem.
We report a simple and effective baseline trained on base classes in the way of traditional supervised learning.
We propose a cooperative bi-path metric for classification, which leverages the correlations between base classes and novel classes to further improve the accuracy.
arXiv Detail & Related papers (2020-08-10T11:28:52Z) - A Systematic Evaluation: Fine-Grained CNN vs. Traditional CNN
Classifiers [54.996358399108566]
We investigate the performance of the landmark general CNN classifiers, which presented top-notch results on large scale classification datasets.
We compare it against state-of-the-art fine-grained classifiers.
We show an extensive evaluation on six datasets to determine whether the fine-grained classifier is able to elevate the baseline in their experiments.
arXiv Detail & Related papers (2020-03-24T23:49:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.