TagRec: Automated Tagging of Questions with Hierarchical Learning
Taxonomy
- URL: http://arxiv.org/abs/2107.10649v1
- Date: Sat, 3 Jul 2021 11:50:55 GMT
- Title: TagRec: Automated Tagging of Questions with Hierarchical Learning
Taxonomy
- Authors: Venktesh V, Mukesh Mohania, Vikram Goyal
- Abstract summary: Online educational platforms organize academic questions based on a hierarchical learning taxonomy (subject-chapter-topic)
This paper formulates the problem as a similarity-based retrieval task where we optimize the semantic relatedness between the taxonomy and the questions.
We demonstrate that our method helps to handle the unseen labels and hence can be used for taxonomy tagging in the wild.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online educational platforms organize academic questions based on a
hierarchical learning taxonomy (subject-chapter-topic). Automatically tagging
new questions with existing taxonomy will help organize these questions into
different classes of hierarchical taxonomy so that they can be searched based
on the facets like chapter. This task can be formulated as a flat multi-class
classification problem. Usually, flat classification based methods ignore the
semantic relatedness between the terms in the hierarchical taxonomy and the
questions. Some traditional methods also suffer from the class imbalance issues
as they consider only the leaf nodes ignoring the hierarchy. Hence, we
formulate the problem as a similarity-based retrieval task where we optimize
the semantic relatedness between the taxonomy and the questions. We demonstrate
that our method helps to handle the unseen labels and hence can be used for
taxonomy tagging in the wild. In this method, we augment the question with its
corresponding answer to capture more semantic information and then align the
question-answer pair's contextualized embedding with the corresponding label
(taxonomy) vector representations. The representations are aligned by
fine-tuning a transformer based model with a loss function that is a
combination of the cosine similarity and hinge rank loss. The loss function
maximizes the similarity between the question-answer pair and the correct label
representations and minimizes the similarity to unrelated labels. Finally, we
perform experiments on two real-world datasets. We show that the proposed
learning method outperforms representations learned using the multi-class
classification method and other state of the art methods by 6% as measured by
Recall@k. We also demonstrate the performance of the proposed method on unseen
but related learning content like the learning objectives without re-training
the network.
Related papers
- Hierarchical Query Classification in E-commerce Search [38.67034103433015]
E-commerce platforms typically store and structure product information and search data in a hierarchy.
Efficiently categorizing user search queries into a similar hierarchical structure is paramount in enhancing user experience on e-commerce platforms as well as news curation and academic research.
The inherent complexity of hierarchical query classification is compounded by two primary challenges: (1) the pronounced class imbalance that skews towards dominant categories, and (2) the inherent brevity and ambiguity of search queries that hinder accurate classification.
arXiv Detail & Related papers (2024-03-09T21:55:55Z) - Multi-Label Knowledge Distillation [86.03990467785312]
We propose a novel multi-label knowledge distillation method.
On one hand, it exploits the informative semantic knowledge from the logits by dividing the multi-label learning problem into a set of binary classification problems.
On the other hand, it enhances the distinctiveness of the learned feature representations by leveraging the structural information of label-wise embeddings.
arXiv Detail & Related papers (2023-08-12T03:19:08Z) - Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided
Relation Alignment and Adaptation [98.51938442785179]
Incremental few-shot semantic segmentation aims to incrementally extend a semantic segmentation model to novel classes.
This task faces a severe semantic-aliasing issue between base and novel classes due to data imbalance.
We propose the Semantic-guided Relation Alignment and Adaptation (SRAA) method that fully considers the guidance of prior semantic information.
arXiv Detail & Related papers (2023-05-18T10:40:52Z) - Association Graph Learning for Multi-Task Classification with Category
Shifts [68.58829338426712]
We focus on multi-task classification, where related classification tasks share the same label space and are learned simultaneously.
We learn an association graph to transfer knowledge among tasks for missing classes.
Our method consistently performs better than representative baselines.
arXiv Detail & Related papers (2022-10-10T12:37:41Z) - TagRec++: Hierarchical Label Aware Attention Network for Question
Categorization [0.3683202928838613]
Online learning systems organize the content according to a well defined taxonomy of hierarchical nature.
The task of categorizing inputs to the hierarchical labels is usually cast as a flat multi-class classification problem.
We formulate the task as a dense retrieval problem to retrieve the appropriate hierarchical labels for each content.
arXiv Detail & Related papers (2022-08-10T05:08:37Z) - Auxiliary Task Guided Interactive Attention Model for Question
Difficulty Prediction [6.951136079043972]
We propose a multi-task method with an interactive attention mechanism, Qdiff, for jointly predicting Bloom's taxonomy and difficulty levels of academic questions.
The proposed learning method would help learn representations that capture the relationship between Bloom's taxonomy and difficulty labels.
arXiv Detail & Related papers (2022-05-24T19:55:30Z) - Use All The Labels: A Hierarchical Multi-Label Contrastive Learning
Framework [75.79736930414715]
We present a hierarchical multi-label representation learning framework that can leverage all available labels and preserve the hierarchical relationship between classes.
We introduce novel hierarchy preserving losses, which jointly apply a hierarchical penalty to the contrastive loss, and enforce the hierarchy constraint.
arXiv Detail & Related papers (2022-04-27T21:41:44Z) - Rank-based loss for learning hierarchical representations [7.421724671710886]
In machine learning, the family of methods that use the 'extra' information is called hierarchical classification.
Here we focus on how to integrate the hierarchical information of a problem to learn embeddings representative of the hierarchical relationships.
We show that rank based loss is suitable to learn hierarchical representations of the data.
arXiv Detail & Related papers (2021-10-11T10:32:45Z) - Can Taxonomy Help? Improving Semantic Question Matching using Question
Taxonomy [37.57300969050908]
We propose a hybrid technique for semantic question matching.
It uses our proposed two-layered taxonomy for English questions by augmenting state-of-the-art deep learning models with question classes obtained from a deep learning based question.
arXiv Detail & Related papers (2021-01-20T16:23:04Z) - Exploring the Hierarchy in Relation Labels for Scene Graph Generation [75.88758055269948]
The proposed method can improve several state-of-the-art baselines by a large margin (up to $33%$ relative gain) in terms of Recall@50.
Experiments show that the proposed simple yet effective method can improve several state-of-the-art baselines by a large margin.
arXiv Detail & Related papers (2020-09-12T17:36:53Z) - Hierarchical Image Classification using Entailment Cone Embeddings [68.82490011036263]
We first inject label-hierarchy knowledge into an arbitrary CNN-based classifier.
We empirically show that availability of such external semantic information in conjunction with the visual semantics from images boosts overall performance.
arXiv Detail & Related papers (2020-04-02T10:22:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.