Multi-Label Few-Shot Learning for Aspect Category Detection
- URL: http://arxiv.org/abs/2105.14174v1
- Date: Sat, 29 May 2021 01:56:11 GMT
- Title: Multi-Label Few-Shot Learning for Aspect Category Detection
- Authors: Mengting Hu, Shiwan Zhao, Honglei Guo, Chao Xue, Hang Gao, Tiegang
Gao, Renhong Cheng, Zhong Su
- Abstract summary: Aspect category detection (ACD) in sentiment analysis aims to identify the aspect categories mentioned in a sentence.
Existing few-shot learning approaches mainly focus on single-label predictions.
We propose a multi-label few-shot learning method based on the prototypical network.
- Score: 23.92900196246631
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Aspect category detection (ACD) in sentiment analysis aims to identify the
aspect categories mentioned in a sentence. In this paper, we formulate ACD in
the few-shot learning scenario. However, existing few-shot learning approaches
mainly focus on single-label predictions. These methods can not work well for
the ACD task since a sentence may contain multiple aspect categories.
Therefore, we propose a multi-label few-shot learning method based on the
prototypical network. To alleviate the noise, we design two effective attention
mechanisms. The support-set attention aims to extract better prototypes by
removing irrelevant aspects. The query-set attention computes multiple
prototype-specific representations for each query instance, which are then used
to compute accurate distances with the corresponding prototypes. To achieve
multi-label inference, we further learn a dynamic threshold per instance by a
policy network. Extensive experimental results on three datasets demonstrate
that the proposed method significantly outperforms strong baselines.
Related papers
- Few-Shot Multi-Label Aspect Category Detection Utilizing Prototypical
Network with Sentence-Level Weighting and Label Augmentation [3.140750628178514]
Multi-label aspect category detection is intended to detect multiple aspect categories occurring in a given sentence.
We introduce support set attention along with the augmented label information to mitigate the noise at word-level for each support set instance.
We use a sentence-level attention mechanism that gives different weights to each instance in the support set in order to compute prototypes by weighted averaging.
arXiv Detail & Related papers (2023-09-27T11:44:04Z) - Label-Driven Denoising Framework for Multi-Label Few-Shot Aspect
Category Detection [27.319209062256448]
We propose a novel Label-Driven Denoising Framework (LDF) to tackle the above problems.
Our framework achieves better performance than other state-of-the-art methods.
arXiv Detail & Related papers (2022-10-09T10:37:43Z) - Automatically Discovering Novel Visual Categories with Self-supervised
Prototype Learning [68.63910949916209]
This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections.
We propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training.
We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.
arXiv Detail & Related papers (2022-08-01T16:34:33Z) - Label-enhanced Prototypical Network with Contrastive Learning for
Multi-label Few-shot Aspect Category Detection [17.228616743739412]
Multi-label aspect category detection allows a given review sentence to contain multiple aspect categories.
We propose a novel label-enhanced network (LPN) for multi-label few-shot aspect category detection.
arXiv Detail & Related papers (2022-06-14T02:37:44Z) - Semantic Representation and Dependency Learning for Multi-Label Image
Recognition [76.52120002993728]
We propose a novel and effective semantic representation and dependency learning (SRDL) framework to learn category-specific semantic representation for each category.
Specifically, we design a category-specific attentional regions (CAR) module to generate channel/spatial-wise attention matrices to guide model.
We also design an object erasing (OE) module to implicitly learn semantic dependency among categories by erasing semantic-aware regions.
arXiv Detail & Related papers (2022-04-08T00:55:15Z) - CAD: Co-Adapting Discriminative Features for Improved Few-Shot
Classification [11.894289991529496]
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples.
Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning.
We propose a strategy to cross-attend and re-weight discriminative features for few-shot classification.
arXiv Detail & Related papers (2022-03-25T06:14:51Z) - Dual Prototypical Contrastive Learning for Few-shot Semantic
Segmentation [55.339405417090084]
We propose a dual prototypical contrastive learning approach tailored to the few-shot semantic segmentation (FSS) task.
The main idea is to encourage the prototypes more discriminative by increasing inter-class distance while reducing intra-class distance in prototype feature space.
We demonstrate that the proposed dual contrastive learning approach outperforms state-of-the-art FSS methods on PASCAL-5i and COCO-20i datasets.
arXiv Detail & Related papers (2021-11-09T08:14:50Z) - Few-shot Classification via Adaptive Attention [93.06105498633492]
We propose a novel few-shot learning method via optimizing and fast adapting the query sample representation based on very few reference samples.
As demonstrated experimentally, the proposed model achieves state-of-the-art classification results on various benchmark few-shot classification and fine-grained recognition datasets.
arXiv Detail & Related papers (2020-08-06T05:52:59Z) - Part-aware Prototype Network for Few-shot Semantic Segmentation [50.581647306020095]
We propose a novel few-shot semantic segmentation framework based on the prototype representation.
Our key idea is to decompose the holistic class representation into a set of part-aware prototypes.
We develop a novel graph neural network model to generate and enhance the proposed part-aware prototypes.
arXiv Detail & Related papers (2020-07-13T11:03:09Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.