Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
- URL: http://arxiv.org/abs/2306.00765v1
- Date: Thu, 1 Jun 2023 15:00:39 GMT
- Title: Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
- Authors: Erik Arakelyan, Arnav Arora, Isabelle Augenstein
- Abstract summary: Stance Detection is concerned with identifying the attitudes expressed by an author towards a target of interest.
This task spans a variety of domains ranging from social media opinion identification to detecting the stance for a legal claim.
We present a topic-guided diversity sampling technique and a contrastive objective that is used for fine-tuning a stance.
- Score: 44.06173809190896
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Stance Detection is concerned with identifying the attitudes expressed by an
author towards a target of interest. This task spans a variety of domains
ranging from social media opinion identification to detecting the stance for a
legal claim. However, the framing of the task varies within these domains, in
terms of the data collection protocol, the label dictionary and the number of
available annotations. Furthermore, these stance annotations are significantly
imbalanced on a per-topic and inter-topic basis. These make multi-domain stance
detection a challenging task, requiring standardization and domain adaptation.
To overcome this challenge, we propose $\textbf{T}$opic $\textbf{E}$fficient
$\textbf{St}$anc$\textbf{E}$ $\textbf{D}$etection (TESTED), consisting of a
topic-guided diversity sampling technique and a contrastive objective that is
used for fine-tuning a stance classifier. We evaluate the method on an existing
benchmark of $16$ datasets with in-domain, i.e. all topics seen and
out-of-domain, i.e. unseen topics, experiments. The results show that our
method outperforms the state-of-the-art with an average of $3.5$ F1 points
increase in-domain, and is more generalizable with an averaged increase of
$10.2$ F1 on out-of-domain evaluation while using $\leq10\%$ of the training
data. We show that our sampling technique mitigates both inter- and per-topic
class imbalances. Finally, our analysis demonstrates that the contrastive
learning objective allows the model a more pronounced segmentation of samples
with varying labels.
Related papers
- High-order Neighborhoods Know More: HyperGraph Learning Meets Source-free Unsupervised Domain Adaptation [34.08681468394247]
Source-free Unsupervised Domain Adaptation aims to classify target samples by only accessing a pre-trained source model and unlabelled target samples.
Existing methods normally exploit the pair-wise relation among target samples and attempt to discover their correlations by clustering these samples based on semantic features.
We propose a new SFDA method that exploits the high-order neighborhood relation and explicitly takes the domain shift effect into account.
arXiv Detail & Related papers (2024-05-11T05:07:43Z) - USB: A Unified Summarization Benchmark Across Tasks and Domains [68.82726887802856]
We introduce a Wikipedia-derived benchmark, complemented by a rich set of crowd-sourced annotations, that supports $8$ interrelated tasks.
We compare various methods on this benchmark and discover that on multiple tasks, moderately-sized fine-tuned models consistently outperform much larger few-shot prompted language models.
arXiv Detail & Related papers (2023-05-23T17:39:54Z) - Cyclically Disentangled Feature Translation for Face Anti-spoofing [61.70377630461084]
We propose a novel domain adaptation method called cyclically disentangled feature translation network (CDFTN)
CDFTN generates pseudo-labeled samples that possess: 1) source domain-invariant liveness features and 2) target domain-specific content features, which are disentangled through domain adversarial training.
A robust classifier is trained based on the synthetic pseudo-labeled images under the supervision of source domain labels.
arXiv Detail & Related papers (2022-12-07T14:12:34Z) - Low-confidence Samples Matter for Domain Adaptation [47.552605279925736]
Domain adaptation (DA) aims to transfer knowledge from a label-rich source domain to a related but label-scarce target domain.
We propose a novel contrastive learning method by processing low-confidence samples.
We evaluate the proposed method in both unsupervised and semi-supervised DA settings.
arXiv Detail & Related papers (2022-02-06T15:45:45Z) - A Simple Information-Based Approach to Unsupervised Domain-Adaptive
Aspect-Based Sentiment Analysis [58.124424775536326]
We propose a simple but effective technique based on mutual information to extract their term.
Experiment results show that our proposed method outperforms the state-of-the-art methods for cross-domain ABSA by 4.32% Micro-F1.
arXiv Detail & Related papers (2022-01-29T10:18:07Z) - Frequency Spectrum Augmentation Consistency for Domain Adaptive Object
Detection [107.52026281057343]
We introduce a Frequency Spectrum Augmentation Consistency (FSAC) framework with four different low-frequency filter operations.
In the first stage, we utilize all the original and augmented source data to train an object detector.
In the second stage, augmented source and target data with pseudo labels are adopted to perform the self-training for prediction consistency.
arXiv Detail & Related papers (2021-12-16T04:07:01Z) - AFAN: Augmented Feature Alignment Network for Cross-Domain Object
Detection [90.18752912204778]
Unsupervised domain adaptation for object detection is a challenging problem with many real-world applications.
We propose a novel augmented feature alignment network (AFAN) which integrates intermediate domain image generation and domain-adversarial training.
Our approach significantly outperforms the state-of-the-art methods on standard benchmarks for both similar and dissimilar domain adaptations.
arXiv Detail & Related papers (2021-06-10T05:01:20Z) - Cross-Domain Label-Adaptive Stance Detection [32.800766653254634]
Stance detection concerns the classification of a writer's viewpoint towards a target.
In this paper, we perform an in-depth analysis of 16 stance detection datasets.
We propose an end-to-end unsupervised framework for out-of-domain prediction of unseen, user-defined labels.
arXiv Detail & Related papers (2021-04-15T14:04:29Z) - Inductive Unsupervised Domain Adaptation for Few-Shot Classification via
Clustering [16.39667909141402]
Few-shot classification tends to struggle when it needs to adapt to diverse domains.
We introduce a framework, DaFeC, to improve Domain adaptation performance for Few-shot classification via Clustering.
Our approach outperforms previous work with absolute gains (in classification accuracy) of 4.95%, 9.55%, 3.99% and 11.62%, respectively.
arXiv Detail & Related papers (2020-06-23T08:17:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.