Rating Facts under Coarse-to-fine Regimes
- URL: http://arxiv.org/abs/2107.06051v2
- Date: Wed, 14 Jul 2021 10:51:41 GMT
- Title: Rating Facts under Coarse-to-fine Regimes
- Authors: Guojun Wu
- Abstract summary: We collect 24K manually rated statements from PolitiFact.
Our task represents a twist from standard classification, due to the various degrees of similarity between classes.
After training, class similarity is sensible over the multi-class datasets, especially in the fine-grained one.
- Score: 0.533024001730262
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The rise of manipulating fake news as a political weapon has become a global
concern and highlighted the incapability of manually fact checking against
rapidly produced fake news. Thus, statistical approaches are required if we are
to address this problem efficiently. The shortage of publicly available
datasets is one major bottleneck of automated fact checking. To remedy this, we
collected 24K manually rated statements from PolitiFact. The class values
exhibit a natural order with respect to truthfulness as shown in Table 1. Thus,
our task represents a twist from standard classification, due to the various
degrees of similarity between classes. To investigate this, we defined
coarse-to-fine classification regimes, which presents new challenge for
classification. To address this, we propose BERT-based models. After training,
class similarity is sensible over the multi-class datasets, especially in the
fine-grained one. Under all the regimes, BERT achieves state of the art, while
the additional layers provide insignificant improvement.
Related papers
- Exploring Vacant Classes in Label-Skewed Federated Learning [113.65301899666645]
Label skews, characterized by disparities in local label distribution across clients, pose a significant challenge in federated learning.
This paper introduces FedVLS, a novel approach to label-skewed federated learning that integrates vacant-class distillation and logit suppression simultaneously.
arXiv Detail & Related papers (2024-01-04T16:06:31Z) - Generating Unbiased Pseudo-labels via a Theoretically Guaranteed
Chebyshev Constraint to Unify Semi-supervised Classification and Regression [57.17120203327993]
threshold-to-pseudo label process (T2L) in classification uses confidence to determine the quality of label.
In nature, regression also requires unbiased methods to generate high-quality labels.
We propose a theoretically guaranteed constraint for generating unbiased labels based on Chebyshev's inequality.
arXiv Detail & Related papers (2023-11-03T08:39:35Z) - ProTeCt: Prompt Tuning for Taxonomic Open Set Classification [59.59442518849203]
Few-shot adaptation methods do not fare well in the taxonomic open set (TOS) setting.
We propose a prompt tuning technique that calibrates the hierarchical consistency of model predictions.
A new Prompt Tuning for Hierarchical Consistency (ProTeCt) technique is then proposed to calibrate classification across label set granularities.
arXiv Detail & Related papers (2023-06-04T02:55:25Z) - Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity
Classification [21.842139093124512]
We introduce SEED, a novel vector-based method to claim veracity classification.
We build on the hypothesis that we can simulate class representative vectors that capture average semantic differences for claim-evidence pairs in a class.
Experiments conducted on the FEVER and SCIFACT datasets show consistent improvements over competitive baselines in few-shot settings.
arXiv Detail & Related papers (2022-05-11T17:23:37Z) - SCARF: Self-Supervised Contrastive Learning using Random Feature
Corruption [72.35532598131176]
We propose SCARF, a technique for contrastive learning, where views are formed by corrupting a random subset of features.
We show that SCARF complements existing strategies and outperforms alternatives like autoencoders.
arXiv Detail & Related papers (2021-06-29T08:08:33Z) - Binary Classification from Multiple Unlabeled Datasets via Surrogate Set
Classification [94.55805516167369]
We propose a new approach for binary classification from m U-sets for $mge2$.
Our key idea is to consider an auxiliary classification task called surrogate set classification (SSC)
arXiv Detail & Related papers (2021-02-01T07:36:38Z) - Exploiting stance hierarchies for cost-sensitive stance detection of Web
documents [24.898077978955406]
stance detection aims at identifying the position (stance) of a document towards a claim.
We propose a modular pipeline of cascading binary classifiers, enabling performance tuning on a per step and class basis.
We implement our approach through a combination of neural and traditional classification models that highlight the misclassification costs of minority classes.
arXiv Detail & Related papers (2020-07-29T21:40:01Z) - Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive
Person Re-Identification [64.37745443119942]
This paper jointly enforces visual and temporal consistency in the combination of a local one-hot classification and a global multi-class classification.
Experimental results on three large-scale ReID datasets demonstrate the superiority of proposed method in both unsupervised and unsupervised domain adaptive ReID tasks.
arXiv Detail & Related papers (2020-07-21T14:31:27Z) - Cost-Sensitive BERT for Generalisable Sentence Classification with
Imbalanced Data [5.08128537391027]
We show that BERT does not generalise well when the training and test data are sufficiently dissimilar.
We show how to address this problem by providing a statistical measure of similarity between datasets and a method of incorporating cost-weighting into BERT.
We achieve the second-highest score on sentence-level propaganda classification.
arXiv Detail & Related papers (2020-03-16T19:10:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.