DPOD: Domain-Specific Prompt Tuning for Multimodal Fake News Detection
- URL: http://arxiv.org/abs/2311.16496v3
- Date: Wed, 13 Mar 2024 02:32:32 GMT
- Title: DPOD: Domain-Specific Prompt Tuning for Multimodal Fake News Detection
- Authors: Debarshi Brahma, Amartya Bhattacharya, Suraj Nagaje Mahadev, Anmol
Asati, Vikas Verma, Soma Biswas
- Abstract summary: Fake news using out-of-context images has become widespread and is a pressing problem in this era of information overload.
We explore whether out-of-domain data can help improve out-of-context misinformation detection for a desired domain.
We propose a novel framework termed DPOD (Domain-specific Prompt-tuning using Out-of-Domain data).
- Score: 15.599951180606947
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The use of out-of-context images to spread fake news has become
widespread and is a pressing problem in this era of information overload. Such
out-of-context fake news may arise across different domains like politics,
sports, entertainment, etc. In practical scenarios, an inherent imbalance
exists among news articles from such widely varying domains, resulting in a few
domains with abundant data while the rest contain very limited data. Under such
circumstances, it is imperative to develop methods that can work across such
varying amounts of data. To address this challenging problem, we explore
whether out-of-domain data can help improve out-of-context misinformation
detection (termed here multi-modal fake news detection) for a desired domain.
Towards this goal, we propose a novel framework termed DPOD (Domain-specific
Prompt-tuning using Out-of-Domain data). First, to compute generalizable
features, we modify the vision-language model CLIP to extract features that
help align the representations of the images and the corresponding text
captions of both the in-domain and out-of-domain data in a label-aware manner.
Further, we propose a domain-specific prompt learning technique which leverages
the training samples of all the available domains based on the extent to which
they are useful to the desired domain. Extensive experiments on a large-scale
benchmark dataset, namely NewsCLIPpings, demonstrate that the proposed
framework achieves state-of-the-art performance, significantly surpassing
existing approaches for this challenging task. Code will be released on
acceptance.
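No implementation details are public yet ("Code will be released on acceptance"), so the following is a minimal, hypothetical PyTorch sketch of the two ideas the abstract names: a label-aware contrastive alignment of CLIP image/caption features, and a weighting scheme that lets out-of-domain samples contribute in proportion to their similarity to the desired domain. Every function and variable name here is an assumption, not the authors' API.

```python
import torch
import torch.nn.functional as F

def label_aware_alignment_loss(img_feats, txt_feats, labels, temperature=0.07):
    """One supervised-contrastive reading of 'label-aware alignment':
    image/caption pairs that share a label (e.g. pristine vs. falsified)
    are pulled together; pairs with different labels are pushed apart."""
    img_feats = F.normalize(img_feats, dim=-1)                  # (B, d)
    txt_feats = F.normalize(txt_feats, dim=-1)                  # (B, d)
    logits = img_feats @ txt_feats.t() / temperature            # (B, B)
    pos = (labels.unsqueeze(0) == labels.unsqueeze(1)).float()  # (B, B) mask
    log_prob = logits - torch.logsumexp(logits, dim=1, keepdim=True)
    # Average log-probability over the label-consistent captions per image.
    loss = -(log_prob * pos).sum(dim=1) / pos.sum(dim=1).clamp(min=1.0)
    return loss.mean()

def domain_weighted_loss(per_sample_loss, sample_domains, domain_embs, target_idx):
    """Hypothetical weighting: scale each sample's loss by how similar its
    domain embedding is to the desired domain, so useful out-of-domain
    samples contribute more to the domain-specific prompt."""
    sims = F.cosine_similarity(domain_embs[sample_domains],
                               domain_embs[target_idx].unsqueeze(0), dim=-1)
    weights = torch.softmax(sims, dim=0) * sims.numel()  # mean weight ~= 1
    return (weights * per_sample_loss).mean()
```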
Related papers
- A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation [52.0964459842176]
Current state-of-the-art dialogue systems heavily rely on extensive training datasets.
We propose a novel data Augmentation framework for Multi-Domain Dialogue Generation, referred to as AMD$^2$G.
The AMD$^2$G framework consists of a data augmentation process and a two-stage training approach: domain-agnostic training and domain adaptation training.
arXiv Detail & Related papers (2024-06-14T09:52:27Z) - Prompt-based Visual Alignment for Zero-shot Policy Transfer [35.784936617675896]
Overfitting has become one of the main obstacles to practical applications of reinforcement learning.
We propose prompt-based visual alignment (PVA) to mitigate the detrimental domain bias in the image for zero-shot policy transfer.
We verify PVA on a vision-based autonomous driving task with the CARLA simulator.
arXiv Detail & Related papers (2024-06-05T13:26:30Z) - WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization [63.98650220772378]
We present WIDIn, Wording Images for Domain-Invariant representation, to disentangle the discriminative visual representation.
We first estimate the language embedding with fine-grained alignment, which can then be used to adaptively identify and remove the domain-specific counterpart.
We show that WIDIn can be applied to both pretrained vision-language models like CLIP, and separately trained uni-modal models like MoCo and BERT.
arXiv Detail & Related papers (2024-05-28T17:46:27Z) - Phrase Grounding-based Style Transfer for Single-Domain Generalized
Object Detection [109.58348694132091]
Single-domain generalized object detection aims to enhance a model's generalizability to multiple unseen target domains.
This is a practical yet challenging task as it requires the model to address domain shift without incorporating target domain data into training.
We propose a novel phrase grounding-based style transfer approach for the task.
arXiv Detail & Related papers (2024-02-02T10:48:43Z) - Domain-Controlled Prompt Learning [49.45309818782329]
Existing prompt learning methods often lack domain-awareness or domain-transfer mechanisms.
We propose Domain-Controlled Prompt Learning for specific domains.
Our method achieves state-of-the-art performance on specific-domain image recognition datasets.
arXiv Detail & Related papers (2023-09-30T02:59:49Z) - Using Language to Extend to Unseen Domains [81.37175826824625]
It is expensive to collect training data for every possible domain that a vision model may encounter when deployed.
We consider how simply verbalizing the training domain, as well as the domains we want to extend to but lack data for, can improve robustness.
Using a multimodal model with a joint image and language embedding space, our method LADS learns a transformation of the image embeddings from the training domain to each unseen test domain (a rough sketch of this idea appears after this list).
arXiv Detail & Related papers (2022-10-18T01:14:02Z) - Batch Normalization Embeddings for Deep Domain Generalization [50.51405390150066]
Domain generalization aims at training machine learning models to perform robustly across different and unseen domains.
We show a significant increase in classification accuracy over current state-of-the-art techniques on popular domain generalization benchmarks.
arXiv Detail & Related papers (2020-11-25T12:02:57Z) - Domain Generalized Person Re-Identification via Cross-Domain Episodic
Learning [31.17248105464821]
We present an episodic learning scheme which advances meta learning strategies to exploit the observed source-domain labeled data.
Our experiments on four benchmark datasets confirm the superiority of our method over the state of the art.
arXiv Detail & Related papers (2020-10-19T14:42:29Z)
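As a concrete reading of the LADS summary above: with a CLIP-style joint embedding space, one can train a small transform on image embeddings with two losses, one pushing the embedding shift along the source-to-target text direction and one preserving class predictions. This is only a sketch under those assumptions; `DomainTransform`, `lads_style_losses`, and all arguments are illustrative names, not the paper's code.

```python
import torch
import torch.nn.functional as F

class DomainTransform(torch.nn.Module):
    """Small learned map applied to training-domain image embeddings."""
    def __init__(self, dim):
        super().__init__()
        self.linear = torch.nn.Linear(dim, dim)

    def forward(self, x):
        return F.normalize(self.linear(x), dim=-1)

def lads_style_losses(transform, img_emb, class_txt_emb, labels,
                      src_domain_txt, tgt_domain_txt, temperature=0.07):
    """Two objectives suggested by the summary: (1) domain alignment, i.e. the
    transformed embedding should shift along the source->target text direction,
    and (2) class consistency, i.e. it should still match its class text
    embedding."""
    x = F.normalize(img_emb, dim=-1)                           # (B, d)
    z = transform(x)                                           # (B, d)
    direction = F.normalize(tgt_domain_txt - src_domain_txt, dim=-1)  # (d,)
    # Encourage the shift z - x to point in the language-specified direction.
    align = 1.0 - F.cosine_similarity(z - x, direction.unsqueeze(0), dim=-1).mean()
    # Keep the transformed embedding classifiable against class text embeddings.
    logits = z @ F.normalize(class_txt_emb, dim=-1).t() / temperature  # (B, C)
    consistency = F.cross_entropy(logits, labels)
    return align + consistency
```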
This list is automatically generated from the titles and abstracts of the papers on this site.