Interpretable Fake News Detection with Topic and Deep Variational Models
- URL: http://arxiv.org/abs/2209.01536v1
- Date: Sun, 4 Sep 2022 05:31:00 GMT
- Title: Interpretable Fake News Detection with Topic and Deep Variational Models
- Authors: Marjan Hosseini, Alireza Javadian Sabet, Suining He, and Derek Aguiar
- Abstract summary: We focus on fake news detection using interpretable features and methods.
We have developed a deep probabilistic model that integrates a dense representation of textual news.
Our model achieves comparable performance to state-of-the-art competing models.
- Score: 2.15242029196761
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The growing societal dependence on social media and user generated content
for news and information has increased the influence of unreliable sources and
fake content, which muddles public discourse and lessens trust in the media.
Validating the credibility of such information is a difficult task that is
susceptible to confirmation bias, leading to the development of algorithmic
techniques to distinguish between fake and real news. However, most existing
methods are challenging to interpret, making it difficult to establish trust in
predictions, and make assumptions that are unrealistic in many real-world
scenarios, e.g., the availability of audiovisual features or provenance. In
this work, we focus on fake news detection of textual content using
interpretable features and methods. In particular, we have developed a deep
probabilistic model that integrates a dense representation of textual news
using a variational autoencoder and bi-directional Long Short-Term Memory
(LSTM) networks with semantic topic-related features inferred from a Bayesian
admixture model. Extensive experimental studies with 3 real-world datasets
demonstrate that our model achieves comparable performance to state-of-the-art
competing models while facilitating model interpretability from the learned
topics. Finally, we have conducted model ablation studies to justify the
effectiveness and accuracy of integrating neural embeddings and topic features
both quantitatively by evaluating performance and qualitatively through
separability in lower dimensional embeddings.
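The fusion the abstract describes, a bi-directional LSTM encoder feeding a variational latent code that is concatenated with topic proportions from a Bayesian admixture (LDA-style) model, can be illustrated compactly. Below is a minimal PyTorch sketch under stated assumptions: `TopicVAEClassifier`, the mean-pooling step, and all layer sizes are illustrative choices rather than the authors' released implementation, and topic proportions are assumed to be precomputed per document.

```python
# Minimal sketch (PyTorch) of the fusion described in the abstract:
# a bi-directional LSTM encoder feeds a VAE latent code, which is
# concatenated with topic proportions from an LDA-style admixture
# model. All names and sizes are illustrative, not the paper's code.
import torch
import torch.nn as nn

class TopicVAEClassifier(nn.Module):
    def __init__(self, vocab_size, embed_dim=128, hidden_dim=64,
                 latent_dim=32, num_topics=20):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.encoder = nn.LSTM(embed_dim, hidden_dim,
                               batch_first=True, bidirectional=True)
        # VAE heads: mean and log-variance of the latent code.
        self.mu = nn.Linear(2 * hidden_dim, latent_dim)
        self.logvar = nn.Linear(2 * hidden_dim, latent_dim)
        # Classify from the latent code plus topic proportions.
        self.classifier = nn.Sequential(
            nn.Linear(latent_dim + num_topics, 64), nn.ReLU(),
            nn.Linear(64, 1))

    def forward(self, tokens, topic_props):
        h, _ = self.encoder(self.embed(tokens))
        h = h.mean(dim=1)  # pool over time steps
        mu, logvar = self.mu(h), self.logvar(h)
        # Reparameterization trick: z = mu + sigma * eps.
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        logits = self.classifier(torch.cat([z, topic_props], dim=-1))
        return logits.squeeze(-1), mu, logvar

def kl_term(mu, logvar):
    # KL(q(z|x) || N(0, I)), averaged over the batch.
    return -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
```

Training such a model would minimize a weighted sum of a binary cross-entropy loss on the logits and the KL term, with the topic proportions supplied per document by the admixture model.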
Related papers
- Context is Key: A Benchmark for Forecasting with Essential Textual Information [87.3175915185287]
"Context is Key" (CiK) is a time series forecasting benchmark that pairs numerical data with diverse types of carefully crafted textual context.
We evaluate a range of approaches, including statistical models, time series foundation models, and LLM-based forecasters.
Our experiments highlight the importance of incorporating contextual information, demonstrate surprising performance when using LLM-based forecasting models, and also reveal some of their critical shortcomings.
arXiv Detail & Related papers (2024-10-24T17:56:08Z)
- Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI [44.21078435758592]
Misinformation can spread quickly due to the ease of creating and disseminating content.
Traditional approaches to fake news detection often rely solely on content-based features.
We propose a comprehensive approach that integrates social context-based features with news content features.
arXiv Detail & Related papers (2024-10-03T15:49:35Z)
- Finding fake reviews in e-commerce platforms by using hybrid algorithms [0.0]
We propose an innovative ensemble approach that applies sentiment analysis to finding fake reviews.
Our ensemble architecture strategically combines diverse models to capitalize on their strengths while mitigating inherent weaknesses.
Our findings underscore the potential of ensemble techniques in advancing the state-of-the-art in finding fake reviews (a minimal voting sketch follows this entry).
arXiv Detail & Related papers (2024-04-09T14:25:27Z)
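A minimal sketch of the voting idea from the entry above, using scikit-learn. The member models (logistic regression, naive Bayes, random forest) and the TF-IDF features are illustrative assumptions, not necessarily the hybrid algorithms used in the paper.

```python
# Hypothetical soft-voting ensemble for fake-review detection with
# scikit-learn. The member models and TF-IDF features are assumed
# choices, not necessarily the hybrid algorithms used in the paper.
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

ensemble = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2), min_df=2),
    VotingClassifier(
        estimators=[
            ("lr", LogisticRegression(max_iter=1000)),
            ("nb", MultinomialNB()),
            ("rf", RandomForestClassifier(n_estimators=200)),
        ],
        voting="soft",  # average predicted class probabilities
    ),
)

# reviews: list[str]; labels: 0 = genuine, 1 = fake
# ensemble.fit(reviews, labels)
# ensemble.predict(["Amazing product!!! Best ever, buy now!"])
```

Soft voting averages each member's predicted probabilities, which tends to smooth over the individual models' failure modes, in line with the strengths-and-weaknesses framing above.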
- Corpus Considerations for Annotator Modeling and Scaling [9.263562546969695]
We show that the commonly used user token model consistently outperforms more complex models.
Our findings shed light on the relationship between corpus statistics and annotator modeling performance.
arXiv Detail & Related papers (2024-04-02T22:27:24Z)
- Capturing Pertinent Symbolic Features for Enhanced Content-Based Misinformation Detection [0.0]
The detection of misleading content presents a significant hurdle due to its extreme linguistic and domain variability.
This paper analyzes the linguistic attributes that characterize this phenomenon and how representative some of the most popular misinformation datasets are of such features.
We demonstrate that the appropriate use of pertinent symbolic knowledge in combination with neural language models is helpful in detecting misleading content (a minimal feature-fusion sketch follows this entry).
arXiv Detail & Related papers (2024-01-29T16:42:34Z)
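A minimal sketch of fusing symbolic linguistic features with a neural text representation, as the entry above suggests. The specific features and the sentence encoder named in the comments are illustrative assumptions, not the paper's feature set.

```python
# Hypothetical sketch of fusing symbolic linguistic features with a
# neural text embedding. The features below and the encoder named in
# the comments are illustrative assumptions, not the paper's setup.
import re
import numpy as np

def symbolic_features(text: str) -> np.ndarray:
    words = text.split()
    return np.array([
        len(words),                           # document length
        sum(w.isupper() for w in words),      # all-caps "shouting" words
        text.count("!") + text.count("?"),    # emphatic punctuation
        len(re.findall(r"\d", text)),         # digit count signal
    ], dtype=np.float32)

# One way to combine with a neural embedding (encoder is assumed):
# from sentence_transformers import SentenceTransformer
# encoder = SentenceTransformer("all-MiniLM-L6-v2")
# neural = encoder.encode([text])[0]
# fused = np.concatenate([neural, symbolic_features(text)])
# ... feed `fused` to any downstream classifier.
```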
- Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction [50.62245481416744]
We present the first benchmark that simulates the evaluation of open information extraction models in the real world.
We design and annotate a large-scale testbed in which each example is a knowledge-invariant clique.
We further elaborate the robustness metric: a model is judged robust if its performance is consistently accurate across the overall cliques.
arXiv Detail & Related papers (2023-05-23T12:05:09Z)
- Interpretable Detection of Out-of-Context Misinformation with Neural-Symbolic-Enhanced Large Multimodal Model [16.348950072491697]
Misinformation creators increasingly tend to use out-of-context multimedia content to deceive the public and fake news detection systems.
This new type of misinformation increases the difficulty of not only detection but also clarification, because every individual modality is close enough to true information.
In this paper we explore how to achieve interpretable cross-modal de-contextualization detection that simultaneously identifies the mismatched pairs and the cross-modal contradictions.
arXiv Detail & Related papers (2023-04-15T21:11:55Z)
- Exploring the Trade-off between Plausibility, Change Intensity and Adversarial Power in Counterfactual Explanations using Multi-objective Optimization [73.89239820192894]
We argue that automated counterfactual generation should take several aspects of the produced adversarial instances into account.
We present a novel framework for the generation of counterfactual examples.
arXiv Detail & Related papers (2022-05-20T15:02:53Z)
- Generative Counterfactuals for Neural Networks via Attribute-Informed Perturbation [51.29486247405601]
We design a framework to generate counterfactuals for raw data instances with the proposed Attribute-Informed Perturbation (AIP).
By utilizing generative models conditioned with different attributes, counterfactuals with desired labels can be obtained effectively and efficiently.
Experimental results on real-world texts and images demonstrate the effectiveness, sample quality as well as efficiency of our designed framework.
arXiv Detail & Related papers (2021-01-18T08:37:13Z)
- InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective [84.78604733927887]
Large-scale language models such as BERT have achieved state-of-the-art performance across a wide range of NLP tasks.
Recent studies show that such BERT-based models are vulnerable to textual adversarial attacks.
We propose InfoBERT, a novel learning framework for robust fine-tuning of pre-trained language models.
arXiv Detail & Related papers (2020-10-05T20:49:26Z)
This list is automatically generated from the titles and abstracts of the papers on this site.