Utilization of Multinomial Naive Bayes Algorithm and Term Frequency
Inverse Document Frequency (TF-IDF Vectorizer) in Checking the Credibility of
News Tweet in the Philippines
- URL: http://arxiv.org/abs/2306.00018v1
- Date: Tue, 30 May 2023 15:41:15 GMT
- Title: Utilization of Multinomial Naive Bayes Algorithm and Term Frequency
Inverse Document Frequency (TF-IDF Vectorizer) in Checking the Credibility of
News Tweet in the Philippines
- Authors: Neil Christian R. Riego and Danny Bell Villarba
- Abstract summary: This paper utilizes ground truth-based annotations and TF-IDF as feature extraction for the news articles.
The model has an accuracy of 99.46% in training and 88.98% in predicting unseen data.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The digitalization of news media become a good indicator of progress and
signal to more threats. Media disinformation or fake news is one of these
threats, and it is necessary to take any action in fighting disinformation.
This paper utilizes ground truth-based annotations and TF-IDF as feature
extraction for the news articles which is then used as a training data set for
Multinomial Naive Bayes. The model has an accuracy of 99.46% in training and
88.98% in predicting unseen data. Tagging fake news as real news is a
concerning point on the prediction that is indicated in the F1 score of 89.68%.
This could lead to a negative impact. To prevent this to happen it is suggested
to further improve the corpus collection, and use an ensemble machine learning
to reinforce the prediction
Related papers
- Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News
Detection [50.07850264495737]
"Prompt-and-Align" (P&A) is a novel prompt-based paradigm for few-shot fake news detection.
We show that P&A sets new states-of-the-art for few-shot fake news detection performance by significant margins.
arXiv Detail & Related papers (2023-09-28T13:19:43Z) - fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese [0.6775616141339018]
This paper presents a comprehensive study on detecting fake news in Brazilian Portuguese.
We propose a machine learning-based approach that leverages natural language processing techniques, including TF-IDF and Word2Vec.
We develop a user-friendly web platform, fakenewsbr.com, to facilitate the verification of news articles' veracity.
arXiv Detail & Related papers (2023-09-20T04:10:03Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - MMCoVaR: Multimodal COVID-19 Vaccine Focused Data Repository for Fake
News Detection and a Baseline Architecture for Classification [1.587767395906846]
We provide a new multimodal labeled dataset containing news articles and tweets on the COVID-19 vaccine.
We combine ratings from three news media ranking sites to classify the news dataset into two levels of credibility: reliable and unreliable.
We find that the proposed architecture has an F-Score of 0.919 and accuracy of 0.882 for fake news detection.
arXiv Detail & Related papers (2021-09-14T03:57:50Z) - Explainable Tsetlin Machine framework for fake news detection with
credibility score assessment [16.457778420360537]
We propose a novel interpretable fake news detection framework based on the recently introduced Tsetlin Machine (TM)
We use the conjunctive clauses of the TM to capture lexical and semantic properties of both true and fake news text.
For evaluation, we conduct experiments on two publicly available datasets, PolitiFact and GossipCop, and demonstrate that the TM framework significantly outperforms previously published baselines by at least $5%$ in terms of accuracy.
arXiv Detail & Related papers (2021-05-19T13:18:02Z) - User Preference-aware Fake News Detection [61.86175081368782]
Existing fake news detection algorithms focus on mining news content for deceptive signals.
We propose a new framework, UPFD, which simultaneously captures various signals from user preferences by joint content and graph modeling.
arXiv Detail & Related papers (2021-04-25T21:19:24Z) - Misinfo Belief Frames: A Case Study on Covid & Climate News [49.979419711713795]
We propose a formalism for understanding how readers perceive the reliability of news and the impact of misinformation.
We introduce the Misinfo Belief Frames (MBF) corpus, a dataset of 66k inferences over 23.5k headlines.
Our results using large-scale language modeling to predict misinformation frames show that machine-generated inferences can influence readers' trust in news headlines.
arXiv Detail & Related papers (2021-04-18T09:50:11Z) - Evaluating Deep Learning Approaches for Covid19 Fake News Detection [0.0]
We look at automated techniques for fake news detection from a data mining perspective.
We evaluate different supervised text classification algorithms on Contraint@AAAI 2021 Covid-19 Fake news detection dataset.
We report the best accuracy of 98.41% on the Covid-19 Fake news detection dataset.
arXiv Detail & Related papers (2021-01-11T16:39:03Z) - An Event Correlation Filtering Method for Fake News Detection [0.0]
Existing deep learning models have achieved great progress to tackle the problem of fake news detection.
To improve the detection performance of fake news, we take advantage of the event correlations of news.
ECFM is proposed to integrate them to detect fake news in an event correlation filtering manner.
arXiv Detail & Related papers (2020-12-10T07:31:07Z) - Machine Learning Explanations to Prevent Overtrust in Fake News
Detection [64.46876057393703]
This research investigates the effects of an Explainable AI assistant embedded in news review platforms for combating the propagation of fake news.
We design a news reviewing and sharing interface, create a dataset of news stories, and train four interpretable fake news detection algorithms.
For a deeper understanding of Explainable AI systems, we discuss interactions between user engagement, mental model, trust, and performance measures in the process of explaining.
arXiv Detail & Related papers (2020-07-24T05:42:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.