Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor
Detection in Social Media
- URL: http://arxiv.org/abs/2004.12500v1
- Date: Sun, 26 Apr 2020 23:13:31 GMT
- Title: Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor
Detection in Social Media
- Authors: Chandra Mouli Madhav Kotteti, Xishuang Dong, Lijun Qian
- Abstract summary: We propose an ensemble model, which performs majority-voting on a collection of predictions by deep neural networks using time-series vector representation of Twitter data for timely detection of rumors.
Experimental results show that the classification performance has been improved by 7.9% in terms of micro F1 score compared to the baselines.
- Score: 2.6514980627603006
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Social media is a popular platform for timely information sharing. One of the
important challenges for social media platforms like Twitter is whether to
trust news shared on them when there is no systematic news verification
process. On the other hand, timely detection of rumors is a non-trivial task,
given the fast-paced social media environment. In this work, we proposed an
ensemble model, which performs majority-voting on a collection of predictions
by deep neural networks using time-series vector representation of Twitter data
for timely detection of rumors. By combining the proposed data pre-processing
method with the ensemble model, better performance of rumor detection has been
demonstrated in the experiments using PHEME dataset. Experimental results show
that the classification performance has been improved by 7.9% in terms of micro
F1 score compared to the baselines.
Related papers
- A Big Data Analytics System for Predicting Suicidal Ideation in Real-Time Based on Social Media Streaming Data [1.6574413179773761]
The paper proposed a new methodology based on a big data architecture to predict suicidal ideation from social media content.
The proposed approach provides a practical analysis of social media data in two phases: batch processing and real-time streaming prediction.
arXiv Detail & Related papers (2024-03-19T21:46:52Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Rumour detection using graph neural network and oversampling in
benchmark Twitter dataset [0.30079490585515345]
We propose a novel method for building automatic rumour detection system by focusing on oversampling.
Our oversampling method relies on contextualised data augmentation to generate synthetic samples for underrepresented classes in the dataset.
Two graph neural networks(GNN) are proposed to model non-linear conversations on a thread.
arXiv Detail & Related papers (2022-12-20T08:43:10Z) - Probing Spurious Correlations in Popular Event-Based Rumor Detection
Benchmarks [28.550143417847373]
Open-source benchmark datasets suffer from spurious correlations, which are ignored by existing studies.
We propose event-separated rumor detection as a solution to eliminate spurious cues.
Our method outperforms existing baselines in terms of effectiveness, efficiency and generalizability.
arXiv Detail & Related papers (2022-09-19T07:11:36Z) - Learning Phone Recognition from Unpaired Audio and Phone Sequences Based
on Generative Adversarial Network [58.82343017711883]
This paper investigates how to learn directly from unpaired phone sequences and speech utterances.
GAN training is adopted in the first stage to find the mapping relationship between unpaired speech and phone sequence.
In the second stage, another HMM model is introduced to train from the generator's output, which boosts the performance.
arXiv Detail & Related papers (2022-07-29T09:29:28Z) - A Closer Look at Debiased Temporal Sentence Grounding in Videos:
Dataset, Metric, and Approach [53.727460222955266]
Temporal Sentence Grounding in Videos (TSGV) aims to ground a natural language sentence in an untrimmed video.
Recent studies have found that current benchmark datasets may have obvious moment annotation biases.
We introduce a new evaluation metric "dR@n,IoU@m" that discounts the basic recall scores to alleviate the inflating evaluation caused by biased datasets.
arXiv Detail & Related papers (2022-03-10T08:58:18Z) - SoundDet: Polyphonic Sound Event Detection and Localization from Raw
Waveform [48.68714598985078]
SoundDet is an end-to-end trainable and light-weight framework for polyphonic moving sound event detection and localization.
SoundDet directly consumes the raw, multichannel waveform and treats the temporal sound event as a complete sound-object" to be detected.
A dense sound proposal event map is then constructed to handle the challenges of predicting events with large varying temporal duration.
arXiv Detail & Related papers (2021-06-13T11:43:41Z) - SRLF: A Stance-aware Reinforcement Learning Framework for Content-based
Rumor Detection on Social Media [15.985224010346593]
Early content-based methods focused on finding clues from text and user profiles for rumor detection.
Recent studies combine the stances of users' comments with news content to capture the difference between true and false rumors.
We propose a novel Stance-aware Reinforcement Learning Framework (SRLF) to select high-quality labeled stance data for model training and rumor detection.
arXiv Detail & Related papers (2021-05-10T03:58:34Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z) - Leveraging Multi-Source Weak Social Supervision for Early Detection of
Fake News [67.53424807783414]
Social media has greatly enabled people to participate in online activities at an unprecedented rate.
This unrestricted access also exacerbates the spread of misinformation and fake news online which might cause confusion and chaos unless being detected early for its mitigation.
We jointly leverage the limited amount of clean data along with weak signals from social engagements to train deep neural networks in a meta-learning framework to estimate the quality of different weak instances.
Experiments on realworld datasets demonstrate that the proposed framework outperforms state-of-the-art baselines for early detection of fake news without using any user engagements at prediction time.
arXiv Detail & Related papers (2020-04-03T18:26:33Z) - RP-DNN: A Tweet level propagation context based deep neural networks for
early rumor detection in Social Media [3.253418861583211]
Early rumor detection (ERD) on social media platform is very challenging when limited, incomplete and noisy information is available.
We present a novel hybrid neural network architecture, which combines a character-based bidirectional language model and stacked Long Short-Term Memory (LSTM) networks.
Our models achieve state-of-the-art(SoA) performance for detecting unseen rumors on large augmented data which covers more than 12 events and 2,967 rumors.
arXiv Detail & Related papers (2020-02-28T12:44:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.