ContCommRTD: A Distributed Content-based Misinformation-aware Community
Detection System for Real-Time Disaster Reporting
- URL: http://arxiv.org/abs/2301.12984v1
- Date: Mon, 30 Jan 2023 15:28:47 GMT
- Title: ContCommRTD: A Distributed Content-based Misinformation-aware Community
Detection System for Real-Time Disaster Reporting
- Authors: Elena-Simona Apostol and Ciprian-Octavian Truic\u{a} and Adrian
Paschke
- Abstract summary: We propose a novel distributed system that provides in near real-time information on hazard-related events and their evolution.
Our distributed disaster reporting system analyzes the social relationship among worldwide geolocated tweets.
As misinformation can lead to increase damage if propagated in hazards related tweets, we propose a new deep learning model to detect fake news.
- Score: 0.5156484100374059
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Real-time social media data can provide useful information on evolving
hazards. Alongside traditional methods of disaster detection, the integration
of social media data can considerably enhance disaster management. In this
paper, we investigate the problem of detecting geolocation-content communities
on Twitter and propose a novel distributed system that provides in near
real-time information on hazard-related events and their evolution. We show
that content-based community analysis leads to better and faster dissemination
of reports on hazards. Our distributed disaster reporting system analyzes the
social relationship among worldwide geolocated tweets, and applies topic
modeling to group tweets by topics. Considering for each tweet the following
information: user, timestamp, geolocation, retweets, and replies, we create a
publisher-subscriber distribution model for topics. We use content similarity
and the proximity of nodes to create a new model for geolocation-content based
communities. Users can subscribe to different topics in specific geographical
areas or worldwide and receive real-time reports regarding these topics. As
misinformation can lead to increase damage if propagated in hazards related
tweets, we propose a new deep learning model to detect fake news. The
misinformed tweets are then removed from display. We also show empirically the
scalability capabilities of the proposed system.
Related papers
- CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics [49.2719253711215]
This study introduces a novel approach to disaster text classification by enhancing a pre-trained Large Language Model (LLM)
Our methodology involves creating a comprehensive instruction dataset from disaster-related tweets, which is then used to fine-tune an open-source LLM.
This fine-tuned model can classify multiple aspects of disaster-related information simultaneously, such as the type of event, informativeness, and involvement of human aid.
arXiv Detail & Related papers (2024-06-16T23:01:10Z) - InfoPattern: Unveiling Information Propagation Patterns in Social Media [59.67008841974645]
InfoPattern centers on the interplay between language and human ideology.
The demo is capable of: (1) red teaming to simulate adversary responses from opposite ideology communities; (2) stance detection to identify the underlying political sentiments in each message; (3) information propagation graph discovery to reveal the evolution of claims across various communities over time.
arXiv Detail & Related papers (2023-11-27T09:12:35Z) - CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster
Tweet Classification [51.58605842457186]
We present a fine-grained disaster tweet classification model under the semi-supervised, few-shot learning setting.
Our model, CrisisMatch, effectively classifies tweets into fine-grained classes of interest using few labeled data and large amounts of unlabeled data.
arXiv Detail & Related papers (2023-10-23T07:01:09Z) - MIDDAG: Where Does Our News Go? Investigating Information Diffusion via
Community-Level Information Pathways [114.42360191723469]
We present MIDDAG, an intuitive, interactive system that visualizes the information propagation paths on social media triggered by COVID-19-related news articles.
We construct communities among users and develop the propagation forecasting capability, enabling tracing and understanding of how information is disseminated at a higher level.
arXiv Detail & Related papers (2023-10-04T02:08:11Z) - Sarcasm Detection in a Disaster Context [103.93691731605163]
We introduce HurricaneSARC, a dataset of 15,000 tweets annotated for intended sarcasm.
Our best model is able to obtain as much as 0.70 F1 on our dataset.
arXiv Detail & Related papers (2023-08-16T05:58:12Z) - Investigating disaster response through social media data and the
Susceptible-Infected-Recovered (SIR) model: A case study of 2020 Western U.S.
wildfire season [0.8999666725996975]
Social media can reflect public concerns and demands during a disaster.
We used Bidirectional Representations from Transformers (BERT) topic modeling to cluster topics from Twitter data.
Our study details how the SIR model and topic modeling using social media data can provide decision-makers with a quantitative approach to measure disaster response.
arXiv Detail & Related papers (2023-08-10T01:51:33Z) - TriggerCit: Early Flood Alerting using Twitter and Geolocation -- a
comparison with alternative sources [0.2603110718989132]
Social media can support emergency response with evidence-based content posted by citizens and organisations during ongoing events.
We propose TriggerCit: an early flood alerting tool with a multilanguage approach focused on timeliness and geolocation.
Geolocated visual evidence extracted from Twitter by TriggerCit was analysed in two case studies on floods in Thailand and Nepal in 2021.
arXiv Detail & Related papers (2022-02-24T10:55:49Z) - TBCOV: Two Billion Multilingual COVID-19 Tweets with Sentiment, Entity,
Geo, and Gender Labels [5.267993069044648]
This work presents TBCOV, a large-scale Twitter dataset comprising more than two billion multilingual tweets related to the COVID-19 pandemic collected worldwide over a continuous period of more than one year.
Several state-of-the-art deep learning models are used to enrich the data with important attributes, including sentiment labels, named-entities, mentions of persons, organizations, locations, user types, and gender information.
Our sentiment and trend analyses reveal interesting insights and confirm TBCOV's broad coverage of important topics.
arXiv Detail & Related papers (2021-10-04T06:17:12Z) - Graph-based Joint Pandemic Concern and Relation Extraction on Twitter [19.7176519744206]
Public concern detection provides potential guidance to the authorities for crisis management before or during a pandemic outbreak.
detecting concerns in time from massive information in social media turns out to be a big challenge.
We propose a novel end-to-end deep learning model to identify people's concerns and the corresponding relations.
arXiv Detail & Related papers (2021-06-18T06:06:35Z) - Leveraging Multi-Source Weak Social Supervision for Early Detection of
Fake News [67.53424807783414]
Social media has greatly enabled people to participate in online activities at an unprecedented rate.
This unrestricted access also exacerbates the spread of misinformation and fake news online which might cause confusion and chaos unless being detected early for its mitigation.
We jointly leverage the limited amount of clean data along with weak signals from social engagements to train deep neural networks in a meta-learning framework to estimate the quality of different weak instances.
Experiments on realworld datasets demonstrate that the proposed framework outperforms state-of-the-art baselines for early detection of fake news without using any user engagements at prediction time.
arXiv Detail & Related papers (2020-04-03T18:26:33Z) - A multi-layer approach to disinformation detection on Twitter [4.663548775064491]
We employ a multi-layer representation of Twitter diffusion networks, and we compute for each layer a set of global network features.
Experimental results with two large-scale datasets, corresponding to diffusion cascades of news shared respectively in the United States and Italy, show that a simple Logistic Regression model is able to classify disinformation vs mainstream networks with high accuracy.
We believe that our network-based approach provides useful insights which pave the way to the future development of a system to detect misleading and harmful information spreading on social media.
arXiv Detail & Related papers (2020-02-28T09:25:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.