Spatial Data Mining of Public Transport Incidents reported in Social
Media
- URL: http://arxiv.org/abs/2110.05573v1
- Date: Mon, 11 Oct 2021 19:28:11 GMT
- Title: Spatial Data Mining of Public Transport Incidents reported in Social
Media
- Authors: Kamil Raczycki, Marcin Szyma\'nski, Yahor Yeliseyenka, Piotr
Szyma\'nski, Tomasz Kajdanowicz
- Abstract summary: Social media communication of transport phenomena usually lacks GIS annotations.
Most social media platforms do not allow attaching non-POI GPS coordinates to posts.
We infer a six-class transport information typology through exploration.
We show that our approach enables citizen science and use it to analyze the impact of three years of infrastructure incidents on passenger mobility.
- Score: 7.144384940254773
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Public transport agencies use social media as an essential tool for
communicating mobility incidents to passengers. However, while the short term,
day-to-day information about transport phenomena is usually posted in social
media with low latency, its availability is short term as the content is rarely
made an aggregated form. Social media communication of transport phenomena
usually lacks GIS annotations as most social media platforms do not allow
attaching non-POI GPS coordinates to posts. As a result, the analysis of
transport phenomena information is minimal. We collected three years of social
media posts of a polish public transport company with user comments. Through
exploration, we infer a six-class transport information typology. We
successfully build an information type classifier for social media posts,
detect stop names in posts, and relate them to GPS coordinates, obtaining a
spatial understanding of long-term aggregated phenomena. We show that our
approach enables citizen science and use it to analyze the impact of three
years of infrastructure incidents on passenger mobility, and the sentiment and
reaction scale towards each of the events. All these results are achieved for
Polish, an under-resourced language when it comes to spatial language
understanding, especially in social media contexts. To improve the situation,
we released two of our annotated data sets: social media posts with incident
type labels and matched stop names and social media comments with the annotated
sentiment. We also opensource the experimental codebase.
Related papers
- From Twitter to Reasoner: Understand Mobility Travel Modes and Sentiment Using Large Language Models [8.438695039581141]
We introduce a novel methodological framework utilizing Large Language Models (LLMs) to infer the mentioned travel modes from social media posts.
We find that most social media posts manifest negative rather than positive sentiments.
arXiv Detail & Related papers (2024-11-04T23:04:13Z) - MIDDAG: Where Does Our News Go? Investigating Information Diffusion via
Community-Level Information Pathways [114.42360191723469]
We present MIDDAG, an intuitive, interactive system that visualizes the information propagation paths on social media triggered by COVID-19-related news articles.
We construct communities among users and develop the propagation forecasting capability, enabling tracing and understanding of how information is disseminated at a higher level.
arXiv Detail & Related papers (2023-10-04T02:08:11Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Social Media Harms as a Trilemma: Asymmetry, Algorithms, and Audacious
Design Choices [0.0]
Social media has expanded in its use, and reach, since the inception of early social networks in the early 2000s.
We argue that as information (eco)systems, social media sites are vulnerable from three aspects.
We will unpack suggestions from various allied disciplines in untangling the 3A's above.
arXiv Detail & Related papers (2023-04-28T08:12:38Z) - On the Complexity of Object Detection on Real-world Public
Transportation Images for Social Distancing Measurement [0.8347190888362194]
Social distancing in public spaces has become an essential aspect in helping to reduce the impact of the COVID-19 pandemic.
There has been no study of social distance measurement on public transport.
We benchmark several state-of-the-art object detection algorithms using real-world footage taken from the London Underground and bus network.
arXiv Detail & Related papers (2022-02-14T11:47:26Z) - SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian
Trajectory Prediction [59.064925464991056]
We propose one new prediction model named Social Soft Attention Graph Convolution Network (SSAGCN)
SSAGCN aims to simultaneously handle social interactions among pedestrians and scene interactions between pedestrians and environments.
Experiments on public available datasets prove the effectiveness of SSAGCN and have achieved state-of-the-art results.
arXiv Detail & Related papers (2021-12-05T01:49:18Z) - Interpretation of Emergent Communication in Heterogeneous Collaborative
Embodied Agents [83.52684405389445]
We introduce the collaborative multi-object navigation task CoMON.
In this task, an oracle agent has detailed environment information in the form of a map.
It communicates with a navigator agent that perceives the environment visually and is tasked to find a sequence of goals.
We show that the emergent communication can be grounded to the agent observations and the spatial structure of the 3D environment.
arXiv Detail & Related papers (2021-10-12T06:56:11Z) - TBCOV: Two Billion Multilingual COVID-19 Tweets with Sentiment, Entity,
Geo, and Gender Labels [5.267993069044648]
This work presents TBCOV, a large-scale Twitter dataset comprising more than two billion multilingual tweets related to the COVID-19 pandemic collected worldwide over a continuous period of more than one year.
Several state-of-the-art deep learning models are used to enrich the data with important attributes, including sentiment labels, named-entities, mentions of persons, organizations, locations, user types, and gender information.
Our sentiment and trend analyses reveal interesting insights and confirm TBCOV's broad coverage of important topics.
arXiv Detail & Related papers (2021-10-04T06:17:12Z) - Streaming Social Event Detection and Evolution Discovery in
Heterogeneous Information Networks [90.3475746663728]
Events are happening in real-world and real-time, which can be planned and organized for occasions, such as social gatherings, festival celebrations, influential meetings or sports activities.
Social media platforms generate a lot of real-time text information regarding public events with different topics.
However, mining social events is challenging because events typically exhibit heterogeneous texture and metadata are often ambiguous.
arXiv Detail & Related papers (2021-04-02T02:13:10Z) - Analysing Social Media Network Data with R: Semi-Automated Screening of
Users, Comments and Communication Patterns [0.0]
Communication on social media platforms is increasingly widespread across societies.
Fake news, hate speech and radicalizing elements are part of this modern form of communication.
A basic understanding of these mechanisms and communication patterns could help to counteract negative forms of communication.
arXiv Detail & Related papers (2020-11-26T14:52:01Z) - Named Entity Recognition for Social Media Texts with Semantic
Augmentation [70.44281443975554]
Existing approaches for named entity recognition suffer from data sparsity problems when conducted on short and informal texts.
We propose a neural-based approach to NER for social media texts where both local (from running text) and augmented semantics are taken into account.
arXiv Detail & Related papers (2020-10-29T10:06:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.