Leveraging Twitter Data for Sentiment Analysis of Transit User Feedback:
An NLP Framework
- URL: http://arxiv.org/abs/2310.07086v1
- Date: Wed, 11 Oct 2023 00:01:05 GMT
- Title: Leveraging Twitter Data for Sentiment Analysis of Transit User Feedback:
An NLP Framework
- Authors: Adway Das, Abhishek Kumar Prajapati, Pengxiang Zhang, Mukund Srinath,
Andisheh Ranjbari
- Abstract summary: Traditional methods of collecting user feedback through transit surveys are often time-consuming, resource intensive, and costly.
We propose a novel NLP-based framework that harnesses the vast, abundant, and inexpensive data available on social media platforms like Twitter.
The proposed framework streamlines the process of gathering and analyzing user feedback without the need for costly and time-consuming user feedback surveys.
- Score: 1.2499537119440245
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Traditional methods of collecting user feedback through transit surveys are
often time-consuming, resource intensive, and costly. In this paper, we propose
a novel NLP-based framework that harnesses the vast, abundant, and inexpensive
data available on social media platforms like Twitter to understand users'
perceptions of various service issues. Twitter, being a microblogging platform,
hosts a wealth of real-time user-generated content that often includes valuable
feedback and opinions on various products, services, and experiences. The
proposed framework streamlines the process of gathering and analyzing user
feedback without the need for costly and time-consuming user feedback surveys
using two techniques. First, it utilizes few-shot learning for tweet
classification within predefined categories, allowing effective identification
of the issues described in tweets. It then employs a lexicon-based sentiment
analysis model to assess the intensity and polarity of the tweet sentiments,
distinguishing between positive, negative, and neutral tweets. The
effectiveness of the framework was validated on a subset of manually labeled
Twitter data and was applied to the NYC subway system as a case study. The
framework accurately classifies tweets into predefined categories related to
safety, reliability, and maintenance of the subway system and effectively
measured sentiment intensities within each category. The general findings were
corroborated through a comparison with an agency-run customer survey conducted
in the same year. The findings highlight the effectiveness of the proposed
framework in gauging user feedback through inexpensive social media data to
understand the pain points of the transit system and plan for targeted
improvements.
Related papers
- Transit Pulse: Utilizing Social Media as a Source for Customer Feedback and Information Extraction with Large Language Model [12.6020349733674]
We propose a novel approach to extracting and analyzing transit-related information.
Our method employs Large Language Models (LLM), specifically Llama 3, for a streamlined analysis.
Our results demonstrate the potential of LLMs to transform social media data analysis in the public transit domain.
arXiv Detail & Related papers (2024-10-19T07:08:40Z) - Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs [57.16442740983528]
In ad-hoc retrieval, evaluation relies heavily on user actions, including implicit feedback.
The role of user feedback in annotators' assessment of turns in a conversational perception has been little studied.
We focus on how the evaluation of task-oriented dialogue systems ( TDSs) is affected by considering user feedback, explicit or implicit, as provided through the follow-up utterance of a turn being evaluated.
arXiv Detail & Related papers (2024-04-19T16:45:50Z) - Decoding the Silent Majority: Inducing Belief Augmented Social Graph
with Large Language Model for Response Forecasting [74.68371461260946]
SocialSense is a framework that induces a belief-centered graph on top of an existent social network, along with graph-based propagation to capture social dynamics.
Our method surpasses existing state-of-the-art in experimental evaluations for both zero-shot and supervised settings.
arXiv Detail & Related papers (2023-10-20T06:17:02Z) - Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of
Airline Passengers' Tweets [17.611366838121064]
This study aims to measure customer satisfaction by analyzing sentiments of Tweets that mention airlines.
Relevant Tweets are retrieved from Twitter's API and processed through tokenization and vectorization.
In addition to sentiment analysis, we also perform lexical analysis on the collected Tweets to model keywords' frequencies.
arXiv Detail & Related papers (2022-09-28T18:50:11Z) - A Closer Look at Debiased Temporal Sentence Grounding in Videos:
Dataset, Metric, and Approach [53.727460222955266]
Temporal Sentence Grounding in Videos (TSGV) aims to ground a natural language sentence in an untrimmed video.
Recent studies have found that current benchmark datasets may have obvious moment annotation biases.
We introduce a new evaluation metric "dR@n,IoU@m" that discounts the basic recall scores to alleviate the inflating evaluation caused by biased datasets.
arXiv Detail & Related papers (2022-03-10T08:58:18Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Towards A Sentiment Analyzer for Low-Resource Languages [0.0]
This research aims to analyse a sentiment of the users towards a particular trending topic that has been actively and massively discussed at that time.
We use the hashtag textit#kpujangancurang that was the trending topic during the Indonesia presidential election in 2019.
This research utilizes rapid miner tool to generate the twitter data and comparing Naive Bayes, K-Nearest Neighbor, Decision Tree, and Multi-Layer Perceptron classification methods to classify the sentiment of the twitter data.
arXiv Detail & Related papers (2020-11-12T13:50:00Z) - Sentiment Analysis on Social Media Content [0.0]
The aim of this paper is to present a model that can perform sentiment analysis of real data collected from Twitter.
Data in Twitter is highly unstructured which makes it difficult to analyze.
Our proposed model is different from prior work in this field because it combined the use of supervised and unsupervised machine learning algorithms.
arXiv Detail & Related papers (2020-07-04T17:03:30Z) - An Iterative Approach for Identifying Complaint Based Tweets in Social
Media Platforms [76.9570531352697]
We propose an iterative methodology which aims to identify complaint based posts pertaining to the transport domain.
We perform comprehensive evaluations along with releasing a novel dataset for the research purposes.
arXiv Detail & Related papers (2020-01-24T22:23:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.