Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of
Airline Passengers' Tweets
- URL: http://arxiv.org/abs/2209.14363v1
- Date: Wed, 28 Sep 2022 18:50:11 GMT
- Title: Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of
Airline Passengers' Tweets
- Authors: Shengyang Wu, Yi Gao
- Abstract summary: This study aims to measure customer satisfaction by analyzing sentiments of Tweets that mention airlines.
Relevant Tweets are retrieved from Twitter's API and processed through tokenization and vectorization.
In addition to sentiment analysis, we also perform lexical analysis on the collected Tweets to model keywords' frequencies.
- Score: 17.611366838121064
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: As one of the most extensive social networking services, Twitter has more
than 300 million active users as of 2022. Among its many functions, Twitter is
now one of the go-to platforms for consumers to share their opinions about
products or experiences, including flight services provided by commercial
airlines. This study aims to measure customer satisfaction by analyzing
sentiments of Tweets that mention airlines using a machine learning approach.
Relevant Tweets are retrieved from Twitter's API and processed through
tokenization and vectorization. After that, these processed vectors are passed
into a pre-trained machine learning classifier to predict the sentiments. In
addition to sentiment analysis, we also perform lexical analysis on the
collected Tweets to model keywords' frequencies, which provide meaningful
contexts to facilitate the interpretation of sentiments. We then apply time
series methods such as Bollinger Bands to detect abnormalities in sentiment
data. Using historical records from January to July 2022, our approach is
proven to be capable of capturing sudden and significant changes in passengers'
sentiment. This study has the potential to be developed into an application
that can help airlines, along with several other customer-facing businesses,
efficiently detect abrupt changes in customers' sentiments and take adequate
measures to counteract them.
Related papers
- You Shall Know a Tool by the Traces it Leaves: The Predictability of Sentiment Analysis Tools [74.98850427240464]
We show that sentiment analysis tools disagree on the same dataset.
We show that the sentiment tool used for sentiment annotation can even be predicted from its outcome.
arXiv Detail & Related papers (2024-10-18T17:27:38Z) - A Machine Learning Approach to Detect Customer Satisfaction From
Multiple Tweet Parameters [0.8585882243614277]
A positive review can help the company grow, while a negative one can quickly ruin its revenue and reputation.
But studying thousands of tweets and analyzing them to find the satisfaction of the customer is quite a difficult task.
Some work has already been done on this strategy to automate the procedure using machine learning and deep learning techniques.
This work has broadened its perspective to include these qualities.
arXiv Detail & Related papers (2024-02-25T05:16:43Z) - Leveraging Twitter Data for Sentiment Analysis of Transit User Feedback:
An NLP Framework [1.2499537119440245]
Traditional methods of collecting user feedback through transit surveys are often time-consuming, resource intensive, and costly.
We propose a novel NLP-based framework that harnesses the vast, abundant, and inexpensive data available on social media platforms like Twitter.
The proposed framework streamlines the process of gathering and analyzing user feedback without the need for costly and time-consuming user feedback surveys.
arXiv Detail & Related papers (2023-10-11T00:01:05Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - Sentiment analysis in tweets: an assessment study from classical to
modern text representation models [59.107260266206445]
Short texts published on Twitter have earned significant attention as a rich source of information.
Their inherent characteristics, such as the informal, and noisy linguistic style, remain challenging to many natural language processing (NLP) tasks.
This study fulfils an assessment of existing language models in distinguishing the sentiment expressed in tweets by using a rich collection of 22 datasets.
arXiv Detail & Related papers (2021-05-29T21:05:28Z) - Birdspotter: A Tool for Analyzing and Labeling Twitter Users [12.558187319452657]
Birdspotter is a tool to analyze and label Twitter users.
Birdspotter.ml is an exploratory visualizer for the computed metrics.
We show how to train birdspotter into a fully-fledged bot detector.
arXiv Detail & Related papers (2020-12-04T02:25:07Z) - Improving Sentiment Analysis over non-English Tweets using Multilingual
Transformers and Automatic Translation for Data-Augmentation [77.69102711230248]
We propose the use of a multilingual transformer model, that we pre-train over English tweets and apply data-augmentation using automatic translation to adapt the model to non-English languages.
Our experiments in French, Spanish, German and Italian suggest that the proposed technique is an efficient way to improve the results of the transformers over small corpora of tweets in a non-English language.
arXiv Detail & Related papers (2020-10-07T15:44:55Z) - Writer Identification Using Microblogging Texts for Social Media
Forensics [53.180678723280145]
We evaluate popular stylometric features, widely used in literary analysis, and specific Twitter features like URLs, hashtags, replies or quotes.
We test varying sized author sets and varying amounts of training/test texts per author.
arXiv Detail & Related papers (2020-07-31T00:23:18Z) - Tweets Sentiment Analysis via Word Embeddings and Machine Learning
Techniques [1.345251051985899]
This paper aims to perform sentiment analysis of real-time 2019 election twitter data using the feature selection model word2vec and the machine learning algorithm random forest for sentiment classification.
Word2vec improves the quality of features by considering contextual semantics of words in a text hence improving the accuracy of machine learning and sentiment analysis.
arXiv Detail & Related papers (2020-07-05T08:10:30Z) - Sentiment Analysis on Social Media Content [0.0]
The aim of this paper is to present a model that can perform sentiment analysis of real data collected from Twitter.
Data in Twitter is highly unstructured which makes it difficult to analyze.
Our proposed model is different from prior work in this field because it combined the use of supervised and unsupervised machine learning algorithms.
arXiv Detail & Related papers (2020-07-04T17:03:30Z) - An Iterative Approach for Identifying Complaint Based Tweets in Social
Media Platforms [76.9570531352697]
We propose an iterative methodology which aims to identify complaint based posts pertaining to the transport domain.
We perform comprehensive evaluations along with releasing a novel dataset for the research purposes.
arXiv Detail & Related papers (2020-01-24T22:23:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.