A Comparative Study of Sentiment Analysis Using NLP and Different
Machine Learning Techniques on US Airline Twitter Data
- URL: http://arxiv.org/abs/2110.00859v1
- Date: Sat, 2 Oct 2021 18:05:00 GMT
- Title: A Comparative Study of Sentiment Analysis Using NLP and Different
Machine Learning Techniques on US Airline Twitter Data
- Authors: Md. Taufiqul Haque Khan Tusar, Md. Touhidul Islam
- Abstract summary: Sentiment Analysis is a technique of Natural Language Processing (NLP) and Machine Learning (ML)
In this paper, we have introduced two NLP techniques (Bag-of-Words and TF-IDF) and various ML classification algorithms.
Our best approaches provide 77% accuracy using Support Vector Machine and Logistic Regression with Bag-of-Words technique.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Today's business ecosystem has become very competitive. Customer satisfaction
has become a major focus for business growth. Business organizations are
spending a lot of money and human resources on various strategies to understand
and fulfill their customer's needs. But, because of defective manual analysis
on multifarious needs of customers, many organizations are failing to achieve
customer satisfaction. As a result, they are losing customer's loyalty and
spending extra money on marketing. We can solve the problems by implementing
Sentiment Analysis. It is a combined technique of Natural Language Processing
(NLP) and Machine Learning (ML). Sentiment Analysis is broadly used to extract
insights from wider public opinion behind certain topics, products, and
services. We can do it from any online available data. In this paper, we have
introduced two NLP techniques (Bag-of-Words and TF-IDF) and various ML
classification algorithms (Support Vector Machine, Logistic Regression,
Multinomial Naive Bayes, Random Forest) to find an effective approach for
Sentiment Analysis on a large, imbalanced, and multi-classed dataset. Our best
approaches provide 77% accuracy using Support Vector Machine and Logistic
Regression with Bag-of-Words technique.
Related papers
- Data Analysis in the Era of Generative AI [56.44807642944589]
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges.
We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow.
We then examine human-centered design principles that facilitate intuitive interactions, build user trust, and streamline the AI-assisted analysis workflow across multiple apps.
arXiv Detail & Related papers (2024-09-27T06:31:03Z) - What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices [91.71951459594074]
Long language models (LLMs) with extended context windows have significantly improved tasks such as information extraction, question answering, and complex planning scenarios.
Existing methods typically utilize the Self-Instruct framework to generate instruction tuning data for better long context capability improvement.
We propose the Multi-agent Interactive Multi-hop Generation framework, incorporating a Quality Verification Agent, a Single-hop Question Generation Agent, a Multiple Question Sampling Strategy, and a Multi-hop Question Merger Agent.
Our findings show that our synthetic high-quality long-context instruction data significantly enhances model performance, even surpassing models trained on larger amounts of human
arXiv Detail & Related papers (2024-09-03T13:30:00Z) - AROhI: An Interactive Tool for Estimating ROI of Data Analytics [0.0]
It is crucial to consider Return On Investment when performing data analytics.
This work details a comprehensive tool that provides conventional and advanced ML approaches for demonstration.
arXiv Detail & Related papers (2024-07-18T18:19:17Z) - Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel Industry [2.4244694855867275]
Large Language Models (LLMs) have emerged as powerful tools for extracting valuable insights from vast amounts of textual data.
In this study, we conduct a comparative analysis of LLMs for the extraction of travel customer needs from TripAdvisor posts.
Our findings highlight the efficacy of opensource LLMs, particularly Mistral 7B, in achieving comparable performance to larger closed models.
arXiv Detail & Related papers (2024-04-27T18:28:10Z) - An explainable machine learning-based approach for analyzing customers'
online data to identify the importance of product attributes [0.6437284704257459]
We propose a game theory machine learning (ML) method that extracts comprehensive design implications for product development.
We apply our method to a real-world dataset of laptops from Kaggle, and derive design implications based on the results.
arXiv Detail & Related papers (2024-02-03T20:50:48Z) - Buy when? Survival machine learning model comparison for purchase timing [0.0]
This article examines marketing machine learning techniques such as Support Vector Machines, Genetic Algorithms, Deep Learning, and K-Means.
Gender, Income, Location, PurchaseHistory, OnlineDiscounts, Interests, Promotionss and CustomerExperience all have an influence on purchasing time.
The study shows that the DeepSurv model predicted purchase completion the best.
arXiv Detail & Related papers (2023-08-28T06:40:02Z) - Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts [91.3755431537592]
The massive collection of user posts across social media platforms is primarily untapped for artificial intelligence (AI) use cases.
Natural language processing (NLP) is a subfield of AI that leverages bodies of documents, known as corpora, to train computers in human-like language understanding.
This study demonstrates that the applied results of unsupervised analysis allow a computer to predict either negative, positive, or neutral user sentiment towards plastic surgery.
arXiv Detail & Related papers (2023-07-05T20:16:20Z) - OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning [49.38867353135258]
We propose OverPrompt, leveraging the in-context learning capability of LLMs to handle multiple task inputs.
Our experiments show that OverPrompt can achieve cost-efficient zero-shot classification without causing significant detriment to task performance.
arXiv Detail & Related papers (2023-05-24T10:08:04Z) - Causality-Aided Trade-off Analysis for Machine Learning Fairness [11.149507394656709]
This paper uses causality analysis as a principled method for analyzing trade-offs between fairness parameters and other crucial metrics in machine learning pipelines.
We propose a set of domain-specific optimizations to facilitate accurate causal discovery and a unified, novel interface for trade-off analysis based on well-established causal inference methods.
arXiv Detail & Related papers (2023-05-22T14:14:43Z) - AI for IT Operations (AIOps) on Cloud Platforms: Reviews, Opportunities
and Challenges [60.56413461109281]
Artificial Intelligence for IT operations (AIOps) aims to combine the power of AI with the big data generated by IT Operations processes.
We discuss in depth the key types of data emitted by IT Operations activities, the scale and challenges in analyzing them, and where they can be helpful.
We categorize the key AIOps tasks as - incident detection, failure prediction, root cause analysis and automated actions.
arXiv Detail & Related papers (2023-04-10T15:38:12Z) - Meta Knowledge Condensation for Federated Learning [65.20774786251683]
Existing federated learning paradigms usually extensively exchange distributed models at a central solver to achieve a more powerful model.
This would incur severe communication burden between a server and multiple clients especially when data distributions are heterogeneous.
Unlike existing paradigms, we introduce an alternative perspective to significantly decrease the communication cost in federate learning.
arXiv Detail & Related papers (2022-09-29T15:07:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.