Sentiment Analysis of Cybersecurity Content on Twitter and Reddit
- URL: http://arxiv.org/abs/2204.12267v1
- Date: Tue, 26 Apr 2022 12:46:55 GMT
- Title: Sentiment Analysis of Cybersecurity Content on Twitter and Reddit
- Authors: Bipun Thapa
- Abstract summary: This research analyzed cybersecurity content on Twitter and Reddit to measure its sentiment, positive or negative, or neutral.
A random sample of cybersecurity content (ten tweets and posts) was also classified for sentiments by twenty human annotators to evaluate the performance of VADER.
When compared to human classification, which was considered the standard or source of truth, VADER produced 60% accuracy for Twitter and 70% for Reddit in assessing the sentiment.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Sentiment Analysis provides an opportunity to understand the subject(s),
especially in the digital age, due to an abundance of public data and effective
algorithms. Cybersecurity is a subject where opinions are plentiful and
differing in the public domain. This descriptive research analyzed
cybersecurity content on Twitter and Reddit to measure its sentiment, positive
or negative, or neutral. The data from Twitter and Reddit was amassed via
technology-specific APIs during a selected timeframe to create datasets, which
were then analyzed individually for their sentiment by VADER, an NLP (Natural
Language Processing) algorithm. A random sample of cybersecurity content (ten
tweets and posts) was also classified for sentiments by twenty human annotators
to evaluate the performance of VADER. Cybersecurity content on Twitter was at
least 48% positive, and Reddit was at least 26.5% positive. The positive or
neutral content far outweighed negative sentiments across both platforms. When
compared to human classification, which was considered the standard or source
of truth, VADER produced 60% accuracy for Twitter and 70% for Reddit in
assessing the sentiment; in other words, some agreement between algorithm and
human classifiers. Overall, the goal was to explore an uninhibited research
topic about cybersecurity sentiment
Related papers
- You Shall Know a Tool by the Traces it Leaves: The Predictability of Sentiment Analysis Tools [74.98850427240464]
We show that sentiment analysis tools disagree on the same dataset.
We show that the sentiment tool used for sentiment annotation can even be predicted from its outcome.
arXiv Detail & Related papers (2024-10-18T17:27:38Z) - Depression detection in social media posts using affective and social
norm features [84.12658971655253]
We propose a deep architecture for depression detection from social media posts.
We incorporate profanity and morality features of posts and words in our architecture using a late fusion scheme.
The inclusion of the proposed features yields state-of-the-art results in both settings.
arXiv Detail & Related papers (2023-03-24T21:26:27Z) - Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media
Data: Comparative Study [0.7874708385247353]
This study investigated and compared public sentiment related to COVID-19 vaccines expressed on two popular social media platforms, Reddit and Twitter.
We created a fine-tuned DistilRoBERTa model to predict sentiments of approximately 9.5 million Tweets and 70 thousand Reddit comments.
Our results determined that the average sentiment expressed on Twitter was more negative (52% positive) than positive.
arXiv Detail & Related papers (2022-10-17T16:22:18Z) - Perspectives of Non-Expert Users on Cyber Security and Privacy: An
Analysis of Online Discussions on Twitter [3.5174884177930448]
Two machine learning-based classifiers were developed to identify the 413,985 tweets.
We observed a 54% increase in non-expert users tweets on cyber security and/or privacy related topics in 2021, compared to before the start of global COVID-19 lockdowns (January 2019 to February 2020)
Our analysis revealed a diverse range of topics discussed by non-expert users across the three years, including VPNs, Wi-Fi, smartphones, laptops, smart home devices, financial security, and security and privacy issues involving different stakeholders.
arXiv Detail & Related papers (2022-06-05T11:54:48Z) - Sentiment Analysis of Microblogging dataset on Coronavirus Pandemic [0.8252679746749371]
Coronavirus (COVID-19) is a contagious illness caused by the coronavirus 2 that causes severe respiratory symptoms.
In this paper, we have analyzed the Twitter dataset for evaluating the sentiment using several machine learning algorithms.
arXiv Detail & Related papers (2021-11-17T18:13:54Z) - Identifying negativity factors from social media text corpus using
sentiment analysis method [0.0]
In this study, we hierarchically goes down into negative comments, and link them with more classes.
Tweets are extracted from social media sites such as Twitter and Facebook.
Based on expert opinions, the negative comments/tweets are further classified into 8 classes.
arXiv Detail & Related papers (2021-07-06T10:15:31Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Political audience diversity and news reliability in algorithmic ranking [54.23273310155137]
We propose using the political diversity of a website's audience as a quality signal.
Using news source reliability ratings from domain experts and web browsing data from a diverse sample of 6,890 U.S. citizens, we first show that websites with more extreme and less politically diverse audiences have lower journalistic standards.
arXiv Detail & Related papers (2020-07-16T02:13:55Z) - Tweets Sentiment Analysis via Word Embeddings and Machine Learning
Techniques [1.345251051985899]
This paper aims to perform sentiment analysis of real-time 2019 election twitter data using the feature selection model word2vec and the machine learning algorithm random forest for sentiment classification.
Word2vec improves the quality of features by considering contextual semantics of words in a text hence improving the accuracy of machine learning and sentiment analysis.
arXiv Detail & Related papers (2020-07-05T08:10:30Z) - Information Consumption and Social Response in a Segregated Environment:
the Case of Gab [74.5095691235917]
This work provides a characterization of the interaction patterns within Gab around the COVID-19 topic.
We find that there are no strong statistical differences in the social response to questionable and reliable content.
Our results provide insights toward the understanding of coordinated inauthentic behavior and on the early-warning of information operation.
arXiv Detail & Related papers (2020-06-03T11:34:25Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.