Sentiment Analysis for Troll Detection on Weibo
- URL: http://arxiv.org/abs/2103.09054v1
- Date: Sun, 7 Mar 2021 14:59:12 GMT
- Title: Sentiment Analysis for Troll Detection on Weibo
- Authors: Zidong Jiang and Fabio Di Troia and Mark Stamp
- Abstract summary: In China, the micro-blogging service provider, Sina Weibo, is the most popular such service.
To influence public opinion, Weibo trolls can be hired to post deceptive comments.
In this paper, we focus on troll detection via sentiment analysis and other user activity data.
- Score: 6.961253535504979
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The impact of social media on the modern world is difficult to overstate.
Virtually all companies and public figures have social media accounts on
popular platforms such as Twitter and Facebook. In China, the micro-blogging
service provider, Sina Weibo, is the most popular such service. To influence
public opinion, Weibo trolls -- the so called Water Army -- can be hired to
post deceptive comments. In this paper, we focus on troll detection via
sentiment analysis and other user activity data on the Sina Weibo platform. We
implement techniques for Chinese sentence segmentation, word embedding, and
sentiment score calculation. In recent years, troll detection and sentiment
analysis have been studied, but we are not aware of previous research that
considers troll detection based on sentiment analysis. We employ the resulting
techniques to develop and test a sentiment analysis approach for troll
detection, based on a variety of machine learning strategies. Experimental
results are generated and analyzed. A Chrome extension is presented that
implements our proposed technique, which enables real-time troll detection when
a user browses Sina Weibo.
Related papers
- DisTrack: a new Tool for Semi-automatic Misinformation Tracking in Online Social Networks [46.38614083502535]
DisTrack is designed to combat the spread of misinformation through a combination of Natural Language Processing (NLP) Social Network Analysis (SNA) and graph visualization.
The tool is tailored to capture and analyze the dynamic nature of misinformation spread in digital environments.
arXiv Detail & Related papers (2024-08-01T15:17:33Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - A Review of Web Infodemic Analysis and Detection Trends across
Multi-modalities using Deep Neural Networks [3.42658286826597]
Fake news detection is one of the most analyzed and prominent areas of research.
Facebook, Reddit, WhatsApp, YouTube, and other social applications are gradually gaining attention in this emerging field.
This review primarily deals with multi-modal fake news detection techniques that include images, videos, and their combinations with text.
arXiv Detail & Related papers (2021-11-23T16:02:28Z) - A Study on Herd Behavior Using Sentiment Analysis in Online Social
Network [1.5673338088641469]
This paper represents and analyze the capacity of diverse strategies to predict critical opinions from online social networking sites.
Social media becomes a good outlet since the last decades to share the opinions globally.
This study demonstrates the evaluation of sentiment analysis techniques using social media contents.
arXiv Detail & Related papers (2021-07-25T05:22:35Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Analysis of Online Toxicity Detection Using Machine Learning Approaches [6.548580592686076]
Social media and the internet have become an integral part of how people spread and consume information.
Almost half of the population is using social media to express their views and opinions.
Online hate speech is one of the drawbacks of social media nowadays, which needs to be controlled.
arXiv Detail & Related papers (2021-04-23T04:29:13Z) - Twitter Spam Detection: A Systematic Review [4.348112720799065]
This review focuses on comparing the existing research techniques on Twitter spam detection systematically.
Most of the existing methods rely on Machine Learning-based algorithms.
We propose a taxonomy based on different feature selection methods and analyses, namely content analysis, user analysis, tweet analysis, network analysis, and hybrid analysis.
arXiv Detail & Related papers (2020-11-30T13:10:24Z) - Tweets Sentiment Analysis via Word Embeddings and Machine Learning
Techniques [1.345251051985899]
This paper aims to perform sentiment analysis of real-time 2019 election twitter data using the feature selection model word2vec and the machine learning algorithm random forest for sentiment classification.
Word2vec improves the quality of features by considering contextual semantics of words in a text hence improving the accuracy of machine learning and sentiment analysis.
arXiv Detail & Related papers (2020-07-05T08:10:30Z) - Russian trolls speaking Russian: Regional Twitter operations and MH17 [68.8204255655161]
In 2018, Twitter released data on accounts identified as Russian trolls.
We analyze the Russian-language operations of these trolls.
We find that trolls' information campaign on the MH17 crash was the largest in terms of tweet count.
arXiv Detail & Related papers (2020-05-13T19:48:12Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.