Manipulating Twitter Through Deletions
- URL: http://arxiv.org/abs/2203.13893v1
- Date: Fri, 25 Mar 2022 20:07:08 GMT
- Title: Manipulating Twitter Through Deletions
- Authors: Christopher Torres-Lugo, Manita Pote, Alexander Nwala, Filippo Menczer
- Abstract summary: Research into influence campaigns on Twitter has mostly relied on identifying malicious activities from tweets obtained via public APIs.
Here, we provide the first exhaustive, large-scale analysis of anomalous deletion patterns involving more than a billion deletions by over 11 million accounts.
We find that a small fraction of accounts delete a large number of tweets daily.
First, limits on tweet volume are circumvented, allowing certain accounts to flood the network with over 26 thousand daily tweets.
Second, coordinated networks of accounts engage in repetitive likes and unlikes of content that is eventually deleted, which can manipulate ranking algorithms.
- Score: 64.33261764633504
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Research into influence campaigns on Twitter has mostly relied on identifying
malicious activities from tweets obtained via public APIs. These APIs provide
access to public tweets that have not been deleted. However, bad actors can
delete content strategically to manipulate the system. Unfortunately, estimates
based on publicly available Twitter data underestimate the true deletion
volume. Here, we provide the first exhaustive, large-scale analysis of
anomalous deletion patterns involving more than a billion deletions by over 11
million accounts. We find that a small fraction of accounts delete a large
number of tweets daily. We also uncover two abusive behaviors that exploit
deletions. First, limits on tweet volume are circumvented, allowing certain
accounts to flood the network with over 26 thousand daily tweets. Second,
coordinated networks of accounts engage in repetitive likes and unlikes of
content that is eventually deleted, which can manipulate ranking algorithms.
These kinds of abuse can be exploited to amplify content and inflate
popularity, while evading detection. Our study provides platforms and
researchers with new methods for identifying social media abuse.
Related papers
- Russo-Ukrainian War: Prediction and explanation of Twitter suspension [47.61306219245444]
This study focuses on the Twitter suspension mechanism and the analysis of shared content and features of user accounts that may lead to this.
We have obtained a dataset containing 107.7M tweets, originating from 9.8 million users, using Twitter API.
Our results reveal scam campaigns taking advantage of trending topics regarding the Russia-Ukrainian conflict for Bitcoin fraud, spam, and advertisement campaigns.
arXiv Detail & Related papers (2023-06-06T08:41:02Z) - Detecting and Reasoning of Deleted Tweets before they are Posted [5.300190188468289]
We identify deleted tweets, particularly within the Arabic context, and label them with a corresponding fine-grained disinformation category.
We then develop models that can predict the potentiality of tweets getting deleted, as well as the potential reasons behind deletion.
arXiv Detail & Related papers (2023-05-05T08:25:07Z) - LAMBRETTA: Learning to Rank for Twitter Soft Moderation [11.319938541673578]
LAMBRETTA is a system that automatically identifies tweets that are candidates for soft moderation.
We run LAMBRETTA on Twitter data to moderate false claims related to the 2020 US Election.
It flags over 20 times more tweets than Twitter, with only 3.93% false positives and 18.81% false negatives.
arXiv Detail & Related papers (2022-12-12T14:41:46Z) - Twitter Dataset on the Russo-Ukrainian War [68.713984286035]
We have initiated an ongoing dataset acquisition from Twitter API.
The dataset has reached the amount of 57.3 million tweets, originating from 7.7 million users.
We apply an initial volume and sentiment analysis, while the dataset can be used to further exploratory investigation towards topic analysis, hate speech, propaganda recognition, or even show potential malicious entities like botnets.
arXiv Detail & Related papers (2022-04-07T12:33:06Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - BotSpot: Deep Learning Classification of Bot Accounts within Twitter [2.099922236065961]
The openness feature of Twitter allows programs to generate and control Twitter accounts automatically via the Twitter API.
These accounts, which are known as bots, can automatically perform actions such as tweeting, re-tweeting, following, unfollowing, or direct messaging other accounts.
We introduce a novel bot detection approach using deep learning, with the Multi-layer Perceptron Neural Networks and nine features of a bot account.
arXiv Detail & Related papers (2021-09-08T15:17:10Z) - Retweet communities reveal the main sources of hate speech [0.6999740786886536]
We deploy advanced deep learning to produce high quality hate speech classification models.
We create retweet networks, detect communities and monitor their evolution through time.
Hate speech is dominated by offensive tweets, related to political and ideological issues.
About 60% of unacceptable tweets are produced by a single right-wing community of only moderate size.
arXiv Detail & Related papers (2021-05-31T11:43:19Z) - A Dataset of State-Censored Tweets [3.0254442724635173]
We release a dataset of 583,437 tweets by 155,715 users that were censored between 2012-2020 July.
We also release 4,301 accounts that were censored in their entirety.
Our dataset will not only aid in the study of government censorship but will also aid in studying hate speech detection and the effect of censorship on social media users.
arXiv Detail & Related papers (2021-01-15T00:18:27Z) - Writer Identification Using Microblogging Texts for Social Media
Forensics [53.180678723280145]
We evaluate popular stylometric features, widely used in literary analysis, and specific Twitter features like URLs, hashtags, replies or quotes.
We test varying sized author sets and varying amounts of training/test texts per author.
arXiv Detail & Related papers (2020-07-31T00:23:18Z) - Privacy-Aware Recommender Systems Challenge on Twitter's Home Timeline [47.434392695347924]
RecSys 2020 Challenge organized by ACM RecSys in partnership with Twitter using this dataset.
This paper touches on the key challenges faced by researchers and professionals striving to predict user engagements.
arXiv Detail & Related papers (2020-04-28T23:54:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.