Towards Detecting Inauthentic Coordination in Twitter Likes Data
- URL: http://arxiv.org/abs/2305.07384v1
- Date: Fri, 12 May 2023 11:24:26 GMT
- Title: Towards Detecting Inauthentic Coordination in Twitter Likes Data
- Authors: Laura Jahn and Rasmus K. Rendsvig
- Abstract summary: Users customarily take engagement metrics such as likes as a neutral proxy for quality and authority.
This incentivizes like manipulation to influence public opinion through *coordinated inauthentic behavior* (CIB)
CIB targeted at likes is largely unstudied as collecting suitable data about users' liking behavior is non-trivial.
This paper contributes a scripted algorithm to collect suitable liking data from Twitter and a collected 30 day dataset of liking data from the Danish political Twittersphere #dkpol.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Social media feeds typically favor posts according to user engagement. The
most ubiquitous type of engagement (and the type we study) is *likes*. Users
customarily take engagement metrics such as likes as a neutral proxy for
quality and authority. This incentivizes like manipulation to influence public
opinion through *coordinated inauthentic behavior* (CIB). CIB targeted at likes
is largely unstudied as collecting suitable data about users' liking behavior
is non-trivial. This paper contributes a scripted algorithm to collect suitable
liking data from Twitter and a collected 30 day dataset of liking data from the
Danish political Twittersphere #dkpol, over which we analyze the script's
performance. Using only the binary matrix of users and the tweets they liked,
we identify large clusters of perfectly correlated users, and discuss our
findings in relation to CIB.
Related papers
- Towards Realistic Evaluation of Commit Message Generation by Matching Online and Offline Settings [77.20838441870151]
Commit message generation is a crucial task in software engineering that is challenging to evaluate correctly.
We use an online metric - the number of edits users introduce before committing the generated messages to the VCS - to select metrics for offline experiments.
Our results indicate that edit distance exhibits the highest correlation, whereas commonly used similarity metrics such as BLEU and METEOR demonstrate low correlation.
arXiv Detail & Related papers (2024-10-15T20:32:07Z) - Measuring Strategization in Recommendation: Users Adapt Their Behavior to Shape Future Content [66.71102704873185]
We test for user strategization by conducting a lab experiment and survey.
We find strong evidence of strategization across outcome metrics, including participants' dwell time and use of "likes"
Our findings suggest that platforms cannot ignore the effect of their algorithms on user behavior.
arXiv Detail & Related papers (2024-05-09T07:36:08Z) - Verified authors shape X/Twitter discursive communities [0.24999074238880484]
We show that the core of ideological/discursive communities on X/Twitter can be effectively identified by uncovering the most informative interactions.
The analysis is performed considering three X/Twitter datasets related to the main political events of 2022 in Italy.
arXiv Detail & Related papers (2024-05-08T09:04:46Z) - Going beyond research datasets: Novel intent discovery in the industry
setting [60.90117614762879]
This paper proposes methods to improve the intent discovery pipeline deployed in a large e-commerce platform.
We show the benefit of pre-training language models on in-domain data: both self-supervised and with weak supervision.
We also devise the best method to utilize the conversational structure (i.e., question and answer) of real-life datasets during fine-tuning for clustering tasks, which we call Conv.
arXiv Detail & Related papers (2023-05-09T14:21:29Z) - Detecting Political Opinions in Tweets through Bipartite Graph Analysis:
A Skip Aggregation Graph Convolution Approach [9.350629400940493]
We focus on the 2020 US presidential election and create a large-scale dataset from Twitter.
To detect political opinions in tweets, we build a user-tweet bipartite graph based on users' posting and retweeting behaviors.
We introduce a novel skip aggregation mechanism that makes tweet nodes aggregate information from second-order neighbors.
arXiv Detail & Related papers (2023-04-22T10:38:35Z) - D-BIAS: A Causality-Based Human-in-the-Loop System for Tackling
Algorithmic Bias [57.87117733071416]
We propose D-BIAS, a visual interactive tool that embodies human-in-the-loop AI approach for auditing and mitigating social biases.
A user can detect the presence of bias against a group by identifying unfair causal relationships in the causal network.
For each interaction, say weakening/deleting a biased causal edge, the system uses a novel method to simulate a new (debiased) dataset.
arXiv Detail & Related papers (2022-08-10T03:41:48Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - Towards A Sentiment Analyzer for Low-Resource Languages [0.0]
This research aims to analyse a sentiment of the users towards a particular trending topic that has been actively and massively discussed at that time.
We use the hashtag textit#kpujangancurang that was the trending topic during the Indonesia presidential election in 2019.
This research utilizes rapid miner tool to generate the twitter data and comparing Naive Bayes, K-Nearest Neighbor, Decision Tree, and Multi-Layer Perceptron classification methods to classify the sentiment of the twitter data.
arXiv Detail & Related papers (2020-11-12T13:50:00Z) - Sentiment Analysis on Social Media Content [0.0]
The aim of this paper is to present a model that can perform sentiment analysis of real data collected from Twitter.
Data in Twitter is highly unstructured which makes it difficult to analyze.
Our proposed model is different from prior work in this field because it combined the use of supervised and unsupervised machine learning algorithms.
arXiv Detail & Related papers (2020-07-04T17:03:30Z) - Disentangled Graph Collaborative Filtering [100.26835145396782]
Disentangled Graph Collaborative Filtering (DGCF) is a new model for learning informative representations of users and items from interaction data.
By modeling a distribution over intents for each user-item interaction, we iteratively refine the intent-aware interaction graphs and representations.
DGCF achieves significant improvements over several state-of-the-art models like NGCF, DisenGCN, and MacridVAE.
arXiv Detail & Related papers (2020-07-03T15:37:25Z) - TIMME: Twitter Ideology-detection via Multi-task Multi-relational
Embedding [26.074367752142198]
We aim at solving the problem of predicting people's ideology, or political tendency.
We estimate it by using Twitter data, and formalize it as a classification problem.
arXiv Detail & Related papers (2020-06-02T00:00:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.