Related papers: Retweet-BERT: Political Leaning Detection Using Language Features and Information Diffusion on Social Networks

Retweet-BERT: Political Leaning Detection Using Language Features and Information Diffusion on Social Networks

URL: http://arxiv.org/abs/2207.08349v4
Date: Thu, 6 Apr 2023 18:48:15 GMT
Title: Retweet-BERT: Political Leaning Detection Using Language Features and Information Diffusion on Social Networks
Authors: Julie Jiang, Xiang Ren, Emilio Ferrara
Abstract summary: We introduce Retweet-BERT, a simple and scalable model to estimate the political leanings of Twitter users. Our assumptions stem from patterns of networks and linguistics homophily among people who share similar ideologies.
Score: 30.143148646797265
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Estimating the political leanings of social media users is a challenging and ever more pressing problem given the increase in social media consumption. We introduce Retweet-BERT, a simple and scalable model to estimate the political leanings of Twitter users. Retweet-BERT leverages the retweet network structure and the language used in users' profile descriptions. Our assumptions stem from patterns of networks and linguistics homophily among people who share similar ideologies. Retweet-BERT demonstrates competitive performance against other state-of-the-art baselines, achieving 96%-97% macro-F1 on two recent Twitter datasets (a COVID-19 dataset and a 2020 United States presidential elections dataset). We also perform manual validation to validate the performance of Retweet-BERT on users not in the training data. Finally, in a case study of COVID-19, we illustrate the presence of political echo chambers on Twitter and show that it exists primarily among right-leaning users. Our code is open-sourced and our data is publicly available.

Related papers

Incentivizing News Consumption on Social Media Platforms Using Large Language Models and Realistic Bot Accounts [4.06613683722116]
This project examines how to enhance users' exposure to and engagement with verified and ideologically balanced news on Twitter. We created 28 bots that replied to users tweeting about sports, entertainment, or lifestyle with a contextual reply. To test differential effects by gender of the bots, treated users were randomly assigned to receive responses by bots presented as female or male. We find that the treated users followed more news accounts and the users in the female bot treatment were more likely to like news content than the control.
arXiv Detail & Related papers (2024-03-20T07:44:06Z)
Detecting Political Opinions in Tweets through Bipartite Graph Analysis: A Skip Aggregation Graph Convolution Approach [9.350629400940493]
We focus on the 2020 US presidential election and create a large-scale dataset from Twitter. To detect political opinions in tweets, we build a user-tweet bipartite graph based on users' posting and retweeting behaviors. We introduce a novel skip aggregation mechanism that makes tweet nodes aggregate information from second-order neighbors.
arXiv Detail & Related papers (2023-04-22T10:38:35Z)
Design and analysis of tweet-based election models for the 2021 Mexican legislative election [55.41644538483948]
We use a dataset of 15 million election-related tweets in the six months preceding election day. We find that models using data with geographical attributes determine the results of the election with better precision and accuracy than conventional polling methods.
arXiv Detail & Related papers (2023-01-02T12:40:05Z)
Tweets2Stance: Users stance detection exploiting Zero-Shot Learning Algorithms on Tweets [0.06372261626436675]
The aim of the study is to predict the stance of a Party p in regard to each statement s exploiting what the Twitter Party account wrote on Twitter. Results obtained from multiple experiments show that Tweets2Stance can correctly predict the stance with a general minimum MAE of 1.13, which is a great achievement considering the task complexity.
arXiv Detail & Related papers (2022-04-22T14:00:11Z)
Political Communities on Twitter: Case Study of the 2022 French Presidential Election [14.783829037950984]
We aim to identify political communities formed on Twitter during the 2022 French presidential election. We create a large-scale Twitter dataset containing 1.2 million users and 62.6 million tweets that mention keywords relevant to the election. We perform community detection on a retweet graph of users and propose an in-depth analysis of the stance of each community.
arXiv Detail & Related papers (2022-04-15T12:18:16Z)
Identification of Twitter Bots based on an Explainable ML Framework: the US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data. Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm. Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z)
News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation. Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content. The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z)
Understanding Information Spreading Mechanisms During COVID-19 Pandemic by Analyzing the Impact of Tweet Text and User Features for Retweet Prediction [6.658785818853953]
COVID-19 has affected the world economy and the daily life routine of almost everyone. Social media platforms enable users to share information with other users who can reshare this information. We propose two CNN and RNN based models and evaluate the performance of these models on a publicly available TweetsCOV19 dataset.
arXiv Detail & Related papers (2021-05-26T15:55:58Z)
Privacy-Aware Recommender Systems Challenge on Twitter's Home Timeline [47.434392695347924]
RecSys 2020 Challenge organized by ACM RecSys in partnership with Twitter using this dataset. This paper touches on the key challenges faced by researchers and professionals striving to predict user engagements.
arXiv Detail & Related papers (2020-04-28T23:54:33Z)
Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms. We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features. We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z)

This list is automatically generated from the titles and abstracts of the papers in this site.