On mission Twitter Profiles: A Study of Selective Toxic Behavior
- URL: http://arxiv.org/abs/2401.14252v1
- Date: Thu, 25 Jan 2024 15:42:36 GMT
- Title: On mission Twitter Profiles: A Study of Selective Toxic Behavior
- Authors: Hina Qayyum, Muhammad Ikram, Benjamin Zi Hao Zhao, an D. Wood, Nicolas
Kourtellis, Mohamed Ali Kaafar
- Abstract summary: This study aims to characterize profiles potentially used for influence operations, termed 'on-mission profiles'
Longitudinal data from 138K Twitter or X, profiles and 293M tweets enables profiling based on theme diversity.
- Score: 5.0157204307764625
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The argument for persistent social media influence campaigns, often funded by
malicious entities, is gaining traction. These entities utilize instrumented
profiles to disseminate divisive content and disinformation, shaping public
perception. Despite ample evidence of these instrumented profiles, few
identification methods exist to locate them in the wild. To evade detection and
appear genuine, small clusters of instrumented profiles engage in unrelated
discussions, diverting attention from their true goals. This strategic thematic
diversity conceals their selective polarity towards certain topics and fosters
public trust.
This study aims to characterize profiles potentially used for influence
operations, termed 'on-mission profiles,' relying solely on thematic content
diversity within unlabeled data. Distinguishing this work is its focus on
content volume and toxicity towards specific themes. Longitudinal data from
138K Twitter or X, profiles and 293M tweets enables profiling based on theme
diversity. High thematic diversity groups predominantly produce toxic content
concerning specific themes, like politics, health, and news classifying them as
'on-mission' profiles.
Using the identified ``on-mission" profiles, we design a classifier for
unseen, unlabeled data. Employing a linear SVM model, we train and test it on
an 80/20% split of the most diverse profiles. The classifier achieves a
flawless 100% accuracy, facilitating the discovery of previously unknown
``on-mission" profiles in the wild.
Related papers
- On the Use of Proxies in Political Ad Targeting [49.61009579554272]
We show that major political advertisers circumvented mitigations by targeting proxy attributes.
Our findings have crucial implications for the ongoing discussion on the regulation of political advertising.
arXiv Detail & Related papers (2024-10-18T17:15:13Z) - Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering [8.20929362102942]
Author profiling is the task of inferring characteristics about individuals by analyzing content they share.
We propose a new method for author profiling which aims at distinguishing relevant from irrelevant content first, followed by the actual user profiling only with relevant data.
We evaluate our method for Big Five personality trait prediction on two Twitter corpora.
arXiv Detail & Related papers (2024-09-06T08:43:10Z) - The Anatomy of Conspirators: Unveiling Traits using a Comprehensive
Twitter Dataset [0.0]
We present a novel methodology for constructing a Twitter dataset that encompasses accounts engaged in conspiracy-related activities throughout the year 2022.
This comprehensive collection effort yielded a total of 15K accounts and 37M tweets extracted from their timelines.
We conduct a comparative analysis of the two groups across three dimensions: topics, profiles, and behavioral characteristics.
arXiv Detail & Related papers (2023-08-29T09:35:23Z) - The Looming Threat of Fake and LLM-generated LinkedIn Profiles:
Challenges and Opportunities for Detection and Prevention [0.8808993671472349]
We present a novel method for detecting fake and Large Language Model (LLM)-generated profiles in the LinkedIn Online Social Network.
We show that the suggested method can distinguish between legitimate and fake profiles with an accuracy of about 95% across all word embeddings.
arXiv Detail & Related papers (2023-07-21T19:09:24Z) - Rumor Detection with Self-supervised Learning on Texts and Social Graph [101.94546286960642]
We propose contrastive self-supervised learning on heterogeneous information sources, so as to reveal their relations and characterize rumors better.
We term this framework as Self-supervised Rumor Detection (SRD)
Extensive experiments on three real-world datasets validate the effectiveness of SRD for automatic rumor detection on social media.
arXiv Detail & Related papers (2022-04-19T12:10:03Z) - A deep dive into the consistently toxic 1% of Twitter [9.669275987983447]
This study spans 14 years of tweets from 122K Twitter profiles and more than 293M tweets.
We selected the most extreme profiles in terms of consistency of toxic content and examined their tweet texts, and the domains, hashtags, and URLs they shared.
We found that these selected profiles keep to a narrow theme with lower diversity in hashtags, URLs, and domains, they are thematically similar to each other, and have a high likelihood of bot-like behavior.
arXiv Detail & Related papers (2022-02-16T04:21:48Z) - Decision Making with Differential Privacy under a Fairness Lens [65.16089054531395]
The U.S. Census Bureau releases data sets and statistics about groups of individuals that are used as input to a number of critical decision processes.
To conform to privacy and confidentiality requirements, these agencies are often required to release privacy-preserving versions of the data.
This paper studies the release of differentially private data sets and analyzes their impact on some critical resource allocation tasks under a fairness perspective.
arXiv Detail & Related papers (2021-05-16T21:04:19Z) - Balancing Biases and Preserving Privacy on Balanced Faces in the Wild [50.915684171879036]
There are demographic biases present in current facial recognition (FR) models.
We introduce our Balanced Faces in the Wild dataset to measure these biases across different ethnic and gender subgroups.
We find that relying on a single score threshold to differentiate between genuine and imposters sample pairs leads to suboptimal results.
We propose a novel domain adaptation learning scheme that uses facial features extracted from state-of-the-art neural networks.
arXiv Detail & Related papers (2021-03-16T15:05:49Z) - Information Consumption and Social Response in a Segregated Environment:
the Case of Gab [74.5095691235917]
This work provides a characterization of the interaction patterns within Gab around the COVID-19 topic.
We find that there are no strong statistical differences in the social response to questionable and reliable content.
Our results provide insights toward the understanding of coordinated inauthentic behavior and on the early-warning of information operation.
arXiv Detail & Related papers (2020-06-03T11:34:25Z) - Keystroke Biometrics in Response to Fake News Propagation in a Global
Pandemic [77.79066811371978]
This work proposes and analyzes the use of keystroke biometrics for content de-anonymization.
Fake news have become a powerful tool to manipulate public opinion, especially during major events.
arXiv Detail & Related papers (2020-05-15T17:56:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.