BotTriNet: A Unified and Efficient Embedding for Social Bots Detection
via Metric Learning
- URL: http://arxiv.org/abs/2304.03144v4
- Date: Sat, 6 May 2023 15:09:36 GMT
- Title: BotTriNet: A Unified and Efficient Embedding for Social Bots Detection
via Metric Learning
- Authors: Jun Wu, Xuesong Ye, and Yanyuet Man
- Abstract summary: We propose BOTTRINET, a unified embedding framework that leverages the textual content posted by accounts to detect bots.
The BOTTRINET framework produces word, sentence, and account embeddings, which we evaluate on a real-world dataset.
Our approach achieves state-of-the-art performance on two content-intensive bot sets, with an average accuracy of 98.34% and f1score of 97.99%.
- Score: 3.9026461169566673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The rapid and accurate identification of bot accounts in online social
networks is an ongoing challenge. In this paper, we propose BOTTRINET, a
unified embedding framework that leverages the textual content posted by
accounts to detect bots. Our approach is based on the premise that account
personalities and habits can be revealed through their contextual content. To
achieve this, we designed a triplet network that refines raw embeddings using
metric learning techniques. The BOTTRINET framework produces word, sentence,
and account embeddings, which we evaluate on a real-world dataset, CRESCI2017,
consisting of three bot account categories and five bot sample sets. Our
approach achieves state-of-the-art performance on two content-intensive bot
sets, with an average accuracy of 98.34% and f1score of 97.99%. Moreover, our
method makes a significant breakthrough on four content-less bot sets, with an
average accuracy improvement of 11.52% and an average f1score increase of
16.70%. Our contribution is twofold: First, we propose a unified and effective
framework that combines various embeddings for bot detection. Second, we
demonstrate that metric learning techniques can be applied in this context to
refine raw embeddings and improve classification performance. Our approach
outperforms prior works and sets a new standard for bot detection in social
networks.
Related papers
- Unmasking Social Bots: How Confident Are We? [41.94295877935867]
We propose to address both bot detection and the quantification of uncertainty at the account level.
This dual focus is crucial as it allows us to leverage additional information related to the quantified uncertainty of each prediction.
Specifically, our approach facilitates targeted interventions for bots when predictions are made with high confidence and suggests caution (e.g., gathering more data) when predictions are uncertain.
arXiv Detail & Related papers (2024-07-18T22:33:52Z) - Bridging the Gap Between End-to-End and Two-Step Text Spotting [88.14552991115207]
Bridging Text Spotting is a novel approach that resolves the error accumulation and suboptimal performance issues in two-step methods.
We demonstrate the effectiveness of the proposed method through extensive experiments.
arXiv Detail & Related papers (2024-04-06T13:14:04Z) - BotSSCL: Social Bot Detection with Self-Supervised Contrastive Learning [6.317191658158437]
We propose a novel framework for social Bot detection with Self-Supervised Contrastive Learning (BotSSCL)
BotSSCL uses contrastive learning to distinguish between social bots and humans in the embedding space to improve linear separability.
We demonstrate BotSSCL's robustness against adversarial attempts to manipulate bot accounts to evade detection.
arXiv Detail & Related papers (2024-02-06T06:13:13Z) - What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection [48.572932773403274]
We investigate the opportunities and risks of large language models in social bot detection.
We propose a mixture-of-heterogeneous-experts framework to divide and conquer diverse user information modalities.
Experiments show that instruction tuning on 1,000 annotated examples produces specialized LLMs that outperform state-of-the-art baselines.
arXiv Detail & Related papers (2024-02-01T06:21:19Z) - My Brother Helps Me: Node Injection Based Adversarial Attack on Social Bot Detection [69.99192868521564]
Social platforms such as Twitter are under siege from a multitude of fraudulent users.
Due to the structure of social networks, the majority of methods are based on the graph neural network(GNN), which is susceptible to attacks.
We propose a node injection-based adversarial attack method designed to deceive bot detection models.
arXiv Detail & Related papers (2023-10-11T03:09:48Z) - BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline [47.61306219245444]
Twitter has become a target for bots and fake accounts, resulting in the spread of false information and manipulation.
This paper introduces a semi-automatic machine learning pipeline (SAMLP) designed to address the challenges correlated with machine learning model development.
We develop a comprehensive bot detection model named BotArtist, based on user profile features.
arXiv Detail & Related papers (2023-05-31T09:12:35Z) - From Online Behaviours to Images: A Novel Approach to Social Bot
Detection [0.3867363075280544]
A particular type of social accounts is known to promote unreputable content, hyperpartisan, and propagandistic information.
We propose a novel approach to bot detection: we first propose a new algorithm that transforms the sequence of actions that an account performs into an image.
We compare our performances with state-of-the-art results for bot detection on genuine accounts / bot accounts datasets well known in the literature.
arXiv Detail & Related papers (2023-04-15T11:36:50Z) - BotShape: A Novel Social Bots Detection Approach via Behavioral Patterns [4.386183132284449]
Based on a real-world data set, we construct behavioral sequences from raw event logs.
We observe differences between bots and genuine users and similar patterns among bot accounts.
We present a novel social bot detection system BotShape, to automatically catch behavioral sequences and characteristics.
arXiv Detail & Related papers (2023-03-17T19:03:06Z) - Fast Uncertainty Quantification for Deep Object Pose Estimation [91.09217713805337]
Deep learning-based object pose estimators are often unreliable and overconfident.
In this work, we propose a simple, efficient, and plug-and-play UQ method for 6-DoF object pose estimation.
arXiv Detail & Related papers (2020-11-16T06:51:55Z) - Detection of Novel Social Bots by Ensembles of Specialized Classifiers [60.63582690037839]
Malicious actors create inauthentic social media accounts controlled in part by algorithms, known as social bots, to disseminate misinformation and agitate online discussion.
We show that different types of bots are characterized by different behavioral features.
We propose a new supervised learning method that trains classifiers specialized for each class of bots and combines their decisions through the maximum rule.
arXiv Detail & Related papers (2020-06-11T22:59:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.