LGB: Language Model and Graph Neural Network-Driven Social Bot Detection
- URL: http://arxiv.org/abs/2406.08762v2
- Date: Fri, 14 Jun 2024 00:31:25 GMT
- Title: LGB: Language Model and Graph Neural Network-Driven Social Bot Detection
- Authors: Ming Zhou, Dan Zhang, Yuandong Wang, Yangli-ao Geng, Yuxiao Dong, Jie Tang,
- Abstract summary: Malicious social bots achieve their malicious purposes by spreading misinformation and inciting social public opinion.
We propose a novel social bot detection framework LGB, which consists of two main components: language model (LM) and graph neural network (GNN)
Experiments on two real-world datasets demonstrate that LGB consistently outperforms state-of-the-art baseline models by up to 10.95%.
- Score: 43.92522451274129
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Malicious social bots achieve their malicious purposes by spreading misinformation and inciting social public opinion, seriously endangering social security, making their detection a critical concern. Recently, graph-based bot detection methods have achieved state-of-the-art (SOTA) performance. However, our research finds many isolated and poorly linked nodes in social networks, as shown in Fig.1, which graph-based methods cannot effectively detect. To address this problem, our research focuses on effectively utilizing node semantics and network structure to jointly detect sparsely linked nodes. Given the excellent performance of language models (LMs) in natural language understanding (NLU), we propose a novel social bot detection framework LGB, which consists of two main components: language model (LM) and graph neural network (GNN). Specifically, the social account information is first extracted into unified user textual sequences, which is then used to perform supervised fine-tuning (SFT) of the language model to improve its ability to understand social account semantics. Next, the semantically enriched node representation is fed into the pre-trained GNN to further enhance the node representation by aggregating information from neighbors. Finally, LGB fuses the information from both modalities to improve the detection performance of sparsely linked nodes. Extensive experiments on two real-world datasets demonstrate that LGB consistently outperforms state-of-the-art baseline models by up to 10.95%. LGB is already online: https://botdetection.aminer.cn/robotmain.
Related papers
- SeBot: Structural Entropy Guided Multi-View Contrastive Learning for Social Bot Detection [34.68635583099056]
We propose SEBot, a novel multi-view graph-based contrastive learning-enabled social bot detector.
In particular, we use structural entropy as an uncertainty metric to optimize the entire graph's structure.
And we design an encoder to enable message passing beyond the homophily assumption.
arXiv Detail & Related papers (2024-05-18T08:16:11Z) - Multitask Active Learning for Graph Anomaly Detection [48.690169078479116]
We propose a novel MultItask acTIve Graph Anomaly deTEction framework, namely MITIGATE.
By coupling node classification tasks, MITIGATE obtains the capability to detect out-of-distribution nodes without known anomalies.
Empirical studies on four datasets demonstrate that MITIGATE significantly outperforms the state-of-the-art methods for anomaly detection.
arXiv Detail & Related papers (2024-01-24T03:43:45Z) - Hierarchical and Incremental Structural Entropy Minimization for
Unsupervised Social Event Detection [61.87480191351659]
Graph neural network (GNN)-based methods enable a fusion of natural language semantics and the complex social network structural information.
In this work, we address social event detection via graph structural entropy (SE) minimization.
While keeping the merits of the GNN-based methods, the proposed framework, HISEvent, constructs more informative message graphs.
arXiv Detail & Related papers (2023-12-19T06:28:32Z) - Muti-scale Graph Neural Network with Signed-attention for Social Bot
Detection: A Frequency Perspective [10.089319405788277]
The presence of a large number of bots on social media has adverse effects.
The graph neural network (GNN) can effectively leverage the social relationships between users and achieve excellent results in detecting bots.
This paper proposes a Multi-scale with Signed-attention Graph Filter for social bot detection called MSGS.
arXiv Detail & Related papers (2023-07-05T00:40:19Z) - Over-Sampling Strategy in Feature Space for Graphs based
Class-imbalanced Bot Detection [10.882979272768502]
A large number of bots in Online Social Networks (OSN) leads to undesirable social effects.
We propose an over-sampling strategy for GNNs that generates samples for the minority class without edge synthesis.
The framework is evaluated using three real-world bot detection benchmark datasets.
arXiv Detail & Related papers (2023-02-14T08:35:33Z) - Evidential Temporal-aware Graph-based Social Event Detection via
Dempster-Shafer Theory [76.4580340399321]
We propose ETGNN, a novel Evidential Temporal-aware Graph Neural Network.
We construct view-specific graphs whose nodes are the texts and edges are determined by several types of shared elements respectively.
Considering the view-specific uncertainty, the representations of all views are converted into mass functions through evidential deep learning (EDL) neural networks.
arXiv Detail & Related papers (2022-05-24T16:22:40Z) - Higher-Order Attribute-Enhancing Heterogeneous Graph Neural Networks [67.25782890241496]
We propose a higher-order Attribute-Enhancing Graph Neural Network (HAEGNN) for heterogeneous network representation learning.
HAEGNN simultaneously incorporates meta-paths and meta-graphs for rich, heterogeneous semantics.
It shows superior performance against the state-of-the-art methods in node classification, node clustering, and visualization.
arXiv Detail & Related papers (2021-04-16T04:56:38Z) - Node2Seq: Towards Trainable Convolutions in Graph Neural Networks [59.378148590027735]
We propose a graph network layer, known as Node2Seq, to learn node embeddings with explicitly trainable weights for different neighboring nodes.
For a target node, our method sorts its neighboring nodes via attention mechanism and then employs 1D convolutional neural networks (CNNs) to enable explicit weights for information aggregation.
In addition, we propose to incorporate non-local information for feature learning in an adaptive manner based on the attention scores.
arXiv Detail & Related papers (2021-01-06T03:05:37Z) - Graph Neighborhood Attentive Pooling [0.5493410630077189]
Network representation learning (NRL) is a powerful technique for learning low-dimensional vector representation of high-dimensional and sparse graphs.
We propose a novel context-sensitive algorithm called GAP that learns to attend on different parts of a node's neighborhood using attentive pooling networks.
arXiv Detail & Related papers (2020-01-28T15:05:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.