BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic
Consistency
- URL: http://arxiv.org/abs/2208.08320v1
- Date: Wed, 17 Aug 2022 14:34:40 GMT
- Title: BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic
Consistency
- Authors: Zhenyu Lei, Herun Wan, Wenqian Zhang, Shangbin Feng, Zilong Chen,
Qinghua Zheng, Minnan Luo
- Abstract summary: We propose a novel model named BIC that makes the text and graph modalities deeply interactive and detects tweet semantic inconsistency.
BIC contains a semantic consistency detection module to learn semantic consistency information from tweets.
Our framework outperforms competitive baselines on a comprehensive Twitter bot benchmark.
- Score: 22.52777462831911
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Twitter bot detection is an important and meaningful task. Existing
text-based methods can deeply analyze user tweet content, achieving high
performance. However, novel Twitter bots evade these detections by stealing
genuine users' tweets and diluting malicious content with benign tweets. These
novel bots are proposed to be characterized by semantic inconsistency. In
addition, methods leveraging Twitter graph structure are recently emerging,
showing great competitiveness. However, hardly a method has made text and graph
modality deeply fused and interacted to leverage both advantages and learn the
relative importance of the two modalities. In this paper, we propose a novel
model named BIC that makes the text and graph modalities deeply interactive and
detects tweet semantic inconsistency. Specifically, BIC contains a text
propagation module, a graph propagation module to conduct bot detection
respectively on text and graph structure, and a proven effective text-graph
interactive module to make the two interact. Besides, BIC contains a semantic
consistency detection module to learn semantic consistency information from
tweets. Extensive experiments demonstrate that our framework outperforms
competitive baselines on a comprehensive Twitter bot benchmark. We also prove
the effectiveness of the proposed interaction and semantic consistency
detection.
Related papers
- Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph [19.409935976725446]
We present a novel method for detecting fake news in social media.
Our method densifies the graph's connectivity to capture denser interaction better.
We use three publicly available fake news datasets, Twitter, PHEME, and Weibo, for evaluation.
arXiv Detail & Related papers (2024-06-14T09:55:54Z) - SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting [126.01629300244001]
We propose a new end-to-end scene text spotting framework termed SwinTextSpotter v2.
We enhance the relationship between two tasks using novel Recognition Conversion and Recognition Alignment modules.
SwinTextSpotter v2 achieved state-of-the-art performance on various multilingual (English, Chinese, and Vietnamese) benchmarks.
arXiv Detail & Related papers (2024-01-15T12:33:00Z) - Verifying the Robustness of Automatic Credibility Assessment [50.55687778699995]
We show that meaning-preserving changes in input text can mislead the models.
We also introduce BODEGA: a benchmark for testing both victim models and attack methods on misinformation detection tasks.
Our experimental results show that modern large language models are often more vulnerable to attacks than previous, smaller solutions.
arXiv Detail & Related papers (2023-03-14T16:11:47Z) - TwiBot-22: Towards Graph-Based Twitter Bot Detection [39.359825215347655]
TwiBot-22 is a graph-based Twitter bot detection benchmark that presents the largest dataset to date.
We re-implement 35 representative Twitter bot detection baselines and evaluate them on 9 datasets, including TwiBot-22.
To facilitate further research, we consolidate all implemented codes and datasets into the TwiBot-22 evaluation framework.
arXiv Detail & Related papers (2022-06-09T15:23:37Z) - Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal
Misinformation [83.2079454464572]
This paper describes our approach to the Image-Text Inconsistency Detection challenge of the DARPA Semantic Forensics (SemaFor) Program.
We collect Twitter-COMMs, a large-scale multimodal dataset with 884k tweets relevant to the topics of Climate Change, COVID-19, and Military Vehicles.
We train our approach, based on the state-of-the-art CLIP model, leveraging automatically generated random and hard negatives.
arXiv Detail & Related papers (2021-12-16T03:37:20Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - Exploiting Twitter as Source of Large Corpora of Weakly Similar Pairs
for Semantic Sentence Embeddings [3.8073142980733]
We propose a language-independent approach to build large datasets of pairs of informal texts weakly similar.
We exploit Twitter's intrinsic powerful signals of relatedness: replies and quotes of tweets.
Our model learns classical Semantic Textual Similarity, but also excels on tasks where pairs of sentences are not exact paraphrases.
arXiv Detail & Related papers (2021-10-05T13:21:40Z) - Exploiting BERT For Multimodal Target SentimentClassification Through
Input Space Translation [75.82110684355979]
We introduce a two-stream model that translates images in input space using an object-aware transformer.
We then leverage the translation to construct an auxiliary sentence that provides multimodal information to a language model.
We achieve state-of-the-art performance on two multimodal Twitter datasets.
arXiv Detail & Related papers (2021-08-03T18:02:38Z) - Towards Accurate Scene Text Recognition with Semantic Reasoning Networks [52.86058031919856]
We propose a novel end-to-end trainable framework named semantic reasoning network (SRN) for accurate scene text recognition.
GSRM is introduced to capture global semantic context through multi-way parallel transmission.
Results on 7 public benchmarks, including regular text, irregular text and non-Latin long text, verify the effectiveness and robustness of the proposed method.
arXiv Detail & Related papers (2020-03-27T09:19:25Z) - Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural
Networks and Word Embeddings [6.09170287691728]
This paper develops a recurrent neural model with word embeddings to distinguish Twitter bots from human accounts.
Experiments show that our approach can achieve competitive performance compared with existing state-of-the-art bot detection systems.
arXiv Detail & Related papers (2020-02-03T17:07:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.