TweepFake: about Detecting Deepfake Tweets
- URL: http://arxiv.org/abs/2008.00036v2
- Date: Thu, 6 May 2021 16:40:26 GMT
- Title: TweepFake: about Detecting Deepfake Tweets
- Authors: Tiziano Fagni, Fabrizio Falchi, Margherita Gambini, Antonio Martella,
Maurizio Tesconi
- Abstract summary: Deep neural models can generate coherent, non-trivial and human-like text samples.
Social bots can write plausible deepfake messages, hoping to contaminate public debate.
We collect the first dataset of real deepfake tweets, TweepFake.
- Score: 3.3482093430607254
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent advances in language modeling significantly improved the
generative capabilities of deep neural models: in 2019 OpenAI released GPT-2, a
pre-trained language model that can autonomously generate coherent, non-trivial
and human-like text samples. Since then, ever more powerful text generative
models have been developed. Adversaries can exploit these tremendous generative
capabilities to enhance social bots that will have the ability to write
plausible deepfake messages, hoping to contaminate public debate. To prevent
this, it is crucial to develop deepfake social media messages detection
systems. However, to the best of our knowledge no one has ever addressed the
detection of machine-generated texts on social networks like Twitter or
Facebook. With the aim of helping the research in this detection field, we
collected the first dataset of \real deepfake tweets, TweepFake. It is real in
the sense that each deepfake tweet was actually posted on Twitter. We collected
tweets from a total of 23 bots, imitating 17 human accounts. The bots are based
on various generation techniques, i.e., Markov Chains, RNN, RNN+Markov, LSTM,
GPT-2. We also randomly selected tweets from the humans imitated by the bots to
have an overall balanced dataset of 25,572 tweets (half human and half bots
generated). The dataset is publicly available on Kaggle. Lastly, we evaluated
13 deepfake text detection methods (based on various state-of-the-art
approaches) to both demonstrate the challenges that Tweepfake poses and create
a solid baseline of detection techniques. We hope that TweepFake can offer the
opportunity to tackle the deepfake detection on social media messages as well.
Related papers
- My Brother Helps Me: Node Injection Based Adversarial Attack on Social Bot Detection [69.99192868521564]
Social platforms such as Twitter are under siege from a multitude of fraudulent users.
Due to the structure of social networks, the majority of methods are based on the graph neural network(GNN), which is susceptible to attacks.
We propose a node injection-based adversarial attack method designed to deceive bot detection models.
arXiv Detail & Related papers (2023-10-11T03:09:48Z) - BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline [47.61306219245444]
Twitter has become a target for bots and fake accounts, resulting in the spread of false information and manipulation.
This paper introduces a semi-automatic machine learning pipeline (SAMLP) designed to address the challenges correlated with machine learning model development.
We develop a comprehensive bot detection model named BotArtist, based on user profile features.
arXiv Detail & Related papers (2023-05-31T09:12:35Z) - Paraphrasing evades detectors of AI-generated text, but retrieval is an
effective defense [56.077252790310176]
We present a paraphrase generation model (DIPPER) that can paraphrase paragraphs, condition on surrounding context, and control lexical diversity and content reordering.
Using DIPPER to paraphrase text generated by three large language models (including GPT3.5-davinci-003) successfully evades several detectors, including watermarking.
We introduce a simple defense that relies on retrieving semantically-similar generations and must be maintained by a language model API provider.
arXiv Detail & Related papers (2023-03-23T16:29:27Z) - Stylometric Detection of AI-Generated Text in Twitter Timelines [17.62006063931326]
Social media platforms like Twitter are highly susceptible to AI-generated misinformation.
A potential threat scenario is when an adversary hijacks a credible user account and incorporates a natural language generator to generate misinformation.
We present a novel algorithm using stylometric signals to aid detecting AI-generated tweets.
arXiv Detail & Related papers (2023-03-07T07:26:09Z) - Machine Learning-based Automatic Annotation and Detection of COVID-19
Fake News [8.020736472947581]
COVID-19 impacted every part of the world, although the misinformation about the outbreak traveled faster than the virus.
Existing work neglects the presence of bots that act as a catalyst in the spread.
We propose an automated approach for labeling data using verified fact-checked statements on a Twitter dataset.
arXiv Detail & Related papers (2022-09-07T13:55:59Z) - Partially Fake Audio Detection by Self-attention-based Fake Span
Discovery [89.21979663248007]
We propose a novel framework by introducing the question-answering (fake span discovery) strategy with the self-attention mechanism to detect partially fake audios.
Our submission ranked second in the partially fake audio detection track of ADD 2022.
arXiv Detail & Related papers (2022-02-14T13:20:55Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - BotSpot: Deep Learning Classification of Bot Accounts within Twitter [2.099922236065961]
The openness feature of Twitter allows programs to generate and control Twitter accounts automatically via the Twitter API.
These accounts, which are known as bots, can automatically perform actions such as tweeting, re-tweeting, following, unfollowing, or direct messaging other accounts.
We introduce a novel bot detection approach using deep learning, with the Multi-layer Perceptron Neural Networks and nine features of a bot account.
arXiv Detail & Related papers (2021-09-08T15:17:10Z) - BERT Transformer model for Detecting Arabic GPT2 Auto-Generated Tweets [6.18447297698017]
We propose a transfer learning based model that will be able to detect if an Arabic sentence is written by humans or automatically generated by bots.
Our new transfer-learning model has obtained an accuracy up to 98%.
To the best of our knowledge, this work is the first study where ARABERT and GPT2 were combined to detect and classify the Arabic auto-generated texts.
arXiv Detail & Related papers (2021-01-22T21:50:38Z) - WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection [82.42495493102805]
We introduce a new dataset WildDeepfake which consists of 7,314 face sequences extracted from 707 deepfake videos collected completely from the internet.
We conduct a systematic evaluation of a set of baseline detection networks on both existing and our WildDeepfake datasets, and show that WildDeepfake is indeed a more challenging dataset, where the detection performance can decrease drastically.
arXiv Detail & Related papers (2021-01-05T11:10:32Z) - Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural
Networks and Word Embeddings [6.09170287691728]
This paper develops a recurrent neural model with word embeddings to distinguish Twitter bots from human accounts.
Experiments show that our approach can achieve competitive performance compared with existing state-of-the-art bot detection systems.
arXiv Detail & Related papers (2020-02-03T17:07:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.