TrollHunter [Evader]: Automated Detection [Evasion] of Twitter Trolls
During the COVID-19 Pandemic
- URL: http://arxiv.org/abs/2012.02586v2
- Date: Mon, 7 Dec 2020 03:00:54 GMT
- Title: TrollHunter [Evader]: Automated Detection [Evasion] of Twitter Trolls
During the COVID-19 Pandemic
- Authors: Peter Jachim and Filipo Sharevski and Paige Treebridge
- Abstract summary: TrollHunter is an automated reasoning mechanism used to hunt for trolls on Twitter during the COVID-19 pandemic in 2020.
To counter the COVID-19 infodemic, the TrollHunter leverages a unique linguistic analysis of a multi-dimensional set of Twitter content features.
TrollHunter achieved 98.5% accuracy, 75.4% precision and 69.8% recall over a dataset of 1.3 million tweets.
- Score: 1.5469452301122175
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents TrollHunter, an automated reasoning mechanism we used to
hunt for trolls on Twitter during the COVID-19 pandemic in 2020. Trolls, poised
to disrupt the online discourse and spread disinformation, quickly seized the
absence of a credible response to COVID-19 and created a COVID-19 infodemic by
promulgating dubious content on Twitter. To counter the COVID-19 infodemic, the
TrollHunter leverages a unique linguistic analysis of a multi-dimensional set
of Twitter content features to detect whether or not a tweet was meant to
troll. TrollHunter achieved 98.5% accuracy, 75.4% precision and 69.8% recall
over a dataset of 1.3 million tweets. Without a final resolution of the
pandemic in sight, it is unlikely that the trolls will go away, although they
might be forced to evade automated hunting. To explore the plausibility of this
strategy, we developed and tested an adversarial machine learning mechanism
called TrollHunter-Evader. TrollHunter-Evader employs a Test Time Evasion (TTE)
approach in a combination with a Markov chain-based mechanism to recycle
originally trolling tweets. The recycled tweets were able to achieve a
remarkable 40% decrease in the TrollHunter's ability to correctly identify
trolling tweets. Because the COVID-19 infodemic could have a harmful impact on
the COVID-19 pandemic, we provide an elaborate discussion about the
implications of employing adversarial machine learning to evade Twitter troll
hunts.
Related papers
- Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against
Fact-Verification Systems [80.3811072650087]
We show that it is possible to subtly modify claim-salient snippets in the evidence and generate diverse and claim-aligned evidence.
The attacks are also robust against post-hoc modifications of the claim.
These attacks can have harmful implications on the inspectable and human-in-the-loop usage scenarios.
arXiv Detail & Related papers (2022-09-07T13:39:24Z) - Overview of Abusive and Threatening Language Detection in Urdu at FIRE
2021 [50.591267188664666]
We present two shared tasks of abusive and threatening language detection for the Urdu language.
We present two manually annotated datasets containing tweets labelled as (i) Abusive and Non-Abusive, and (ii) Threatening and Non-Threatening.
For both subtasks, m-Bert based transformer model showed the best performance.
arXiv Detail & Related papers (2022-07-14T07:38:13Z) - TROLLMAGNIFIER: Detecting State-Sponsored Troll Accounts on Reddit [11.319938541673578]
We present TROLLMAGNIFIER, a detection system for troll accounts.
TROLLMAGNIFIER learns the typical behavior of known troll accounts and identifies more that behave similarly.
We show that using TROLLMAGNIFIER, one can grow the initial knowledge of potential trolls by over 300%.
arXiv Detail & Related papers (2021-12-01T12:10:24Z) - Exposing Paid Opinion Manipulation Trolls [19.834000431578737]
We show how to find paid trolls on the Web using machine learning.
In this paper, we assume that a user who is called a troll by several different people is likely to be such.
We compare the profiles of paid trolls vs. (ii)"mentioned" trolls vs. (iii) non-trolls, and we further show that a classifier trained to distinguish (ii) from (iii) does quite well also at telling apart (i) from (iii)
arXiv Detail & Related papers (2021-09-26T11:40:14Z) - TrollHunter2020: Real-Time Detection of Trolling Narratives on Twitter
During the 2020 US Elections [1.5469452301122175]
TrollHunter2020 is a real-time detection mechanism used to hunt for trolling narratives on Twitter during the 2020 U.S. elections.
Our results suggest that the TrollHunter 2020 indeed captures the emerging trolling narratives in a very early stage of an unfolding polarizing event.
arXiv Detail & Related papers (2020-12-04T14:03:06Z) - "Nice Try, Kiddo": Investigating Ad Hominems in Dialogue Responses [87.89632038677912]
Ad hominem attacks are those that target some feature of a person's character instead of the position the person is maintaining.
We propose categories of ad hominems, compose an annotated dataset, and build a system to analyze human and dialogue responses to English Twitter posts.
Our results indicate that 1) responses from both humans and DialoGPT contain more ad hominems for discussions around marginalized communities, 2) different quantities of ad hominems in the training data can influence the likelihood of generating ad hominems, and 3) we can constrained decoding techniques to reduce ad hominems
arXiv Detail & Related papers (2020-10-24T07:37:49Z) - Understanding the Hoarding Behaviors during the COVID-19 Pandemic using
Large Scale Social Media Data [77.34726150561087]
We analyze the hoarding and anti-hoarding patterns of over 42,000 unique Twitter users in the United States from March 1 to April 30, 2020.
We find the percentage of females in both hoarding and anti-hoarding groups is higher than that of the general Twitter user population.
The LIWC anxiety mean for the hoarding-related tweets is significantly higher than the baseline Twitter anxiety mean.
arXiv Detail & Related papers (2020-10-15T16:02:25Z) - Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media
during the COVID-19 Crisis [51.39895377836919]
COVID-19 has sparked racism and hate on social media targeted towards Asian communities.
We study the evolution and spread of anti-Asian hate speech through the lens of Twitter.
We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months.
arXiv Detail & Related papers (2020-05-25T21:58:09Z) - Russian trolls speaking Russian: Regional Twitter operations and MH17 [68.8204255655161]
In 2018, Twitter released data on accounts identified as Russian trolls.
We analyze the Russian-language operations of these trolls.
We find that trolls' information campaign on the MH17 crash was the largest in terms of tweet count.
arXiv Detail & Related papers (2020-05-13T19:48:12Z) - Detecting Troll Behavior via Inverse Reinforcement Learning: A Case
Study of Russian Trolls in the 2016 US Election [8.332032237125897]
We propose an approach based on Inverse Reinforcement Learning (IRL) to capture troll behavior and identify troll accounts.
As a study case, we consider the troll accounts identified by the US Congress during the investigation of Russian meddling in the 2016 US Presidential election.
We report promising results: the IRL-based approach is able to accurately detect troll accounts.
arXiv Detail & Related papers (2020-01-28T19:50:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.