Hostility Detection in UK Politics: A Dataset on Online Abuse Targeting MPs
- URL: http://arxiv.org/abs/2412.04046v1
- Date: Thu, 05 Dec 2024 10:37:38 GMT
- Title: Hostility Detection in UK Politics: A Dataset on Online Abuse Targeting MPs
- Authors: Mugdha Pandya, Mali Jin, Kalina Bontcheva, Diana Maynard
- Abstract summary: Politicians are typically targeted in relation to their governmental role, but the comments also tend to attack their personal identity.
We construct a dataset of 3,320 English tweets spanning a two-year period manually annotated for hostility towards UK MPs.
We perform linguistic and topical analyses to delve into the unique content of the UK political data.
- Score: 3.3760198210089345
- Abstract: Numerous politicians use social media platforms, particularly X, to engage with their constituents. This interaction allows constituents to pose questions and offer feedback but also exposes politicians to a barrage of hostile responses, especially given the anonymity afforded by social media. They are typically targeted in relation to their governmental role, but the comments also tend to attack their personal identity. This can discredit politicians and reduce public trust in the government. It can also incite anger and disrespect, leading to offline harm and violence. While numerous models exist for detecting hostility in general, they lack the specificity required for political contexts. Furthermore, addressing hostility towards politicians demands tailored approaches due to the distinct language and issues inherent to each country (e.g., Brexit for the UK). To bridge this gap, we construct a dataset of 3,320 English tweets spanning a two-year period manually annotated for hostility towards UK MPs. Our dataset also captures the targeted identity characteristics (race, gender, religion, none) in hostile tweets. We perform linguistic and topical analyses to delve into the unique content of the UK political data. Finally, we evaluate the performance of pre-trained language models and large language models on binary hostility detection and multi-class targeted identity type classification tasks. Our study offers valuable data and insights for future research on the prevalence and nature of politics-related hostility specific to the UK.
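The two evaluation tasks named in the abstract can be made concrete with a small sketch of how one annotated tweet might feed each of them. This is only an illustration under stated assumptions: the record layout, the field names (`text`, `hostile`, `identity`), the label spellings, and the choice to leave `identity` unset for non-hostile tweets are all hypothetical, since the abstract does not describe the released file format.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Identity categories listed in the abstract; the string spellings are assumptions.
IDENTITY_TYPES = ("race", "gender", "religion", "none")


@dataclass(frozen=True)
class AnnotatedTweet:
    """Hypothetical record layout; the real dataset's field names are not given."""
    text: str
    hostile: bool
    identity: Optional[str] = None  # assumed to be set only for hostile tweets

    def __post_init__(self) -> None:
        if self.hostile and self.identity not in IDENTITY_TYPES:
            raise ValueError(f"hostile tweet needs an identity in {IDENTITY_TYPES}")
        if not self.hostile and self.identity is not None:
            raise ValueError("identity type is only annotated for hostile tweets")


def binary_task(tweets: List[AnnotatedTweet]) -> List[Tuple[str, int]]:
    """(text, 0/1) pairs for binary hostility detection over all tweets."""
    return [(t.text, int(t.hostile)) for t in tweets]


def identity_task(tweets: List[AnnotatedTweet]) -> List[Tuple[str, int]]:
    """(text, class index) pairs for identity-type classification, hostile tweets only."""
    return [(t.text, IDENTITY_TYPES.index(t.identity)) for t in tweets if t.hostile]


# Placeholder examples invented for illustration (not taken from the dataset).
SAMPLE = [
    AnnotatedTweet("placeholder reply A", hostile=False),
    AnnotatedTweet("placeholder reply B", hostile=True, identity="gender"),
    AnnotatedTweet("placeholder reply C", hostile=True, identity="none"),
]

if __name__ == "__main__":
    print(binary_task(SAMPLE))
    print(identity_task(SAMPLE))
```

Keeping one record type and deriving both label sets from it means the binary and multi-class views cannot drift apart; whether the authors organise their data this way is not stated in the abstract.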
Related papers
- A Federated Approach to Few-Shot Hate Speech Detection for Marginalized Communities [43.37824420609252]
Hate speech online remains an understudied issue for marginalized communities.
In this paper, we aim to provide marginalized communities living in societies where the dominant language is low-resource with a privacy-preserving tool to protect themselves from hate speech on the internet.
arXiv Detail & Related papers (2024-12-06T11:00:05Z)
- On the Use of Proxies in Political Ad Targeting [49.61009579554272]
We show that major political advertisers circumvented mitigations by targeting proxy attributes.
Our findings have crucial implications for the ongoing discussion on the regulation of political advertising.
arXiv Detail & Related papers (2024-10-18T17:15:13Z)
- Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis [44.17106903728264]
Most hate speech datasets neglect the cultural diversity within a single language.
To address this, we introduce CREHate, a CRoss-cultural English Hate speech dataset.
Only 56.2% of the posts in CREHate achieve consensus among all countries, with the highest pairwise label difference rate of 26%.
arXiv Detail & Related papers (2023-08-31T13:14:47Z)
- When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks [45.14664901245331]
A crucial problem in hate speech detection is determining whether a statement is offensive to a demographic group.
We construct a model that predicts individual annotator ratings on potentially offensive text.
We find that annotator ratings can be predicted using their demographic information and opinions on online content.
arXiv Detail & Related papers (2023-05-11T07:55:20Z)
- A Spanish dataset for Targeted Sentiment Analysis of political headlines [0.0]
This work addresses the task of Targeted Sentiment Analysis for the domain of news headlines, published by the main outlets during the 2019 Argentinean Presidential Elections.
We present a polarity dataset of 1,976 headlines mentioning candidates in the 2019 elections at the target level.
Preliminary experiments with state-of-the-art classification algorithms based on pre-trained linguistic models suggest that target information is helpful for this task.
arXiv Detail & Related papers (2022-08-30T01:30:30Z)
- You Don't Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers' Private Personas [44.82330540456883]
We show that speakers' personas can be inferred through a simple neural network with high accuracy.
We conduct extensive experiments to demonstrate that our proposed defense objectives can greatly reduce the attack accuracy from 37.6% to 0.5%.
arXiv Detail & Related papers (2022-04-26T09:36:18Z)
- Negativity Spreads Faster: A Large-Scale Multilingual Twitter Analysis on the Role of Sentiment in Political Communication [7.136205674624813]
This paper attempts to analyse tweets of politicians from three European countries.
By utilising state-of-the-art pre-trained language models, we performed sentiment analysis on hundreds of thousands of tweets.
Our analysis indicates that politicians' negatively charged tweets spread more widely, especially in more recent times.
arXiv Detail & Related papers (2022-02-01T13:25:19Z)
- The Spread of Propaganda by Coordinated Communities on Social Media [43.2770127582382]
We analyze the spread of propaganda and its interplay with coordinated behavior on a large Twitter dataset about the 2019 UK general election.
The combination of the use of propaganda and coordinated behavior allows us to uncover the authenticity and harmfulness of the different communities.
arXiv Detail & Related papers (2021-09-27T13:39:10Z)
- Hate versus Politics: Detection of Hate against Policy makers in Italian tweets [0.6289422225292998]
This paper addresses the issue of classification of hate speech against policy makers from Twitter in Italian.
We collected and annotated 1264 tweets, examined the cases of disagreements between annotators, and performed in-domain and cross-domain hate speech classifications.
We achieved a performance of ROC AUC 0.83 and analyzed the most predictive attributes, also finding the different language features in the anti-policymakers and anti-immigration domains.
arXiv Detail & Related papers (2021-07-12T12:24:45Z)
- Towards Measuring Adversarial Twitter Interactions against Candidates in the US Midterm Elections [25.374045377135307]
We measure the adversarial interactions against candidates for the US House of Representatives during the run-up to the 2018 US general election.
We develop a new technique for detecting tweets with toxic content that are directed at any specific candidate.
We use these techniques to outline the breadth of adversarial interactions seen in the election, including offensive name-calling, threats of violence, posting discrediting information, attacks on identity, and adversarial message repetition.
arXiv Detail & Related papers (2020-05-09T10:00:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.