Tackling Racial Bias in Automated Online Hate Detection: Towards Fair
and Accurate Classification of Hateful Online Users Using Geometric Deep
Learning
- URL: http://arxiv.org/abs/2103.11806v1
- Date: Mon, 22 Mar 2021 13:08:11 GMT
- Title: Tackling Racial Bias in Automated Online Hate Detection: Towards Fair
and Accurate Classification of Hateful Online Users Using Geometric Deep
Learning
- Authors: Zo Ahmed, Bertie Vidgen, and Scott A. Hale
- Abstract summary: This paper examines how fairer and more accurate hateful user detection systems can be developed by incorporating social network information.
Geometric deep learning dynamically learns information-rich network representations and can generalise to unseen nodes.
It produces the most accurate and fairest classifier, with an AUC score of 90.8% on the entire dataset.
- Score: 2.385774752937891
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online hate is a growing concern on many social media platforms and other
sites. To combat it, technology companies are increasingly identifying and
sanctioning `hateful users' rather than simply moderating hateful content. Yet,
most research in online hate detection to date has focused on hateful content.
This paper examines how fairer and more accurate hateful user detection systems
can be developed by incorporating social network information through geometric
deep learning. Geometric deep learning dynamically learns information-rich
network representations and can generalise to unseen nodes. This is essential
for moving beyond manually engineered network features, which lack scalability
and produce information-sparse network representations. This paper compares the
accuracy of geometric deep learning with other techniques which either exclude
network information or incorporate it through manual feature engineering (e.g.,
node2vec). It also evaluates the fairness of these techniques using the
`predictive equality' criteria, comparing the false positive rates on a subset
of 136 African-American users with 4836 other users. Geometric deep learning
produces the most accurate and fairest classifier, with an AUC score of 90.8\%
on the entire dataset and a false positive rate of zero among the
African-American subset for the best performing model. This highlights the
benefits of more effectively incorporating social network features in automated
hateful user detection. Such an approach is also easily operationalized for
real-world content moderation as it has an efficient and scalable design.
Related papers
- AggregHate: An Efficient Aggregative Approach for the Detection of Hatemongers on Social Platforms [4.649475179575046]
We consider a multimodal aggregative approach for the detection of hate-mongers, taking into account the potentially hateful texts, user activity, and the user network.
Our method can be used to improve the classification of coded messages, dog-whistling, and racial gas-lighting, as well as inform intervention measures.
arXiv Detail & Related papers (2024-09-22T14:29:49Z) - Unsupervised Learning via Network-Aware Embeddings [0.0]
We show how to create network-aware embeddings by estimating the network distance between numeric node attributes.
Our method is fully open source and data and code are available to reproduce all results in the paper.
arXiv Detail & Related papers (2023-09-19T08:17:48Z) - Improving Deep Representation Learning via Auxiliary Learnable Target Coding [69.79343510578877]
This paper introduces a novel learnable target coding as an auxiliary regularization of deep representation learning.
Specifically, a margin-based triplet loss and a correlation consistency loss on the proposed target codes are designed to encourage more discriminative representations.
arXiv Detail & Related papers (2023-05-30T01:38:54Z) - Harnessing the Power of Text-image Contrastive Models for Automatic
Detection of Online Misinformation [50.46219766161111]
We develop a self-learning model to explore the constrastive learning in the domain of misinformation identification.
Our model shows the superior performance of non-matched image-text pair detection when the training data is insufficient.
arXiv Detail & Related papers (2023-04-19T02:53:59Z) - D-BIAS: A Causality-Based Human-in-the-Loop System for Tackling
Algorithmic Bias [57.87117733071416]
We propose D-BIAS, a visual interactive tool that embodies human-in-the-loop AI approach for auditing and mitigating social biases.
A user can detect the presence of bias against a group by identifying unfair causal relationships in the causal network.
For each interaction, say weakening/deleting a biased causal edge, the system uses a novel method to simulate a new (debiased) dataset.
arXiv Detail & Related papers (2022-08-10T03:41:48Z) - Active Learning of Ordinal Embeddings: A User Study on Football Data [4.856635699699126]
Humans innately measure distance between instances in an unlabeled dataset using an unknown similarity function.
This work uses deep metric learning to learn these user-defined similarity functions from few annotations for a large football trajectory dataset.
arXiv Detail & Related papers (2022-07-26T07:55:23Z) - Detecting Offensive Language on Social Networks: An End-to-end Detection
Method based on Graph Attention Networks [7.723697303436006]
We propose an end-to-end method based on community structure and text features for offensive language detection (CT-OLD)
We add user opinion to the community structure for representing user features. The user opinion is represented by user historical behavior information, which outperforms that represented by text information.
arXiv Detail & Related papers (2022-03-04T03:57:18Z) - A Comprehensive Survey on Community Detection with Deep Learning [93.40332347374712]
A community reveals the features and connections of its members that are different from those in other communities in a network.
This survey devises and proposes a new taxonomy covering different categories of the state-of-the-art methods.
The main category, i.e., deep neural networks, is further divided into convolutional networks, graph attention networks, generative adversarial networks and autoencoders.
arXiv Detail & Related papers (2021-05-26T14:37:07Z) - Leveraging Multi-domain, Heterogeneous Data using Deep Multitask
Learning for Hate Speech Detection [21.410160004193916]
We propose a Convolution Neural Network based multi-task learning models (MTLs)footnotecode to leverage information from multiple sources.
Empirical analysis performed on three benchmark datasets shows the efficacy of the proposed approach.
arXiv Detail & Related papers (2021-03-23T09:31:01Z) - Anomaly Detection on Attributed Networks via Contrastive Self-Supervised
Learning [50.24174211654775]
We present a novel contrastive self-supervised learning framework for anomaly detection on attributed networks.
Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair.
A graph neural network-based contrastive learning model is proposed to learn informative embedding from high-dimensional attributes and local structure.
arXiv Detail & Related papers (2021-02-27T03:17:20Z) - Meta-Gradient Reinforcement Learning with an Objective Discovered Online [54.15180335046361]
We propose an algorithm based on meta-gradient descent that discovers its own objective, flexibly parameterised by a deep neural network.
Because the objective is discovered online, it can adapt to changes over time.
On the Atari Learning Environment, the meta-gradient algorithm adapts over time to learn with greater efficiency.
arXiv Detail & Related papers (2020-07-16T16:17:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.