A systematic review of Hate Speech automatic detection using Natural
Language Processing
- URL: http://arxiv.org/abs/2106.00742v1
- Date: Sat, 22 May 2021 21:48:14 GMT
- Authors: Md Saroar Jahan, Mourad Oussalah
- Abstract summary: This paper provides a systematic review of literature in this field, with a focus on natural language processing and deep learning technologies.
Existing surveys, limitations, and future research directions are extensively discussed.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the multiplication of social media platforms, which offer anonymity,
easy access, online community formation, and online debate, the issue of
hate speech detection and tracking has become a growing challenge for society,
individuals, policy-makers and researchers. Despite efforts to leverage
automatic techniques for detection and monitoring, their performance
is still far from satisfactory, which calls for further research on
the issue. This paper provides a systematic review of the literature in this field,
with a focus on natural language processing and deep learning technologies,
highlighting the terminology, processing pipeline, and core methods employed, with
a focal point on deep learning architectures. From a methodological perspective,
we adopt the PRISMA guideline for systematic reviews, covering the last 10 years of
literature from the ACM Digital Library and Google Scholar. In the sequel, existing surveys,
limitations, and future research directions are extensively discussed.
Related papers
- A Survey of Stance Detection on Social Media: New Directions and Perspectives (2024-09-24T03:06:25Z)
Stance detection has emerged as a crucial subfield within affective computing.
Recent years have seen a surge of research interest in developing effective stance detection methods.
This paper provides a comprehensive survey of stance detection techniques on social media.
- Comprehensive Study on Sentiment Analysis: From Rule-based to modern LLM based system (2024-09-16T04:44:52Z)
This study examines the historical development of sentiment analysis, highlighting the transition from lexicon-based and pattern-based approaches to more sophisticated machine learning and deep learning models.
The paper reviews state-of-the-art approaches, identifies emerging trends, and outlines future research directions to advance the field.
- Ontology Embedding: A Survey of Methods, Applications and Resources (2024-06-16T14:49:19Z)
Ontologies are widely used for representing domain knowledge and meta data.
One straightforward solution is to integrate statistical analysis and machine learning.
Numerous papers have been published on embedding, but a lack of systematic reviews hinders researchers from gaining a comprehensive understanding of this field.
- Navigating the Landscape of Hint Generation Research: From the Past to the Future (2024-04-06T20:42:46Z)
We present a review of prior research on hint generation, aiming to bridge the gap between research in education and cognitive science.
We propose a formal definition of the hint generation task, and discuss the roadmap of building an effective hint generation system.
- Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements (2023-11-22T02:45:01Z)
This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques.
We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models.
A key contribution is the implementation of an interpretability framework using Integrated Gradients, providing explainable insights crucial for law enforcement.
- Deep Learning for Visual Speech Analysis: A Survey (2022-05-22T14:44:53Z)
This paper presents a review of recent progress in deep learning methods on visual speech analysis.
We cover different aspects of visual speech, including fundamental problems, challenges, benchmark datasets, a taxonomy of existing methods, and state-of-the-art performance.
- Threat of Adversarial Attacks on Deep Learning in Computer Vision: Survey II (2021-08-01T08:54:47Z)
Deep Learning is vulnerable to adversarial attacks that can manipulate its predictions.
This article reviews the contributions made by the computer vision community in adversarial attacks on deep learning.
It provides definitions of technical terminologies for non-experts in this domain.
- Software-Based Dialogue Systems: Survey, Taxonomy and Challenges (2021-06-21T07:41:44Z)
This paper reports a survey of the current state of research of conversational agents through a systematic literature review of secondary studies.
As a result, this research proposes a holistic taxonomy of the different dimensions involved in the conversational agents' field.
- Fairness in Machine Learning: A Survey (2020-10-04T21:01:34Z)
There is significant literature on approaches to mitigate bias and promote fairness.
This article seeks to provide an overview of the different schools of thought and approaches to mitigating (social) biases and increasing fairness in the Machine Learning literature.
It organises approaches into the widely accepted framework of pre-processing, in-processing, and post-processing methods, subcategorizing into a further 11 method areas.
- On the Social and Technical Challenges of Web Search Autosuggestion Moderation (2020-07-09T19:22:00Z)
Autosuggestions are typically generated by machine learning (ML) systems trained on a corpus of search logs and document representations.
While current search engines have become increasingly proficient at suppressing such problematic suggestions, persistent issues remain.
We discuss several dimensions of problematic suggestions, difficult issues along the pipeline, and why our discussion applies to the increasing number of applications beyond web search.