Hope Speech Detection on Social Media Platforms
- URL: http://arxiv.org/abs/2212.07424v1
- Date: Mon, 14 Nov 2022 10:58:22 GMT
- Title: Hope Speech Detection on Social Media Platforms
- Authors: Pranjal Aggarwal, Pasupuleti Chandana, Jagrut Nemade, Shubham Sharma,
Sunil Saumya, Shankar Biradar
- Abstract summary: This paper discusses various machine learning approaches to identify a sentence as Hope Speech, Non-Hope Speech, or a Neutral sentence.
The dataset used in the study contains English YouTube comments.
- Score: 1.2561455657923906
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Since personal computers became widely available in the consumer market, the
amount of harmful content on the internet has significantly expanded. In simple
terms, harmful content is anything online which causes a person distress or
harm. It may include hate speech, violent content, threats, non-hope speech,
etc. The online content must be positive, uplifting and supportive. Over the
past few years, many studies have focused on solving this problem through hate
speech detection, but very few focused on identifying hope speech. This paper
discusses various machine learning approaches to identify a sentence as Hope
Speech, Non-Hope Speech, or a Neutral sentence. The dataset used in the study
contains English YouTube comments and is released as a part of the shared task
"EACL-2021: Hope Speech Detection for Equality, Diversity, and Inclusion".
Initially, the dataset obtained from the shared task had three classes: Hope
Speech, non-Hope speech, and not in English; however, upon deeper inspection,
we discovered that dataset relabeling is required. A group of undergraduates
was hired to help perform the entire dataset's relabeling task. We experimented
with conventional machine learning models (such as Na\"ive Bayes, logistic
regression and support vector machine) and pre-trained models (such as BERT) on
relabeled data. According to the experimental results, the relabeled data has
achieved a better accuracy for Hope speech identification than the original
data set.
Related papers
- Explain Thyself Bully: Sentiment Aided Cyberbullying Detection with
Explanation [52.3781496277104]
Cyberbullying has become a big issue with the popularity of different social media networks and online communication apps.
Recent laws like "right to explanations" of General Data Protection Regulation have spurred research in developing interpretable models.
We develop first interpretable multi-task model called em mExCB for automatic cyberbullying detection from code-mixed languages.
arXiv Detail & Related papers (2024-01-17T07:36:22Z) - PolyHope: Two-Level Hope Speech Detection from Tweets [68.8204255655161]
Despite its importance, hope has rarely been studied as a social media analysis task.
This paper presents a hope speech dataset that classifies each tweet first into "Hope" and "Not Hope"
English tweets in the first half of 2022 were collected to build this dataset.
arXiv Detail & Related papers (2022-10-25T16:34:03Z) - Assessing the impact of contextual information in hate speech detection [0.48369513656026514]
We provide a novel corpus for contextualized hate speech detection based on user responses to news posts from media outlets on Twitter.
This corpus was collected in the Rioplatense dialectal variety of Spanish and focuses on hate speech associated with the COVID-19 pandemic.
arXiv Detail & Related papers (2022-10-02T09:04:47Z) - SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data [100.46303484627045]
We propose a cross-modal Speech and Language Model (SpeechLM) to align speech and text pre-training with a pre-defined unified representation.
Specifically, we introduce two alternative discrete tokenizers to bridge the speech and text modalities.
We evaluate SpeechLM on various spoken language processing tasks including speech recognition, speech translation, and universal representation evaluation framework SUPERB.
arXiv Detail & Related papers (2022-09-30T09:12:10Z) - Emotion Based Hate Speech Detection using Multimodal Learning [0.0]
This paper proposes the first multimodal deep learning framework to combine the auditory features representing emotion and the semantic features to detect hateful content.
Our results demonstrate that incorporating emotional attributes leads to significant improvement over text-based models in detecting hateful multimedia content.
arXiv Detail & Related papers (2022-02-13T05:39:47Z) - Addressing the Challenges of Cross-Lingual Hate Speech Detection [115.1352779982269]
In this paper we focus on cross-lingual transfer learning to support hate speech detection in low-resource languages.
We leverage cross-lingual word embeddings to train our neural network systems on the source language and apply it to the target language.
We investigate the issue of label imbalance of hate speech datasets, since the high ratio of non-hate examples compared to hate examples often leads to low model performance.
arXiv Detail & Related papers (2022-01-15T20:48:14Z) - Hate Speech detection in the Bengali language: A dataset and its
baseline evaluation [0.8793721044482612]
This paper presents a new dataset of 30,000 user comments tagged by crowd sourcing and varified by experts.
All the comments are collected from YouTube and Facebook comment section and classified into seven categories.
A total of 50 annotators annotated each comment three times and the majority vote was taken as the final annotation.
arXiv Detail & Related papers (2020-12-17T15:53:54Z) - Trawling for Trolling: A Dataset [56.1778095945542]
We present a dataset that models trolling as a subcategory of offensive content.
The dataset has 12,490 samples, split across 5 classes; Normal, Profanity, Trolling, Derogatory and Hate Speech.
arXiv Detail & Related papers (2020-08-02T17:23:55Z) - "Notic My Speech" -- Blending Speech Patterns With Multimedia [65.91370924641862]
We propose a view-temporal attention mechanism to model both the view dependence and the visemic importance in speech recognition and understanding.
Our proposed method outperformed the existing work by 4.99% in terms of the viseme error rate.
We show that there is a strong correlation between our model's understanding of multi-view speech and the human perception.
arXiv Detail & Related papers (2020-06-12T06:51:55Z) - Transfer Learning for Hate Speech Detection in Social Media [14.759208309842178]
This paper uses a transfer learning technique to leverage two independent datasets jointly.
We build an interpretable two-dimensional visualization tool of the constructed hate speech representation -- dubbed the Map of Hate.
We show that the joint representation boosts prediction performances when only a limited amount of supervision is available.
arXiv Detail & Related papers (2019-06-10T08:00:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.