Offensive Language Detection: A Comparative Analysis
- URL: http://arxiv.org/abs/2001.03131v1
- Date: Thu, 9 Jan 2020 17:48:44 GMT
- Title: Offensive Language Detection: A Comparative Analysis
- Authors: Vyshnav M T, Sachin Kumar S, Soman K P
- Abstract summary: We explore the effectiveness of Google sentence encoder, Fasttext, Dynamic mode decomposition (DMD) based features and Random kitchen sink (RKS) method for offensive language detection.
From the experiments and evaluation we observed that RKS with fastetxt achieved competing results.
- Score: 2.5739449801033842
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Offensive behaviour has become pervasive in the Internet community.
Individuals take the advantage of anonymity in the cyber world and indulge in
offensive communications which they may not consider in the real life.
Governments, online communities, companies etc are investing into prevention of
offensive behaviour content in social media. One of the most effective solution
for tacking this enigmatic problem is the use of computational techniques to
identify offensive content and take action. The current work focuses on
detecting offensive language in English tweets. The dataset used for the
experiment is obtained from SemEval-2019 Task 6 on Identifying and Categorizing
Offensive Language in Social Media (OffensEval). The dataset contains 14,460
annotated English tweets. The present paper provides a comparative analysis and
Random kitchen sink (RKS) based approach for offensive language detection. We
explore the effectiveness of Google sentence encoder, Fasttext, Dynamic mode
decomposition (DMD) based features and Random kitchen sink (RKS) method for
offensive language detection. From the experiments and evaluation we observed
that RKS with fastetxt achieved competing results. The evaluation measures used
are accuracy, precision, recall, f1-score.
Related papers
- Breaking the Silence Detecting and Mitigating Gendered Abuse in Hindi, Tamil, and Indian English Online Spaces [0.6543929004971272]
Team CNLP-NITS-PP developed an ensemble approach combining CNN and BiLSTM networks.
CNN captures localized features indicative of abusive language through its convolution filters applied on embedded input text.
BiLSTM analyzes this sequence for dependencies among words and phrases.
validation scores showed strong performance across f1-measures, especially for English 0.84.
arXiv Detail & Related papers (2024-04-02T14:55:47Z) - OffensiveLang: A Community Based Implicit Offensive Language Dataset [5.813922783967869]
Hate speech or offensive languages exist in both explicit and implicit forms.
OffensiveLang is a community based implicit offensive language dataset.
We present a prompt-based approach that effectively generates implicit offensive languages.
arXiv Detail & Related papers (2024-03-04T20:34:58Z) - Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot
Translation [79.96416609433724]
Zero-shot translation (ZST) aims to translate between unseen language pairs in training data.
The common practice to guide the zero-shot language mapping during inference is to deliberately insert the source and target language IDs.
Recent studies have shown that language IDs sometimes fail to navigate the ZST task, making them suffer from the off-target problem.
arXiv Detail & Related papers (2023-09-28T17:02:36Z) - Verifying the Robustness of Automatic Credibility Assessment [50.55687778699995]
We show that meaning-preserving changes in input text can mislead the models.
We also introduce BODEGA: a benchmark for testing both victim models and attack methods on misinformation detection tasks.
Our experimental results show that modern large language models are often more vulnerable to attacks than previous, smaller solutions.
arXiv Detail & Related papers (2023-03-14T16:11:47Z) - A New Generation of Perspective API: Efficient Multilingual
Character-level Transformers [66.9176610388952]
We present the fundamentals behind the next version of the Perspective API from Google Jigsaw.
At the heart of the approach is a single multilingual token-free Charformer model.
We demonstrate that by forgoing static vocabularies, we gain flexibility across a variety of settings.
arXiv Detail & Related papers (2022-02-22T20:55:31Z) - COLD: A Benchmark for Chinese Offensive Language Detection [54.60909500459201]
We use COLDataset, a Chinese offensive language dataset with 37k annotated sentences.
We also propose textscCOLDetector to study output offensiveness of popular Chinese language models.
Our resources and analyses are intended to help detoxify the Chinese online communities and evaluate the safety performance of generative language models.
arXiv Detail & Related papers (2022-01-16T11:47:23Z) - Ruddit: Norms of Offensiveness for English Reddit Comments [35.83156813452207]
We create the first dataset of English language Reddit comments that has fine-grained, real-valued scores between -1 and 1.
We show that the method produces highly reliable offensiveness scores.
We evaluate the ability of widely-used neural models to predict offensiveness scores on this new dataset.
arXiv Detail & Related papers (2021-06-10T11:27:47Z) - TextHide: Tackling Data Privacy in Language Understanding Tasks [54.11691303032022]
TextHide mitigates privacy risks without slowing down training or reducing accuracy.
It requires all participants to add a simple encryption step to prevent an eavesdropping attacker from recovering private text data.
We evaluate TextHide on the GLUE benchmark, and our experiments show that TextHide can effectively defend attacks on shared gradients or representations.
arXiv Detail & Related papers (2020-10-12T22:22:15Z) - Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for
Offensive Language Detection [55.445023584632175]
We build an offensive language detection system, which combines multi-task learning with BERT-based models.
Our model achieves 91.51% F1 score in English Sub-task A, which is comparable to the first place.
arXiv Detail & Related papers (2020-04-28T11:27:24Z) - Offensive Language Identification in Greek [17.38318315623124]
This paper presents the first Greek annotated dataset for offensive language identification: the Offensive Greek Tweet dataset (OGTD)
OGTD is a manually annotated dataset containing 4,779 posts from Twitter annotated as offensive and not offensive.
Along with a detailed description of the dataset, we evaluate several computational models trained and tested on this data.
arXiv Detail & Related papers (2020-03-16T22:47:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.