Towards A Friendly Online Community: An Unsupervised Style Transfer
Framework for Profanity Redaction
- URL: http://arxiv.org/abs/2011.00403v1
- Date: Sun, 1 Nov 2020 02:10:25 GMT
- Title: Towards A Friendly Online Community: An Unsupervised Style Transfer
Framework for Profanity Redaction
- Authors: Minh Tran, Yipeng Zhang, Mohammad Soleymani
- Abstract summary: Offensive and abusive language is a pressing problem on social media platforms.
We propose a method for transforming offensive comments, statements containing profanity or offensive language, into non-offensive ones.
- Score: 17.380204095038795
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Offensive and abusive language is a pressing problem on social media
platforms. In this work, we propose a method for transforming offensive
comments, statements containing profanity or offensive language, into
non-offensive ones. We design a RETRIEVE, GENERATE and EDIT unsupervised style
transfer pipeline to redact the offensive comments in a word-restricted manner
while maintaining a high level of fluency and preserving the content of the
original text. We extensively evaluate our method's performance and compare it
to previous style transfer models using both automatic metrics and human
evaluations. Experimental results show that our method outperforms other models
on human evaluations and is the only approach that consistently performs well
on all automatic evaluation metrics.
Related papers
- Cobra Effect in Reference-Free Image Captioning Metrics [58.438648377314436]
A proliferation of reference-free methods, leveraging visual-language pre-trained models (VLMs), has emerged.
In this paper, we study if there are any deficiencies in reference-free metrics.
We employ GPT-4V as an evaluative tool to assess generated sentences and the result reveals that our approach achieves state-of-the-art (SOTA) performance.
arXiv Detail & Related papers (2024-02-18T12:36:23Z) - Prompt-Based Editing for Text Style Transfer [25.863546922455498]
We present a prompt-based editing approach for text style transfer.
We transform a prompt-based generation problem into a classification one, which is a training-free process.
Our approach largely outperforms the state-of-the-art systems that have 20 times more parameters.
arXiv Detail & Related papers (2023-01-27T21:31:14Z) - APPDIA: A Discourse-aware Transformer-based Style Transfer Model for
Offensive Social Media Conversations [11.011242089340437]
We release the first publicly-available, parallel corpus of offensive Reddit comments and their style-transferred counterparts annotated by sociolinguists.
We introduce the first discourse-aware style-transfer models that can effectively reduce offensiveness in Reddit text while preserving the meaning of the original text.
arXiv Detail & Related papers (2022-09-17T00:50:24Z) - Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial
Attack Framework [17.17479625646699]
We propose a unified framework to craft textual adversarial samples.
In this paper, we instantiate our framework with an attack algorithm named Textual Projected Gradient Descent (T-PGD)
arXiv Detail & Related papers (2021-10-28T17:31:51Z) - Text Detoxification using Large Pre-trained Neural Models [57.72086777177844]
We present two novel unsupervised methods for eliminating toxicity in text.
First method combines guidance of the generation process with small style-conditional language models.
Second method uses BERT to replace toxic words with their non-offensive synonyms.
arXiv Detail & Related papers (2021-09-18T11:55:32Z) - Methods for Detoxification of Texts for the Russian Language [55.337471467610094]
We introduce the first study of automatic detoxification of Russian texts to combat offensive language.
We test two types of models - unsupervised approach that performs local corrections and supervised approach based on pretrained language GPT-2 model.
The results show that the tested approaches can be successfully used for detoxification, although there is room for improvement.
arXiv Detail & Related papers (2021-05-19T10:37:44Z) - TextFlint: Unified Multilingual Robustness Evaluation Toolkit for
Natural Language Processing [73.16475763422446]
We propose a multilingual robustness evaluation platform for NLP tasks (TextFlint)
It incorporates universal text transformation, task-specific transformation, adversarial attack, subpopulation, and their combinations to provide comprehensive robustness analysis.
TextFlint generates complete analytical reports as well as targeted augmented data to address the shortcomings of the model's robustness.
arXiv Detail & Related papers (2021-03-21T17:20:38Z) - On the interaction of automatic evaluation and task framing in headline
style transfer [6.27489964982972]
In this paper, we propose an evaluation method for a task involving subtle textual differences, such as style transfer.
We show that it better reflects system differences than traditional metrics such as BLEU and ROUGE.
arXiv Detail & Related papers (2021-01-05T16:36:26Z) - On Learning Text Style Transfer with Direct Rewards [101.97136885111037]
Lack of parallel corpora makes it impossible to directly train supervised models for the text style transfer task.
We leverage semantic similarity metrics originally used for fine-tuning neural machine translation models.
Our model provides significant gains in both automatic and human evaluation over strong baselines.
arXiv Detail & Related papers (2020-10-24T04:30:02Z) - Politeness Transfer: A Tag and Generate Approach [167.9924201435888]
This paper introduces a new task of politeness transfer.
It involves converting non-polite sentences to polite sentences while preserving the meaning.
We design a tag and generate pipeline that identifies stylistic attributes and subsequently generates a sentence in the target style.
arXiv Detail & Related papers (2020-04-29T15:08:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.