CS-UM6P at SemEval-2022 Task 6: Transformer-based Models for Intended
Sarcasm Detection in English and Arabic
- URL: http://arxiv.org/abs/2206.08415v1
- Date: Thu, 16 Jun 2022 19:14:54 GMT
- Title: CS-UM6P at SemEval-2022 Task 6: Transformer-based Models for Intended
Sarcasm Detection in English and Arabic
- Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Abderrahman
Skiredj, Ismail Berrada
- Abstract summary: Sarcasm is a form of figurative language where the intended meaning of a sentence differs from its literal meaning.
In this paper, we present our participating system to the intended sarcasm detection task in English and Arabic languages.
- Score: 6.221019624345408
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Sarcasm is a form of figurative language where the intended meaning of a
sentence differs from its literal meaning. This poses a serious challenge to
several Natural Language Processing (NLP) applications such as Sentiment
Analysis, Opinion Mining, and Author Profiling. In this paper, we present our
participating system to the intended sarcasm detection task in English and
Arabic languages. Our system\footnote{The source code of our system is
available at \url{https://github.com/AbdelkaderMH/iSarcasmEval}} consists of
three deep learning-based models leveraging two existing pre-trained language
models for Arabic and English. We have participated in all sub-tasks. Our
official submissions achieve the best performance on sub-task A for Arabic
language and rank second in sub-task B. For sub-task C, our system is ranked
7th and 11th on Arabic and English datasets, respectively.
Related papers
- SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection [68.858931667807]
Subtask A is a binary classification task determining whether a text is written by a human or generated by a machine.
Subtask B is to detect the exact source of a text, discerning whether it is written by a human or generated by a specific LLM.
Subtask C aims to identify the changing point within a text, at which the authorship transitions from human to machine.
arXiv Detail & Related papers (2024-04-22T13:56:07Z) - Cross-Lingual NER for Financial Transaction Data in Low-Resource
Languages [70.25418443146435]
We propose an efficient modeling framework for cross-lingual named entity recognition in semi-structured text data.
We employ two independent datasets of SMSs in English and Arabic, each carrying semi-structured banking transaction information.
With access to only 30 labeled samples, our model can generalize the recognition of merchants, amounts, and other fields from English to Arabic.
arXiv Detail & Related papers (2023-07-16T00:45:42Z) - Evaluating and Modeling Attribution for Cross-Lingual Question Answering [80.4807682093432]
This work is the first to study attribution for cross-lingual question answering.
We collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system.
We find that a substantial portion of the answers is not attributable to any retrieved passages.
arXiv Detail & Related papers (2023-05-23T17:57:46Z) - X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and
Arabic Sarcasm Detection [12.241656852060606]
This paper describes the X-PuDu system that participated in SemEval-2022 Task 6, iSarcasmEval - Intended Sarcasm Detection in English and Arabic.
Our solution finetunes pre-trained language models, such as ERNIE-M and DeBERTa, under the multilingual settings to recognize the irony from Arabic and English texts.
arXiv Detail & Related papers (2022-11-30T10:34:08Z) - BFCAI at SemEval-2022 Task 6: Multi-Layer Perceptron for Sarcasm
Detection in Arabic Texts [0.0]
This paper describes the systems submitted to iSarcasm shared task.
The aim of iSarcasm is to identify the sarcastic contents in Arabic and English text.
A multi-Layer machine learning based model has been submitted for Arabic sarcasm detection.
arXiv Detail & Related papers (2022-05-18T11:33:07Z) - Plumeria at SemEval-2022 Task 6: Robust Approaches for Sarcasm Detection
for English and Arabic Using Transformers and Data Augmentation [0.951828574518325]
This paper describes our submission to SemEval-2022 Task 6 on sarcasm detection and its five subtasks for English and Arabic.
For detecting sarcasm, we used deep learning techniques based on transformers due to its success in the field of Natural Language Processing (NLP) without the need for feature engineering.
arXiv Detail & Related papers (2022-03-08T14:33:45Z) - Intent Classification Using Pre-Trained Embeddings For Low Resource
Languages [67.40810139354028]
Building Spoken Language Understanding systems that do not rely on language specific Automatic Speech Recognition is an important yet less explored problem in language processing.
We present a comparative study aimed at employing a pre-trained acoustic model to perform Spoken Language Understanding in low resource scenarios.
We perform experiments across three different languages: English, Sinhala, and Tamil each with different data sizes to simulate high, medium, and low resource scenarios.
arXiv Detail & Related papers (2021-10-18T13:06:59Z) - Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in
Arabic Language [1.1254693939127909]
This paper introduces an end-to-end deep Multi-Task Learning (MTL) model, allowing knowledge interaction between the two tasks.
Overall obtained results show that our proposed model outperforms its single-task counterparts on both Arabic Sentiment Analysis (SA) and sarcasm detection sub-tasks.
arXiv Detail & Related papers (2021-06-23T16:00:32Z) - Combining Context-Free and Contextualized Representations for Arabic
Sarcasm Detection and Sentiment Identification [0.0]
This paper proffers team SPPU-AASM's submission for the WANLP ArSarcasm shared-task 2021, which centers around the sarcasm and sentiment polarity detection of Arabic tweets.
The proposed system achieves a F1-sarcastic score of 0.62 and a F-PN score of 0.715 for the sarcasm and sentiment detection tasks, respectively.
arXiv Detail & Related papers (2021-03-09T19:39:43Z) - Multilingual Answer Sentence Reranking via Automatically Translated Data [97.98885151955467]
We present a study on the design of multilingual Answer Sentence Selection (AS2) models, which are a core component of modern Question Answering (QA) systems.
The main idea is to transfer data, created from one resource rich language, e.g., English, to other languages, less rich in terms of resources.
arXiv Detail & Related papers (2021-02-20T03:52:08Z) - Style Variation as a Vantage Point for Code-Switching [54.34370423151014]
Code-Switching (CS) is a common phenomenon observed in several bilingual and multilingual communities.
We present a novel vantage point of CS to be style variations between both the participating languages.
We propose a two-stage generative adversarial training approach where the first stage generates competitive negative examples for CS and the second stage generates more realistic CS sentences.
arXiv Detail & Related papers (2020-05-01T15:53:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.