Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in
Indic Languages
- URL: http://arxiv.org/abs/2401.03677v1
- Date: Mon, 8 Jan 2024 05:54:26 GMT
- Title: Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in
Indic Languages
- Authors: Aatman Vaidya, Arnav Arora, Aditya Joshi, Tarunima Prabhakar
- Abstract summary: The paper reports the findings of the ICON 2023 on Gendered Abuse Detection in Indic languages.
The shared task was conducted based on a novel dataset in Hindi, Tamil and the Indian dialect of English.
The paper contains examples of hateful content owing to its topic.
- Score: 7.869644160487393
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper reports the findings of the ICON 2023 on Gendered Abuse Detection
in Indic Languages. The shared task deals with the detection of gendered abuse
in online text. The shared task was conducted as a part of ICON 2023, based on
a novel dataset in Hindi, Tamil and the Indian dialect of English. The
participants were given three subtasks with the train dataset consisting of
approximately 6500 posts sourced from Twitter. For the test set, approximately
1200 posts were provided. The shared task received a total of 9 registrations.
The best F-1 scores are 0.616 for subtask 1, 0.572 for subtask 2 and, 0.616 and
0.582 for subtask 3. The paper contains examples of hateful content owing to
its topic.
Related papers
- Findings of the IWSLT 2024 Evaluation Campaign [102.7608597658451]
The paper reports on the shared tasks organized by the 21st IWSLT Conference.
The shared tasks address 7 scientific challenges in spoken language translation.
arXiv Detail & Related papers (2024-11-07T19:11:55Z) - NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task [28.40134178913119]
We describe the findings of the fifth Nuanced Arabic Dialect Identification Shared Task (NADI 2024)
NADI 2024 targeted both dialect identification cast as a multi-label task and identification of the Arabic level of dialectness.
Winning teams achieved 50.57 Ftextsubscript1 on Subtask1, 0.1403 RMSE for Subtask2, and 20.44 BLEU in Subtask3, respectively.
arXiv Detail & Related papers (2024-07-06T01:18:58Z) - SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection [68.858931667807]
Subtask A is a binary classification task determining whether a text is written by a human or generated by a machine.
Subtask B is to detect the exact source of a text, discerning whether it is written by a human or generated by a specific LLM.
Subtask C aims to identify the changing point within a text, at which the authorship transitions from human to machine.
arXiv Detail & Related papers (2024-04-22T13:56:07Z) - SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in
Conversation (EDiReF) [61.49972925493912]
SemEval-2024 Task 10 is a shared task centred on identifying emotions in code-mixed dialogues.
This task comprises three distinct subtasks - emotion recognition in conversation for code-mixed dialogues, emotion flip reasoning for code-mixed dialogues, and emotion flip reasoning for English dialogues.
A total of 84 participants engaged in this task, with the most adept systems attaining F1-scores of 0.70, 0.79, and 0.76 for the respective subtasks.
arXiv Detail & Related papers (2024-02-29T08:20:06Z) - NADI 2023: The Fourth Nuanced Arabic Dialect Identification Shared Task [28.986040897360336]
We describe the findings of the fourth Nuanced Arabic Dialect Identification Shared Task (NADI 2023)
NADI 2023 targeted both dialect identification (Subtask 1) and dialect-to-MSA machine translation (Subtask 2 and Subtask 3).
We describe the methods employed by the participating teams and briefly offer an outlook for NADI.
arXiv Detail & Related papers (2023-10-24T18:41:24Z) - CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental
Fine-Tuning and Multi-Task Learning with Label Descriptions [0.0]
SemEval shared task textitTowards Explainable Detection of Online Sexism (EDOS 2023) is to detect sexism in English social media posts.
We present our submitted systems for all three subtasks, based on a multi-task model that has been fine-tuned on a range of related tasks.
We implement multi-task learning by formulating each task as binary pairwise text classification, where the dataset and label descriptions are given along with the input text.
arXiv Detail & Related papers (2023-06-06T17:59:49Z) - Overview of Abusive and Threatening Language Detection in Urdu at FIRE
2021 [50.591267188664666]
We present two shared tasks of abusive and threatening language detection for the Urdu language.
We present two manually annotated datasets containing tweets labelled as (i) Abusive and Non-Abusive, and (ii) Threatening and Non-Threatening.
For both subtasks, m-Bert based transformer model showed the best performance.
arXiv Detail & Related papers (2022-07-14T07:38:13Z) - UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu [55.41644538483948]
This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.
The proposed systems were based on various count-based features and used different classifiers as well as neural network architectures.
The gradient descent (SGD) algorithm outperformed other classifiers and achieved 0.679 F-score.
arXiv Detail & Related papers (2022-07-11T19:15:04Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task [18.23153068720659]
We present the results and findings of the First Nuanced Arabic Dialect Identification Shared Task (NADI)
Data for the shared task covers a total of 100 provinces from 21 Arab countries and are collected from the Twitter domain.
NADI is the first shared task to target naturally-occurring fine-grained dialectal text at the sub-country level.
arXiv Detail & Related papers (2020-10-21T22:14:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.