QCRI at SemEval-2023 Task 3: News Genre, Framing and Persuasion
Techniques Detection using Multilingual Models
- URL: http://arxiv.org/abs/2305.03336v1
- Date: Fri, 5 May 2023 07:40:41 GMT
- Title: QCRI at SemEval-2023 Task 3: News Genre, Framing and Persuasion
Techniques Detection using Multilingual Models
- Authors: Maram Hasanain, Ahmed Oumar El-Shangiti, Rabindra Nath Nandi, Preslav
Nakov and Firoj Alam
- Abstract summary: This paper describes our participating system to the SemEval-2023 Task 3.
The task addressed three subtasks with six languages, in addition to three surprise'' test languages, resulting in 27 different test setups.
Our system is ranked among the top 3 systems for 10 out of the 27 setups.
- Score: 20.003175365478228
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Misinformation spreading in mainstream and social media has been misleading
users in different ways. Manual detection and verification efforts by
journalists and fact-checkers can no longer cope with the great scale and quick
spread of misleading information. This motivated research and industry efforts
to develop systems for analyzing and verifying news spreading online. The
SemEval-2023 Task 3 is an attempt to address several subtasks under this
overarching problem, targeting writing techniques used in news articles to
affect readers' opinions. The task addressed three subtasks with six languages,
in addition to three ``surprise'' test languages, resulting in 27 different
test setups. This paper describes our participating system to this task. Our
team is one of the 6 teams that successfully submitted runs for all setups. The
official results show that our system is ranked among the top 3 systems for 10
out of the 27 setups.
Related papers
- SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection [68.858931667807]
Subtask A is a binary classification task determining whether a text is written by a human or generated by a machine.
Subtask B is to detect the exact source of a text, discerning whether it is written by a human or generated by a specific LLM.
Subtask C aims to identify the changing point within a text, at which the authorship transitions from human to machine.
arXiv Detail & Related papers (2024-04-22T13:56:07Z) - SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes [48.83290963506378]
This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations.
We observe a number of key trends in how this approach was tackled.
While a majority of the teams did outperform our proposed baseline system, the performances of top-scoring systems are still consistent with a random handling of the more challenging items.
arXiv Detail & Related papers (2024-03-12T15:06:22Z) - MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label
Framing Detection with Contrastive Learning [21.616089539381996]
This paper describes our system for SemEval-2023 Task 3 Subtask 2 on Framing Detection.
We used a multi-label contrastive loss for fine-tuning large pre-trained language models in a multi-lingual setting.
Our system was ranked first on the official test set and on the official shared task leaderboard for five of the six languages.
arXiv Detail & Related papers (2023-04-20T18:42:23Z) - Designing and Evaluating Interfaces that Highlight News Coverage
Diversity Using Discord Questions [84.55145223950427]
This paper shows that navigating large source collections for a news story can be challenging without further guidance.
We design three interfaces -- the Annotated Article, the Recomposed Article, and the Question Grid -- aimed at accompanying news readers in discovering coverage diversity while they read.
arXiv Detail & Related papers (2023-02-17T16:59:31Z) - IITD at the WANLP 2022 Shared Task: Multilingual Multi-Granularity
Network for Propaganda Detection [25.536546272915427]
We present our system for two subtasks of the shared task on propaganda detection in Arabic.
Subtask 1 is a multi-label classification problem to find the propaganda techniques used in a given tweet.
Subtask 2 asks to identify the textual span for each instance of each technique that is present in the tweet.
arXiv Detail & Related papers (2022-10-31T10:14:43Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2020 [62.6928395368204]
Task was posed as a binary classification task, in which the goal is to differentiate between real and fake news.
We provided a dataset divided into 900 annotated news articles for training and 400 news articles for testing.
42 teams from 6 different countries (India, China, Egypt, Germany, Pakistan, and the UK) registered for the task.
arXiv Detail & Related papers (2022-07-25T03:41:32Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2021 [55.41644538483948]
The goal of the shared task is to motivate the community to come up with efficient methods for solving this vital problem.
The training set contains 1300 annotated news articles -- 750 real news, 550 fake news, while the testing set contains 300 news articles -- 200 real, 100 fake news.
The best performing system obtained an F1-macro score of 0.679, which is lower than the past year's best result of 0.907 F1-macro.
arXiv Detail & Related papers (2022-07-11T18:58:36Z) - NewsEdits: A News Article Revision Dataset and a Document-Level
Reasoning Challenge [122.37011526554403]
NewsEdits is the first publicly available dataset of news revision histories.
It contains 1.2 million articles with 4.6 million versions from over 22 English- and French-language newspaper sources.
arXiv Detail & Related papers (2022-06-14T18:47:13Z) - Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19
Infodemic and Censorship Detection [23.280506220186425]
We present the results of the NLP4IF-2021 shared tasks.
Ten teams submitted systems for task 1, and one team participated in task 2.
The best systems used pre-trained Transformers and ensembles.
arXiv Detail & Related papers (2021-09-23T06:38:03Z) - Multitask Models for Supervised Protests Detection in Texts [3.8073142980733]
I apply multitask neural networks capable of producing predictions for two and three of these tasks simultaneously.
This paper demonstrates performance near or above the reported state-of-the-art for automated political event coding.
arXiv Detail & Related papers (2020-05-06T17:00:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.