Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19
Infodemic and Censorship Detection
- URL: http://arxiv.org/abs/2109.12986v1
- Date: Thu, 23 Sep 2021 06:38:03 GMT
- Title: Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19
Infodemic and Censorship Detection
- Authors: Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Alex Nikolov, Wajdi
Zaghouani, Preslav Nakov, Anna Feldman
- Abstract summary: We present the results of the NLP4IF-2021 shared tasks.
Ten teams submitted systems for task 1, and one team participated in task 2.
The best systems used pre-trained Transformers and ensembles.
- Score: 23.280506220186425
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present the results and the main findings of the NLP4IF-2021 shared tasks.
Task 1 focused on fighting the COVID-19 infodemic in social media, and it was
offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict
whether that tweet contains a verifiable claim, and if so, whether it is likely
to be false, is of general interest, is likely to be harmful, and is worthy of
manual fact-checking; also, whether it is harmful to society, and whether it
requires the attention of policy makers. Task~2 focused on censorship
detection, and was offered in Chinese. A total of ten teams submitted systems
for task 1, and one team participated in task 2; nine teams also submitted a
system description paper. Here, we present the tasks, analyze the results, and
discuss the system submissions and the methods they used. Most submissions
achieved sizable improvements over several baselines, and the best systems used
pre-trained Transformers and ensembles. The data, the scorers and the
leaderboards for the tasks are available at
http://gitlab.com/NLP4IF/nlp4if-2021.
Related papers
- Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space --
Transformer Ensemble Models Tackling Deception and Persuasion [0.0]
We present our approaches for task 1-A and task 2-A of the shared task which focus on persuasion technique detection and disinformation detection respectively.
The tasks use multigenre snippets of tweets and news articles for the given binary classification problem.
We achieved a micro F1-score of 0.742 on task 1-A (8th rank on the leaderboard) and 0.901 on task 2-A (7th rank on the leaderboard) respectively.
arXiv Detail & Related papers (2023-11-30T17:26:57Z) - ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection
in Arabic Text [41.3267575540348]
We present an overview of the ArAIEval shared task, organized as part of the first Arabic 2023 conference co-located with EMNLP 2023.
ArAIEval offers two tasks over Arabic text: (i) persuasion technique detection, focusing on identifying persuasion techniques in tweets and news articles, and (ii) disinformation detection in binary and multiclass setups over tweets.
A total of 20 teams participated in the final evaluation phase, with 14 and 16 teams participating in Tasks 1 and 2, respectively.
arXiv Detail & Related papers (2023-11-06T15:21:19Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - QCRI at SemEval-2023 Task 3: News Genre, Framing and Persuasion
Techniques Detection using Multilingual Models [20.003175365478228]
This paper describes our participating system to the SemEval-2023 Task 3.
The task addressed three subtasks with six languages, in addition to three surprise'' test languages, resulting in 27 different test setups.
Our system is ranked among the top 3 systems for 10 out of the 27 setups.
arXiv Detail & Related papers (2023-05-05T07:40:41Z) - Findings of the WMT 2022 Shared Task on Translation Suggestion [63.457874930232926]
We report the result of the first edition of the WMT shared task on Translation Suggestion.
The task aims to provide alternatives for specific words or phrases given the entire documents generated by machine translation (MT)
It consists two sub-tasks, namely, the naive translation suggestion and translation suggestion with hints.
arXiv Detail & Related papers (2022-11-30T03:48:36Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2020 [62.6928395368204]
Task was posed as a binary classification task, in which the goal is to differentiate between real and fake news.
We provided a dataset divided into 900 annotated news articles for training and 400 news articles for testing.
42 teams from 6 different countries (India, China, Egypt, Germany, Pakistan, and the UK) registered for the task.
arXiv Detail & Related papers (2022-07-25T03:41:32Z) - Overview of Abusive and Threatening Language Detection in Urdu at FIRE
2021 [50.591267188664666]
We present two shared tasks of abusive and threatening language detection for the Urdu language.
We present two manually annotated datasets containing tweets labelled as (i) Abusive and Non-Abusive, and (ii) Threatening and Non-Threatening.
For both subtasks, m-Bert based transformer model showed the best performance.
arXiv Detail & Related papers (2022-07-14T07:38:13Z) - UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu [55.41644538483948]
This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.
The proposed systems were based on various count-based features and used different classifiers as well as neural network architectures.
The gradient descent (SGD) algorithm outperformed other classifiers and achieved 0.679 F-score.
arXiv Detail & Related papers (2022-07-11T19:15:04Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2021 [55.41644538483948]
The goal of the shared task is to motivate the community to come up with efficient methods for solving this vital problem.
The training set contains 1300 annotated news articles -- 750 real news, 550 fake news, while the testing set contains 300 news articles -- 200 real, 100 fake news.
The best performing system obtained an F1-macro score of 0.679, which is lower than the past year's best result of 0.907 F1-macro.
arXiv Detail & Related papers (2022-07-11T18:58:36Z) - Team Alex at CLEF CheckThat! 2020: Identifying Check-Worthy Tweets With
Transformer Models [28.25006244616817]
We propose a model for detecting check-worthy tweets about COVID-19, which combines deep contextualized text representations with modeling the social context of the tweet.
Our official submission to the English version of CLEF-2020 CheckThat! Task 1, system Team_Alex, was ranked second with a MAP score of 0.8034.
arXiv Detail & Related papers (2020-09-07T08:03:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.