g2tmn at Constraint@AAAI2021: Exploiting CT-BERT and Ensembling Learning
for COVID-19 Fake News Detection
- URL: http://arxiv.org/abs/2012.11967v3
- Date: Wed, 13 Jan 2021 11:36:32 GMT
- Title: g2tmn at Constraint@AAAI2021: Exploiting CT-BERT and Ensembling Learning
for COVID-19 Fake News Detection
- Authors: Anna Glazkova, Maksim Glazkov, Timofey Trifonov
- Abstract summary: We present our results at the Constraint@AAAI2021 Shared Task: COVID-19 Fake News Detection in English.
We propose our approach using the transformer-based ensemble of COVID-Twitter-BERT (CT-BERT) models.
As a result, our best model achieved the weighted F1-score of 98.69 on the test set.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The COVID-19 pandemic has had a huge impact on various areas of human life.
Hence, the coronavirus pandemic and its consequences are being actively
discussed on social media. However, not all social media posts are truthful.
Many of them spread fake news that cause panic among readers, misinform people
and thus exacerbate the effect of the pandemic. In this paper, we present our
results at the Constraint@AAAI2021 Shared Task: COVID-19 Fake News Detection in
English. In particular, we propose our approach using the transformer-based
ensemble of COVID-Twitter-BERT (CT-BERT) models. We describe the models used,
the ways of text preprocessing and adding extra data. As a result, our best
model achieved the weighted F1-score of 98.69 on the test set (the first place
in the leaderboard) of this shared task that attracted 166 submitted teams in
total.
Related papers
- Understanding writing style in social media with a supervised
contrastively pre-trained transformer [57.48690310135374]
Online Social Networks serve as fertile ground for harmful behavior, ranging from hate speech to the dissemination of disinformation.
We introduce the Style Transformer for Authorship Representations (STAR), trained on a large corpus derived from public sources of 4.5 x 106 authored texts.
Using a support base of 8 documents of 512 tokens, we can discern authors from sets of up to 1616 authors with at least 80% accuracy.
arXiv Detail & Related papers (2023-10-17T09:01:17Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - Cross-lingual COVID-19 Fake News Detection [54.125563009333995]
We make the first attempt to detect COVID-19 misinformation in a low-resource language (Chinese) only using the fact-checked news in a high-resource language (English)
We propose a deep learning framework named CrossFake to jointly encode the cross-lingual news body texts and capture the news content.
Empirical results on our dataset demonstrate the effectiveness of CrossFake under the cross-lingual setting.
arXiv Detail & Related papers (2021-10-13T04:44:02Z) - A transformer based approach for fighting COVID-19 fake news [0.8793721044482612]
COVID-19 is the first pandemic in history when humanity is the most technologically advanced.
Fake news and misinformation regarding this virus is also available to people and causing some massive problems.
We present our solution for the "Constraint@AAAI2021 - COVID19 Fake News Detection in English" challenge in this work.
arXiv Detail & Related papers (2021-01-28T14:43:42Z) - TUDublin team at Constraint@AAAI2021 -- COVID19 Fake News Detection [0.0]
The main goal of the work was to create a model that would carry out a binary classification of messages from social media as real or fake news.
The model allowed us to achieve 0.94 F1-score, which is within 5% of the best result.
arXiv Detail & Related papers (2021-01-14T16:25:32Z) - Model Generalization on COVID-19 Fake News Detection [41.03093888315081]
We aim to achieve a robust model for the COVID-19 fake-news detection task proposed at CONSTRAINT 2021 (FakeNews-19)
We evaluate our models on two COVID-19 fake-news test sets.
arXiv Detail & Related papers (2021-01-11T12:23:41Z) - TIB's Visual Analytics Group at MediaEval '20: Detecting Fake News on
Corona Virus and 5G Conspiracy [9.66022279280394]
Fake news on social media has become a hot topic of research as it negatively impacts the discourse of real news in the public.
The FakeNews task at MediaEval 2020 tackles this problem by creating a challenge to automatically detect tweets containing misinformation.
We present a simple approach that uses BERT embeddings and a shallow neural network for classifying tweets using only text.
arXiv Detail & Related papers (2021-01-10T11:52:17Z) - Transformer based Automatic COVID-19 Fake News Detection System [9.23545668304066]
Misinformation is especially prevalent in the ongoing coronavirus disease (COVID-19) pandemic.
We report a methodology to analyze the reliability of information shared on social media pertaining to the COVID-19 pandemic.
Our system obtained 0.9855 f1-score on testset and ranked 5th among 160 teams.
arXiv Detail & Related papers (2021-01-01T06:49:27Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z) - The Ivory Tower Lost: How College Students Respond Differently than the
General Public to the COVID-19 Pandemic [66.80677233314002]
Pandemic of the novel Coronavirus Disease 2019 (COVID-19) has presented governments with ultimate challenges.
In the United States, the country with the highest confirmed COVID-19 infection cases, a nationwide social distancing protocol has been implemented by the President.
This paper aims to discover the social implications of this unprecedented disruption in our interactive society by mining people's opinions on social media.
arXiv Detail & Related papers (2020-04-21T13:02:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.