Related papers: Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

URL: http://arxiv.org/abs/2111.07819v1
Date: Mon, 15 Nov 2021 15:01:55 GMT
Title: Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection
Authors: Jan Philip Wahle and Nischal Ashok and Terry Ruas and Norman Meuschke and Tirthankar Ghosal and Bela Gipp
Abstract summary: A drastic rise in potentially life-threatening misinformation has been a by-product of the COVID-19 pandemic. We evaluate fifteen Transformer-based models on five COVID-19 misinformation datasets. We show tokenizers and models tailored to COVID-19 data do not provide a significant advantage over general-purpose ones.
Score: 6.1204874238049705
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: A drastic rise in potentially life-threatening misinformation has been a by-product of the COVID-19 pandemic. Computational support to identify false information within the massive body of data on the topic is crucial to prevent harm. Researchers proposed many methods for flagging online misinformation related to COVID-19. However, these methods predominantly target specific content types (e.g., news) or platforms (e.g., Twitter). The methods' capabilities to generalize were largely unclear so far. We evaluate fifteen Transformer-based models on five COVID-19 misinformation datasets that include social media posts, news articles, and scientific papers to fill this gap. We show tokenizers and models tailored to COVID-19 data do not provide a significant advantage over general-purpose ones. Our study provides a realistic assessment of models for detecting COVID-19 misinformation. We expect that evaluating a broad spectrum of datasets and models will benefit future research in developing misinformation detection systems.

Related papers

Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection [7.807156538988814]
Misinformation surrounding emerging outbreaks poses a serious societal threat. One promising approach is stance detection (SD), which identifies whether social media posts support or oppose misleading claims. We test controllable misinformation generation using large language models (LLMs) as a method for data augmentation.
arXiv Detail & Related papers (2025-03-04T06:38:29Z)
Unsupervised Model Diagnosis [49.36194740479798]
This paper proposes Unsupervised Model Diagnosis (UMO) to produce semantic counterfactual explanations without any user guidance. Our approach identifies and visualizes changes in semantics, and then matches these changes to attributes from wide-ranging text sources.
arXiv Detail & Related papers (2024-10-08T17:59:03Z)
AMIR: Automated MisInformation Rebuttal -- A COVID-19 Vaccination Datasets based Recommendation System [0.05461938536945722]
This work explored how existing information obtained from social media can be harnessed to facilitate automated rebuttal of misinformation at scale. It leverages two publicly available datasets, FaCov (fact-checked articles) and misleading (social media Twitter) data on COVID-19 Vaccination.
arXiv Detail & Related papers (2023-10-29T13:07:33Z)
Two-Stage Classifier for COVID-19 Misinformation Detection Using BERT: a Study on Indonesian Tweets [0.15229257192293202]
Research on COVID-19 misinformation detection in Indonesia is still scarce. In this study, we propose the two-stage classifier model using IndoBERT pre-trained language model for the Tweet misinformation detection task. The experimental results show that the combination of the BERT sequence classifier for relevance prediction and Bi-LSTM for misinformation detection outperformed other machine learning models with an accuracy of 87.02%.
arXiv Detail & Related papers (2022-06-30T15:33:20Z)
When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning Framework in Classification of Medical Images on Limited Data: A COVID-19 Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources. CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z)
COVID-19 Electrocardiograms Classification using CNN Models [1.1172382217477126]
A novel approach is proposed to automatically diagnose the COVID-19 by the utilization of Electrocardiogram (ECG) data with the integration of deep learning algorithms. CNN models have been utilized in this proposed framework, including VGG16, VGG19, InceptionResnetv2, InceptionV3, Resnet50, and Densenet201. Our results show a relatively low accuracy in the rest of the models compared to the VGG16 model, which is due to the small size of the utilized dataset.
arXiv Detail & Related papers (2021-12-15T08:06:45Z)
The pitfalls of using open data to develop deep learning solutions for COVID-19 detection in chest X-rays [64.02097860085202]
Deep learning models have been developed to identify COVID-19 from chest X-rays. Results have been exceptional when training and testing on open-source data. Data analysis and model evaluations show that the popular open-source dataset COVIDx is not representative of the real clinical problem.
arXiv Detail & Related papers (2021-09-14T10:59:11Z)
FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources. Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19. The data itself is still scarce due to patient privacy concerns. We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z)
End-2-End COVID-19 Detection from Breath & Cough Audio [68.41471917650571]
We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples. We introduce a novel modelling strategy using a custom deep neural network to diagnose COVID-19 from a joint breath and cough representation.
arXiv Detail & Related papers (2021-01-07T01:13:00Z)
Classification supporting COVID-19 diagnostics based on patient survey data [82.41449972618423]
logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19 were generated. The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease. This data set consists of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
arXiv Detail & Related papers (2020-11-24T17:44:01Z)
Blockchain-Federated-Learning and Deep Learning Models for COVID-19 detection using CT Imaging [8.280858576611587]
Primary problem in diagnosing COVID-19 patients is the shortage and reliability of testing kits. Second real-world problem is to share the data among the hospitals globally. Thirdly, we design a method that can collaboratively train a global model using blockchain technology.
arXiv Detail & Related papers (2020-07-10T11:23:14Z)
Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus [2.492887522265771]
The explosion of disinformation following the COVID-19 pandemic has overloaded fact-checkers and media worldwide. To help tackle this, we developed computational methods to categorise COVID-19 disinformation.
arXiv Detail & Related papers (2020-06-05T10:32:18Z)
COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest X-Ray Images [93.0013343535411]
We introduce COVID-Net, a deep convolutional neural network design tailored for the detection of COVID-19 cases from chest X-ray (CXR) images. To the best of the authors' knowledge, COVID-Net is one of the first open source network designs for COVID-19 detection from CXR images. We also introduce COVIDx, an open access benchmark dataset that we generated comprising of 13,975 CXR images across 13,870 patient patient cases.
arXiv Detail & Related papers (2020-03-22T12:26:36Z)

This list is automatically generated from the titles and abstracts of the papers in this site.