Unsupervised Text Mining of COVID-19 Records
- URL: http://arxiv.org/abs/2110.07357v1
- Date: Wed, 8 Sep 2021 05:57:22 GMT
- Title: Unsupervised Text Mining of COVID-19 Records
- Authors: Mohamad Zamini
- Abstract summary: Twitter as a powerful tool can help researchers measure public health in response to COVID-19.
This paper preprocessed the existing medical dataset regarding COVID-19 named CORD-19 and annotated the dataset for supervised classification tasks.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Since the beginning of coronavirus, the disease has spread worldwide and
drastically changed many aspects of the human's lifestyle. Twitter as a
powerful tool can help researchers measure public health in response to
COVID-19. According to the high volume of data production on social networks,
automated text mining approaches can help search, read and summarize helpful
information. This paper preprocessed the existing medical dataset regarding
COVID-19 named CORD-19 and annotated the dataset for supervised classification
tasks. At this time of the COVID-19 pandemic, we made a preprocessed dataset
for the research community. This may contribute towards finding new solutions
for some social interventions that COVID-19 has made. The preprocessed version
of the mentioned dataset is publicly available through Github.
Related papers
- A Summary of COVID-19 Datasets [1.3490988186255934]
This research presents a review of main datasets that are developed for COVID-19 research.
We hope this collection will continue to bring together members of the computing community, biomedical experts, and policymakers.
arXiv Detail & Related papers (2022-02-06T17:34:26Z) - A Global Survey of Technological Resources and Datasets on COVID-19 [0.0]
The application and successful utilization of technological resources in developing solutions to health, safety, and economic issues caused by COVID-19 indicate the importance of technology in curbing COVID-19.
arXiv Detail & Related papers (2022-02-06T04:37:14Z) - COVIDx-US -- An open-access benchmark dataset of ultrasound imaging data
for AI-driven COVID-19 analytics [116.6248556979572]
COVIDx-US is an open-access benchmark dataset of COVID-19 related ultrasound imaging data.
It consists of 93 lung ultrasound videos and 10,774 processed images of patients infected with SARS-CoV-2 pneumonia, non-SARS-CoV-2 pneumonia, as well as healthy control cases.
arXiv Detail & Related papers (2021-03-18T03:31:33Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Repurposing TREC-COVID Annotations to Answer the Key Questions of
CORD-19 [4.847073702809032]
coronavirus disease 2019 (COVID-19) began in Wuhan, China in late 2019 and to date has infected over 14M people worldwide.
White House aggregated over 200,000 journal articles related to a variety of coronaviruses and tasked the community with answering key questions related to the corpus.
We set out to repurpose the relevancy annotations for TREC-COVID tasks to identify journal articles in CORD-19 which are relevant to the key questions posed by CORD-19.
arXiv Detail & Related papers (2020-08-27T19:51:07Z) - So You Need Datasets for Your COVID-19 Detection Research Using Machine
Learning? [0.0]
This article represents the detailed information on frequently used datasets in COVID19 detection using Machine Learning (ML)
We investigated 96 papers on COVID19 detection between January 2020 and June 2020.
arXiv Detail & Related papers (2020-08-11T18:25:09Z) - TICO-19: the Translation Initiative for Covid-19 [112.5601530395345]
The Translation Initiative for COvid-19 (TICO-19) has made test and development data available to AI and MT researchers in 35 different languages.
The same data is translated into all of the languages represented, meaning that testing or development can be done for any pairing of languages in the set.
arXiv Detail & Related papers (2020-07-03T16:26:17Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z) - Rapidly Bootstrapping a Question Answering Dataset for COVID-19 [88.86456834766288]
We present CovidQA, the beginnings of a question answering dataset specifically designed for COVID-19.
This is the first publicly available resource of its type, and intended as a stopgap measure for guiding research until more substantial evaluation resources become available.
arXiv Detail & Related papers (2020-04-23T17:35:11Z) - CORD-19: The COVID-19 Open Research Dataset [28.556291682259477]
CORD-19 is a growing resource of scientific papers on COVID-19 and related historical coronavirus research.
Since its release, CORD-19 has been downloaded over 200K times and has served as the basis of many COVID-19 text mining and discovery systems.
arXiv Detail & Related papers (2020-04-22T17:10:18Z) - COVID-Net: A Tailored Deep Convolutional Neural Network Design for
Detection of COVID-19 Cases from Chest X-Ray Images [93.0013343535411]
We introduce COVID-Net, a deep convolutional neural network design tailored for the detection of COVID-19 cases from chest X-ray (CXR) images.
To the best of the authors' knowledge, COVID-Net is one of the first open source network designs for COVID-19 detection from CXR images.
We also introduce COVIDx, an open access benchmark dataset that we generated comprising of 13,975 CXR images across 13,870 patient patient cases.
arXiv Detail & Related papers (2020-03-22T12:26:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.