Data Mining Approach to Analyze Covid19 Dataset of Brazilian Patients
- URL: http://arxiv.org/abs/2008.11344v1
- Date: Wed, 26 Aug 2020 02:21:56 GMT
- Title: Data Mining Approach to Analyze Covid19 Dataset of Brazilian Patients
- Authors: Josimar E. Chire Saire
- Abstract summary: The pandemic originated by coronavirus(covid-19), name coined by World Health Organization during the first month in 2020.
Almost all the countries presented covid19 positive cases and governments are choosing different health policies to stop the infection.
One of top countries with more infections is Brazil, until August 11 had a total of 3,112,393 cases.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The pandemic originated by coronavirus(covid-19), name coined by World Health
Organization during the first month in 2020. Actually, almost all the countries
presented covid19 positive cases and governments are choosing different health
policies to stop the infection and many research groups are working on patients
data to understand the virus, at the same time scientists are looking for a
vacuum to enhance imnulogy system to tack covid19 virus. One of top countries
with more infections is Brazil, until August 11 had a total of 3,112,393 cases.
Research Foundation of Sao Paulo State(Fapesp) released a dataset, it was an
innovative in collaboration with hospitals(Einstein, Sirio-Libanes),
laboratory(Fleury) and Sao Paulo University to foster reseach on this trend
topic. The present paper presents an exploratory analysis of the datasets,
using a Data Mining Approach, and some inconsistencies are found, i.e. NaN
values, null references values for analytes, outliers on results of analytes,
encoding issues. The results were cleaned datasets for future studies, but at
least a 20\% of data were discarded because of non numerical, null values and
numbers out of reference range.
Related papers
- A Large-Scale Dataset of Search Interests Related to Disease X
Originating from Different Geographic Regions [0.0]
This paper presents a dataset of web behavior related to Disease X between February 2018 and August 2023.
The dataset was developed by collecting data using Google Trends.
The relevant search interests for all these regions for each month in this time range are available in this dataset.
arXiv Detail & Related papers (2023-12-19T06:20:27Z) - COVID-19 Vaccine Misinformation in Middle Income Countries [5.891662430960944]
This paper introduces a multilingual dataset of COVID-19 vaccine misinformation, consisting of annotated tweets from three middle-income countries: Brazil, Indonesia, and Nigeria.
The dataset includes annotations for 5,952 tweets, assessing their relevance to COVID-19 vaccines, presence of misinformation, and the themes of the misinformation.
arXiv Detail & Related papers (2023-11-30T02:27:34Z) - Visual Exploratory Data Analysis of the Covid-19 Vaccination Progress in
Nigeria [0.0]
Since the covid-19 vaccine came to Nigeria; 18,728,188 people have been fully vaccinated as at May 31st, 2022.
This is roughly 10% of the Nigerian population estimated at 206.7 million.
The data used for this research contain the state-by-state breakdown of Covid-19 vaccine distribution recorded between March 5th, 2021, and May 31st, 2022.
arXiv Detail & Related papers (2022-08-20T09:52:18Z) - COVID-19: An exploration of consecutive systemic barriers to
pathogen-related data sharing during a pandemic [3.192308005611312]
In 2020, the COVID-19 pandemic resulted in a rapid response from governments and researchers worldwide.
As of late 2023, over millions have died as a result of COVID-19.
Data professionals working with pandemic-relevant data often face significant systemic barriers to accessing, sharing or re-using this data.
arXiv Detail & Related papers (2022-05-24T14:25:09Z) - A Machine Learning Analysis of COVID-19 Mental Health Data [0.0]
In December 2019, the novel coronavirus (Sars-Cov-2) and the resulting disease COVID-19 were first identified in Wuhan China.
In this paper, we analyze the impacts the COVID-19 pandemic has had on the mental health of frontline workers in the United States.
Through the interpretation of the many models applied to the mental health survey data, we have concluded that the most important factor in predicting the mental health decline of a frontline worker is the healthcare role.
arXiv Detail & Related papers (2021-12-01T02:00:44Z) - Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in
Artificial Intelligence [79.038671794961]
We launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the AI model can be distributedly trained and independently executed at each host institution.
Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK.
arXiv Detail & Related papers (2021-11-18T00:43:41Z) - COVIDx-US -- An open-access benchmark dataset of ultrasound imaging data
for AI-driven COVID-19 analytics [116.6248556979572]
COVIDx-US is an open-access benchmark dataset of COVID-19 related ultrasound imaging data.
It consists of 93 lung ultrasound videos and 10,774 processed images of patients infected with SARS-CoV-2 pneumonia, non-SARS-CoV-2 pneumonia, as well as healthy control cases.
arXiv Detail & Related papers (2021-03-18T03:31:33Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Classification supporting COVID-19 diagnostics based on patient survey
data [82.41449972618423]
logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19 were generated.
The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease.
This data set consists of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
arXiv Detail & Related papers (2020-11-24T17:44:01Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.