Predicting Visit Cost of Obstructive Sleep Apnea using Electronic
Healthcare Records with Transformer
- URL: http://arxiv.org/abs/2301.12289v1
- Date: Sat, 28 Jan 2023 20:08:00 GMT
- Title: Predicting Visit Cost of Obstructive Sleep Apnea using Electronic
Healthcare Records with Transformer
- Authors: Zhaoyang Chen, Lina Siltala-Li, Mikko Lassila, Pekka Malo, Eeva
Vilkkumaa, Tarja Saaresranta, Arho Veli Virkki
- Abstract summary: Obstructive sleep apnea (OSA) is growing increasingly prevalent in many countries as obesity rises.
For treatment purposes, predicting OSA patients' visit expenses for the coming year is crucial.
Just a third of those data from OSA patients can be used to train analytics models.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Background: Obstructive sleep apnea (OSA) is growing increasingly prevalent
in many countries as obesity rises. Sufficient, effective treatment of OSA
entails high social and financial costs for healthcare. Objective: For
treatment purposes, predicting OSA patients' visit expenses for the coming year
is crucial. Reliable estimates enable healthcare decision-makers to perform
careful fiscal management and budget well for effective distribution of
resources to hospitals. The challenges created by scarcity of high-quality
patient data are exacerbated by the fact that just a third of those data from
OSA patients can be used to train analytics models: only OSA patients with more
than 365 days of follow-up are relevant for predicting a year's expenditures.
Methods and procedures: The authors propose a method applying two Transformer
models, one for augmenting the input via data from shorter visit histories and
the other predicting the costs by considering both the material thus enriched
and cases with more than a year's follow-up. Results: The two-model solution
permits putting the limited body of OSA patient data to productive use.
Relative to a single-Transformer solution using only a third of the
high-quality patient data, the solution with two models improved the prediction
performance's $R^{2}$ from 88.8% to 97.5%. Even using baseline models with the
model-augmented data improved the $R^{2}$ considerably, from 61.6% to 81.9%.
Conclusion: The proposed method makes prediction with the most of the available
high-quality data by carefully exploiting details, which are not directly
relevant for answering the question of the next year's likely expenditure.
Related papers
- Active Sensing of Knee Osteoarthritis Progression with Reinforcement Learning [2.8498944632323755]
Osteoarthritis (OA) is the most common musculoskeletal disease, which has no cure.
Knee OA (KOA) is one of the highest causes of disability worldwide, and it costs billions of United States dollars to the global community.
arXiv Detail & Related papers (2024-08-05T09:54:08Z) - Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning [3.9054437595657534]
Using patient care demand data from Rambam Medical Center in Israel, our results show that both proposed models effectively capture hourly variations of patient demand.
It is possible to predict patient care demand with good accuracy (around 4 patients) three days or a week in advance using machine learning.
arXiv Detail & Related papers (2024-04-29T13:05:59Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Data-Driven Allocation of Preventive Care With Application to Diabetes
Mellitus Type II [34.29871115270994]
We develop a data-driven decision model for determining a cost-effective allocation of preventive treatments to patients at risk.
Our work supports decision-making in health management, with the goal of achieving effective disease prevention at lower costs.
arXiv Detail & Related papers (2023-08-14T06:29:09Z) - Building predictive models of healthcare costs with open healthcare data [0.0]
We present an approach to developing a predictive model using machine-learning techniques.
We analyzed de-identified patient data from New York StateS, consisting of 2.3 million records in 2016.
We built models to predict costs from patient diagnoses and demographics.
arXiv Detail & Related papers (2023-04-05T02:12:58Z) - Textual Data Augmentation for Patient Outcomes Prediction [67.72545656557858]
We propose a novel data augmentation method to generate artificial clinical notes in patients' Electronic Health Records.
We fine-tune the generative language model GPT-2 to synthesize labeled text with the original training data.
We evaluate our method on the most common patient outcome, i.e., the 30-day readmission rate.
arXiv Detail & Related papers (2022-11-13T01:07:23Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z) - Deep learning for prediction of population health costs [0.0]
We developed a deep neural network to predict future cost from health insurance claims records.
We applied the deep network and a ridge regression model to a sample of 1.4 million German insurants to predict total one-year health care costs.
arXiv Detail & Related papers (2020-03-06T23:33:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.