Building predictive models of healthcare costs with open healthcare data
- URL: http://arxiv.org/abs/2304.02191v1
- Date: Wed, 5 Apr 2023 02:12:58 GMT
- Title: Building predictive models of healthcare costs with open healthcare data
- Authors: A. Ravishankar Rao, Subrata Garai, Soumyabrata Dey, Hang Peng
- Abstract summary: We present an approach to developing a predictive model using machine-learning techniques.
We analyzed de-identified patient data from New York StateS, consisting of 2.3 million records in 2016.
We built models to predict costs from patient diagnoses and demographics.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Due to rapidly rising healthcare costs worldwide, there is significant
interest in controlling them. An important aspect concerns price transparency,
as preliminary efforts have demonstrated that patients will shop for lower
costs, driving efficiency. This requires the data to be made available, and
models that can predict healthcare costs for a wide range of patient
demographics and conditions. We present an approach to this problem by
developing a predictive model using machine-learning techniques. We analyzed
de-identified patient data from New York State SPARCS (statewide planning and
research cooperative system), consisting of 2.3 million records in 2016. We
built models to predict costs from patient diagnoses and demographics. We
investigated two model classes consisting of sparse regression and decision
trees. We obtained the best performance by using a decision tree with depth 10.
We obtained an R-square value of 0.76 which is better than the values reported
in the literature for similar problems.
Related papers
- Predictive and Prescriptive Analytics for Multi-Site Modeling of Frail
and Elderly Patient Services [0.0]
The aim of this research is to assess how various predictive and prescriptive analytical methods contribute to addressing the operational challenges within an area of healthcare that is growing in demand.
On the prescriptive side, deterministic and two-stage programs are developed to determine how to optimally plan for beds and ward staff.
Our research reveals that healthcare managers should consider using predictive and prescriptive models to make more informed decisions.
arXiv Detail & Related papers (2023-11-13T12:25:45Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Data-Driven Allocation of Preventive Care With Application to Diabetes
Mellitus Type II [34.29871115270994]
We develop a data-driven decision model for determining a cost-effective allocation of preventive treatments to patients at risk.
Our work supports decision-making in health management, with the goal of achieving effective disease prevention at lower costs.
arXiv Detail & Related papers (2023-08-14T06:29:09Z) - Predicting Visit Cost of Obstructive Sleep Apnea using Electronic
Healthcare Records with Transformer [0.0]
Obstructive sleep apnea (OSA) is growing increasingly prevalent in many countries as obesity rises.
For treatment purposes, predicting OSA patients' visit expenses for the coming year is crucial.
Just a third of those data from OSA patients can be used to train analytics models.
arXiv Detail & Related papers (2023-01-28T20:08:00Z) - Foresight -- Deep Generative Modelling of Patient Timelines using
Electronic Health Records [46.024501445093755]
Temporal modelling of medical history can be used to forecast and simulate future events, estimate risk, suggest alternative diagnoses or forecast complications.
We present Foresight, a novel GPT3-based pipeline that uses NER+L tools (i.e. MedCAT) to convert document text into structured, coded concepts.
arXiv Detail & Related papers (2022-12-13T19:06:00Z) - Development of patients triage algorithm from nationwide COVID-19
registry data based on machine learning [1.0323063834827415]
This paper provides the development processes of the severity assessment model using machine learning techniques.
Model only requires basic patients' basic personal data, allowing for them to judge their own severity.
We aim to establish a medical system that allows patients to check their own severity and informs them to visit the appropriate clinic center based on the past treatment details of other patients with similar severity.
arXiv Detail & Related papers (2021-09-18T19:56:27Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Learning-based Computer-aided Prescription Model for Parkinson's
Disease: A Data-driven Perspective [61.70045118068213]
We build a dataset by collecting symptoms of PD patients, and their prescription drug provided by neurologists.
Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug.
For the new coming patients, we could recommend (predict) suitable prescription drug on their observed symptoms by our prescription model.
arXiv Detail & Related papers (2020-07-31T14:34:35Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z) - Deep learning for prediction of population health costs [0.0]
We developed a deep neural network to predict future cost from health insurance claims records.
We applied the deep network and a ridge regression model to a sample of 1.4 million German insurants to predict total one-year health care costs.
arXiv Detail & Related papers (2020-03-06T23:33:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.