Accurate and Interpretable Machine Learning for Transparent Pricing of
Health Insurance Plans
- URL: http://arxiv.org/abs/2009.10990v2
- Date: Sat, 27 Feb 2021 22:47:22 GMT
- Title: Accurate and Interpretable Machine Learning for Transparent Pricing of
Health Insurance Plans
- Authors: Rohun Kshirsagar, Li-Yen Hsu, Vatshank Chaturvedi, Charles H.
Greenberg, Matthew McClelland, Anushadevi Mohan, Wideet Shende, Nicolas P.
Tilmans, Renzo Frigato, Min Guo, Ankit Chheda, Meredith Trotter, Shonket Ray,
Arnold Lee, Miguel Alvarado
- Abstract summary: Health insurance companies cover half of the United States population and pay 1.2 trillion US dollars every year to cover medical expenses for their members.
The actuary and underwriter roles at a health insurance company serve to assess which risks to take on and how to price those risks to ensure profitability of the organization.
We developed a sequence of two models, an individual patient-level and an employer-group-level model, to predict the annual per member per month allowed amount for employer groups.
Our models performed 20% better than the insurance carrier's existing pricing model, and identified 84% of the concession opportunities
- Score: 3.772148470078554
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Health insurance companies cover half of the United States population through
commercial employer-sponsored health plans and pay 1.2 trillion US dollars
every year to cover medical expenses for their members. The actuary and
underwriter roles at a health insurance company serve to assess which risks to
take on and how to price those risks to ensure profitability of the
organization. While Bayesian hierarchical models are the current standard in
the industry to estimate risk, interest in machine learning as a way to improve
upon these existing methods is increasing. Lumiata, a healthcare analytics
company, ran a study with a large health insurance company in the United
States. We evaluated the ability of machine learning models to predict the per
member per month cost of employer groups in their next renewal period,
especially those groups who will cost less than 95\% of what an actuarial model
predicts (groups with "concession opportunities"). We developed a sequence of
two models, an individual patient-level and an employer-group-level model, to
predict the annual per member per month allowed amount for employer groups,
based on a population of 14 million patients. Our models performed 20\% better
than the insurance carrier's existing pricing model, and identified 84\% of the
concession opportunities. This study demonstrates the application of a machine
learning system to compute an accurate and fair price for health insurance
products and analyzes how explainable machine learning models can exceed
actuarial models' predictive accuracy while maintaining interpretability.
Related papers
- Introducing the Large Medical Model: State of the art healthcare cost and risk prediction with transformers trained on patient event sequences [0.47901560316389713]
The Large Medical Model (LMM) is a generative pre-trained transformer (GPT) designed to guide and predict the broad facets of patient care and healthcare administration.
The model is trained on medical event sequences from over 140M longitudinal patient claims records with a specialized vocabulary built from medical terminology systems.
The LMM is able to improve both cost prediction by 14.1% over the best commercial models and chronic conditions prediction by 1.9% over the best transformer models in research predicting a broad set of conditions.
arXiv Detail & Related papers (2024-09-19T15:38:21Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Insuring Smiles: Predicting routine dental coverage using Spark ML [0.19285000127136376]
We leverage machine learning algorithms to predict if a health insurance plan covers routine dental services for adults.
Our goal is to provide a clinical strategy for individuals and families to select the most suitable insurance plan based on income and expenses.
arXiv Detail & Related papers (2023-10-13T16:31:51Z) - Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes [72.13373216644021]
We study the societal impact of machine learning by considering the collection of models that are deployed in a given context.
We find deployed machine learning is prone to systemic failure, meaning some users are exclusively misclassified by all models available.
These examples demonstrate ecosystem-level analysis has unique strengths for characterizing the societal impact of machine learning.
arXiv Detail & Related papers (2023-07-12T01:11:52Z) - Diagnosis Uncertain Models For Medical Risk Prediction [80.07192791931533]
We consider a patient risk model which has access to vital signs, lab values, and prior history but does not have access to a patient's diagnosis.
We show that such all-cause' risk models have good generalization across diagnoses but have a predictable failure mode.
We propose a fix for this problem by explicitly modeling the uncertainty in risk prediction coming from uncertainty in patient diagnoses.
arXiv Detail & Related papers (2023-06-29T23:36:04Z) - Building predictive models of healthcare costs with open healthcare data [0.0]
We present an approach to developing a predictive model using machine-learning techniques.
We analyzed de-identified patient data from New York StateS, consisting of 2.3 million records in 2016.
We built models to predict costs from patient diagnoses and demographics.
arXiv Detail & Related papers (2023-04-05T02:12:58Z) - Safe AI for health and beyond -- Monitoring to transform a health
service [51.8524501805308]
We will assess the infrastructure required to monitor the outputs of a machine learning algorithm.
We will present two scenarios with examples of monitoring and updates of models.
arXiv Detail & Related papers (2023-03-02T17:27:45Z) - High-Throughput Approach to Modeling Healthcare Costs Using Electronic
Healthcare Records [5.354801701968199]
This study presents the results of a generalizable machine learning approach to predicting medical events built from 40 years of data from >860,000 patients pertaining to >6,700 prescription medications.
It was found that models built using this approach performed well when compared to similar studies predicting physician prescriptions of individual medications.
arXiv Detail & Related papers (2020-11-18T19:06:18Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Using massive health insurance claims data to predict very high-cost
claimants: a machine learning approach [43.861384583351835]
High-cost claimants (HiCCs) represent just 0.16% of the insured population but account for 9% of all healthcare costs.
We applied machine learning to train binary classification models to calculate the personal risk of HiCC.
Our results demonstrate that high-performing predictive models can be constructed using claims data and publicly available data alone.
arXiv Detail & Related papers (2019-12-30T18:01:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.