Predicting Treatment Adherence of Tuberculosis Patients at Scale
- URL: http://arxiv.org/abs/2211.02943v1
- Date: Sat, 5 Nov 2022 17:00:21 GMT
- Title: Predicting Treatment Adherence of Tuberculosis Patients at Scale
- Authors: Mihir Kulkarni, Satvik Golechha, Rishi Raj, Jithin Sreedharan, Ankit
Bhardwaj, Santanu Rathod, Bhavin Vadera, Jayakrishna Kurada, Sanjay Mattoo,
Rajendra Joshi, Kirankumar Rade, Alpan Raval
- Abstract summary: Non-adherence to TB medication is a significant cause of mortality and morbidity.
We formulate and solve the machine learning problem of early prediction of non-adherence based on a custom rank-based metric.
Our findings indicate that risk stratification of non-adherent patients is a viable, deployable-at-scale ML solution.
- Score: 0.6873562466909032
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Tuberculosis (TB), an infectious bacterial disease, is a significant cause of
death, especially in low-income countries, with an estimated ten million new
cases reported globally in $2020$. While TB is treatable, non-adherence to the
medication regimen is a significant cause of morbidity and mortality. Thus,
proactively identifying patients at risk of dropping off their medication
regimen enables corrective measures to mitigate adverse outcomes. Using a proxy
measure of extreme non-adherence and a dataset of nearly $700,000$ patients
from four states in India, we formulate and solve the machine learning (ML)
problem of early prediction of non-adherence based on a custom rank-based
metric. We train ML models and evaluate against baselines, achieving a $\sim
100\%$ lift over rule-based baselines and $\sim 214\%$ over a random
classifier, taking into account country-wide large-scale future deployment. We
deal with various issues in the process, including data quality,
high-cardinality categorical data, low target prevalence, distribution shift,
variation across cohorts, algorithmic fairness, and the need for robustness and
explainability. Our findings indicate that risk stratification of non-adherent
patients is a viable, deployable-at-scale ML solution.
Related papers
- Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare [10.393967785465536]
In the domain of applied ML for health, it is common to predict $Y$ from $X$ without considering further information about the patient.
In this work, we highlight a data generating mechanism common to healthcare settings and discuss how recent theoretical results from the causality literature can be applied to build robust predictive models.
arXiv Detail & Related papers (2024-10-25T14:13:09Z) - Introducing the Large Medical Model: State of the art healthcare cost and risk prediction with transformers trained on patient event sequences [0.47901560316389713]
The Large Medical Model (LMM) is a generative pre-trained transformer (GPT) designed to guide and predict the broad facets of patient care and healthcare administration.
The model is trained on medical event sequences from over 140M longitudinal patient claims records with a specialized vocabulary built from medical terminology systems.
The LMM is able to improve both cost prediction by 14.1% over the best commercial models and chronic conditions prediction by 1.9% over the best transformer models in research predicting a broad set of conditions.
arXiv Detail & Related papers (2024-09-19T15:38:21Z) - SepsisLab: Early Sepsis Prediction with Uncertainty Quantification and Active Sensing [67.8991481023825]
Sepsis is the leading cause of in-hospital mortality in the USA.
Existing predictive models are usually trained on high-quality data with few missing information.
For the potential high-risk patients with low confidence due to limited observations, we propose a robust active sensing algorithm.
arXiv Detail & Related papers (2024-07-24T04:47:36Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Advances in Prediction of Readmission Rates Using Long Term Short Term
Memory Networks on Healthcare Insurance Data [1.454498931674109]
30-day hospital readmission is a long standing medical problem that affects patients' morbidity and mortality and costs billions of dollars annually.
We developed a bi-directional Long Short Term Memory (LSTM) Network that is able to use readily available insurance data.
Our results demonstrate that a machine learning model is able to predict risk of inpatient readmission with reasonable accuracy for all patients.
arXiv Detail & Related papers (2022-06-30T19:07:10Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - From predictions to prescriptions: A data-driven response to COVID-19 [42.57407485467993]
We propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19.
We build personalized calculators to predict the risk of infection and mortality.
We propose an optimization model to re-allocate ventilators and alleviate shortages.
arXiv Detail & Related papers (2020-06-30T03:34:00Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.