Towards Outcome-Driven Patient Subgroups: A Machine Learning Analysis
Across Six Depression Treatment Studies
- URL: http://arxiv.org/abs/2303.15202v2
- Date: Thu, 30 Mar 2023 16:44:45 GMT
- Title: Towards Outcome-Driven Patient Subgroups: A Machine Learning Analysis
Across Six Depression Treatment Studies
- Authors: David Benrimoh, Akiva Kleinerman, Toshi A. Furukawa, Charles F.
Reynolds III, Eric Lenze, Jordan Karp, Benoit Mulsant, Caitrin Armstrong,
Joseph Mehltretter, Robert Fratila, Kelly Perlman, Sonia Israel, Myriam
Tanguay-Sela, Christina Popescu, Grace Golden, Sabrina Qassim, Alexandra
Anacleto, Adam Kapelner, Ariel Rosenfeld, Gustavo Turecki
- Abstract summary: We analyzed data from six clinical trials of pharmacological treatment for depression using a neural network model.
A model classifying remission and outputting individual remission probabilities for five first-line monotherapies and three combination treatments was trained.
Post-hoc analyses yielded clusters (subgroups) based on patient prototypes learned during training.
- Score: 41.34047608276278
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Major depressive disorder (MDD) is a heterogeneous condition; multiple
underlying neurobiological substrates could be associated with treatment
response variability. Understanding the sources of this variability and
predicting outcomes has been elusive. Machine learning has shown promise in
predicting treatment response in MDD, but one limitation has been the lack of
clinical interpretability of machine learning models. We analyzed data from six
clinical trials of pharmacological treatment for depression (total n = 5438)
using the Differential Prototypes Neural Network (DPNN), a neural network model
that derives patient prototypes which can be used to derive treatment-relevant
patient clusters while learning to generate probabilities for differential
treatment response. A model classifying remission and outputting individual
remission probabilities for five first-line monotherapies and three combination
treatments was trained using clinical and demographic data. Model validity and
clinical utility were measured based on area under the curve (AUC) and expected
improvement in sample remission rate with model-guided treatment, respectively.
Post-hoc analyses yielded clusters (subgroups) based on patient prototypes
learned during training. Prototypes were evaluated for interpretability by
assessing differences in feature distributions and treatment-specific outcomes.
A 3-prototype model achieved an AUC of 0.66 and an expected absolute
improvement in population remission rate compared to the sample remission rate.
We identified three treatment-relevant patient clusters which were clinically
interpretable. It is possible to produce novel treatment-relevant patient
profiles using machine learning models; doing so may improve precision medicine
for depression. Note: This model is not currently the subject of any active
clinical trials and is not intended for clinical use.
Related papers
- TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Enabling scalable clinical interpretation of ML-based phenotypes using
real world data [0.0]
This study investigates approaches to perform patient stratification analysis at scale using large EHR datasets.
We have developed several tools to facilitate the clinical evaluation and interpretation of unsupervised patient stratification results.
arXiv Detail & Related papers (2022-08-02T17:31:03Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - DeepJoint: Robust Survival Modelling Under Clinical Presence Shift [2.9745607433320926]
We propose a recurrent neural network which models three clinical presence dimensions in parallel to the survival outcome.
On a prediction task, explicit modelling of these three processes showed improved performance in comparison to state-of-the-art predictive models.
arXiv Detail & Related papers (2022-05-26T16:42:38Z) - What Do You See in this Patient? Behavioral Testing of Clinical NLP
Models [69.09570726777817]
We introduce an extendable testing framework that evaluates the behavior of clinical outcome models regarding changes of the input.
We show that model behavior varies drastically even when fine-tuned on the same data and that allegedly best-performing models have not always learned the most medically plausible patterns.
arXiv Detail & Related papers (2021-11-30T15:52:04Z) - Longitudinal patient stratification of electronic health records with
flexible adjustment for clinical outcomes [0.7874708385247353]
We develop a recurrent neural network autoencoder to cluster EHR data using reconstruction, outcome, and clustering losses.
We demonstrate the model performance on $29,229$ diabetes patients, showing it finds clusters of patients with both different trajectories and different outcomes.
arXiv Detail & Related papers (2021-11-11T11:21:39Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Language Models Are An Effective Patient Representation Learning
Technique For Electronic Health Record Data [7.260199064831896]
We show that patient representation schemes inspired from techniques in natural language processing can increase the accuracy of clinical prediction models.
Such patient representation schemes enable a 3.5% mean improvement in AUROC on five prediction tasks compared to standard baselines.
arXiv Detail & Related papers (2020-01-06T22:24:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.