A cost-based multi-layer network approach for the discovery of patient
phenotypes
- URL: http://arxiv.org/abs/2209.09032v2
- Date: Tue, 20 Sep 2022 08:17:34 GMT
- Title: A cost-based multi-layer network approach for the discovery of patient
phenotypes
- Authors: Clara Puga, Uli Niemann, Winfried Schlee, Myra Spiliopoulou
- Abstract summary: We propose a cost-based layer selector model for detecting phenotypes using a community detection approach.
Our goal is to minimize the number of features used to build these phenotypes while preserving its quality.
For some post-treatment variables, predictors using phenotypes from COBALT as features outperformed those using phenotypes detected by traditional clustering methods.
- Score: 2.816539638885011
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Clinical records frequently include assessments of the characteristics of
patients, which may include the completion of various questionnaires. These
questionnaires provide a variety of perspectives on a patient's current state
of well-being. Not only is it critical to capture the heterogeneity given by
these perspectives, but there is also a growing demand for developing
cost-effective technologies for clinical phenotyping. Filling out many
questionnaires may be a strain for the patients and therefore costly. In this
work, we propose COBALT -- a cost-based layer selector model for detecting
phenotypes using a community detection approach. Our goal is to minimize the
number of features used to build these phenotypes while preserving its quality.
We test our model using questionnaire data from chronic tinnitus patients and
represent the data in a multi-layer network structure. The model is then
evaluated by predicting post-treatment data using baseline features (age,
gender, and pre-treatment data) as well as the identified phenotypes as a
feature. For some post-treatment variables, predictors using phenotypes from
COBALT as features outperformed those using phenotypes detected by traditional
clustering methods. Moreover, using phenotype data to predict post-treatment
data proved beneficial in comparison with predictors that were solely trained
with baseline features.
Related papers
- Clustering of Disease Trajectories with Explainable Machine Learning: A Case Study on Postoperative Delirium Phenotypes [13.135589459700865]
We propose an approach that combines supervised machine learning for personalized POD risk prediction with unsupervised clustering techniques to uncover potential POD phenotypes.
We show that clustering patients in the SHAP feature importance space successfully recovers the true underlying phenotypes, outperforming clustering in the raw feature space.
arXiv Detail & Related papers (2024-05-06T10:05:46Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in
Disease Progression [82.85825388788567]
We develop a novel temporal clustering method, T-Phenotype, to discover phenotypes of predictive temporal patterns from labeled time-series data.
We show that T-Phenotype achieves the best phenotype discovery performance over all the evaluated baselines.
arXiv Detail & Related papers (2023-02-24T13:30:35Z) - Unsupervised EHR-based Phenotyping via Matrix and Tensor Decompositions [0.6875312133832078]
We provide a comprehensive review of low-rank approximation-based approaches for computational phenotyping.
Recent developments have adapted low-rank data approximation methods by incorporating different constraints and regularizations that facilitate interpretability further.
arXiv Detail & Related papers (2022-09-01T09:47:27Z) - Cancer Subtyping by Improved Transcriptomic Features Using Vector
Quantized Variational Autoencoder [10.835673227875615]
We propose Vector Quantized Variational AutoEncoder (VQ-VAE) to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering.
VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method.
arXiv Detail & Related papers (2022-07-20T09:47:53Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - What Do You See in this Patient? Behavioral Testing of Clinical NLP
Models [69.09570726777817]
We introduce an extendable testing framework that evaluates the behavior of clinical outcome models regarding changes of the input.
We show that model behavior varies drastically even when fine-tuned on the same data and that allegedly best-performing models have not always learned the most medically plausible patterns.
arXiv Detail & Related papers (2021-11-30T15:52:04Z) - Hybrid deep learning methods for phenotype prediction from clinical
notes [4.866431869728018]
This paper proposes a novel hybrid model for automatically extracting patient phenotypes using natural language processing and deep learning models.
The proposed hybrid model is based on a neural bidirectional sequence model (BiLSTM or BiGRU) and a Convolutional Neural Network (CNN) for identifying patient's phenotypes in discharge reports.
arXiv Detail & Related papers (2021-08-16T05:57:28Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Temporal Phenotyping using Deep Predictive Clustering of Disease
Progression [97.88605060346455]
We develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest.
Experiments on two real-world datasets show that our model achieves superior clustering performance over state-of-the-art benchmarks.
arXiv Detail & Related papers (2020-06-15T20:48:43Z) - Towards Patient Record Summarization Through Joint Phenotype Learning in
HIV Patients [1.598617270887469]
We propose an unsupervised phenotyping approach that jointly learns a large number of phenotypes/problems across structured and unstructured data.
We ground our experiments in phenotyping patients from an HIV clinic in a large urban care institution.
We find that the learned phenotypes and their relatedness are clinically valid when assessed by clinical experts.
arXiv Detail & Related papers (2020-03-09T15:41:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.