Multi-Omic Data Integration and Feature Selection for Survival-based
Patient Stratification via Supervised Concrete Autoencoders
- URL: http://arxiv.org/abs/2206.10699v1
- Date: Tue, 21 Jun 2022 19:44:08 GMT
- Title: Multi-Omic Data Integration and Feature Selection for Survival-based
Patient Stratification via Supervised Concrete Autoencoders
- Authors: Pedro Henrique da Costa Avelar, Roman Laddach, Sophia Karagiannis, Min
Wu, Sophia Tsoka
- Abstract summary: We develop a Supervised Autoencoder model for survival-based multi-omic integration.
We report a Concrete Supervised Autoencoder model (CSAE) which uses feature selection to jointly reconstruct the input features as well as predict survival.
Our experiments show that our models outperform or are on par with some of the most commonly used baselines.
- Score: 4.846870511582627
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cancer is a complex disease with significant social and economic impact.
Advancements in high-throughput molecular assays and the reduced cost for
performing high-quality multi-omics measurements have fuelled insights through
machine learning . Previous studies have shown promise on using multiple omic
layers to predict survival and stratify cancer patients. In this paper, we
developed a Supervised Autoencoder (SAE) model for survival-based multi-omic
integration which improves upon previous work, and report a Concrete Supervised
Autoencoder model (CSAE), which uses feature selection to jointly reconstruct
the input features as well as predict survival. Our experiments show that our
models outperform or are on par with some of the most commonly used baselines,
while either providing a better survival separation (SAE) or being more
interpretable (CSAE). We also perform a feature selection stability analysis on
our models and notice that there is a power-law relationship with features
which are commonly associated with survival. The code for this project is
available at: https://github.com/phcavelar/coxae
Related papers
- Enhanced Survival Prediction in Head and Neck Cancer Using Convolutional Block Attention and Multimodal Data Fusion [7.252280210331731]
This paper proposes a deep learning-based approach to predict survival outcomes in head and neck cancer patients.
Our method integrates feature extraction with a Convolutional Block Attention Module (CBAM) and a multi-modal data fusion layer.
The final prediction is achieved through a fully parametric discrete-time survival model.
arXiv Detail & Related papers (2024-10-29T07:56:04Z) - Interpretable Prediction and Feature Selection for Survival Analysis [18.987678432106563]
We present DyS (pronounced dice''), a new survival analysis model that achieves both strong discrimination and interpretability.
DyS is a feature-sparse Generalized Additive Model, combining feature selection and interpretable prediction into one model.
arXiv Detail & Related papers (2024-04-23T02:36:54Z) - Interpretable Survival Analysis for Heart Failure Risk Prediction [50.64739292687567]
We propose a novel survival analysis pipeline that is both interpretable and competitive with state-of-the-art survival models.
Our pipeline achieves state-of-the-art performance and provides interesting and novel insights about risk factors for heart failure.
arXiv Detail & Related papers (2023-10-24T02:56:05Z) - Pathology-and-genomics Multimodal Transformer for Survival Outcome
Prediction [43.1748594898772]
We propose a multimodal transformer (PathOmics) integrating pathology and genomics insights into colon-related cancer survival prediction.
We emphasize the unsupervised pretraining to capture the intrinsic interaction between tissue microenvironments in gigapixel whole slide images.
We evaluate our approach on both TCGA colon and rectum cancer cohorts, showing that the proposed approach is competitive and outperforms state-of-the-art studies.
arXiv Detail & Related papers (2023-07-22T00:59:26Z) - Stacked Autoencoder Based Multi-Omics Data Integration for Cancer
Survival Prediction [3.083561980077652]
We propose a novel method to integrate multi-omics data for cancer survival prediction, called Stacked AutoEncoder-based Survival Prediction Neural Network (SAEsurv-net)
SAEsurv-net addresses the curse of dimensionality with a two-stage dimensionality reduction strategy and handles multi-omics heterogeneity with a stacked computation autoencoder model.
The experiments show that SAEsurv-net outperforms models based on a single type of data as well as other state-of-the-art methods.
arXiv Detail & Related papers (2022-07-08T13:53:11Z) - Analysis of MRI Biomarkers for Brain Cancer Survival Prediction [0.3093890460224435]
Prediction of Overall Survival (OS) of brain cancer patients from multi-modal MRI is a challenging field of research.
Most of the existing literature on survival prediction is based on Radiomic features.
Age was found to be the most important biological predictor.
arXiv Detail & Related papers (2021-09-03T05:35:47Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z) - A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.