Bayesian Simultaneous Factorization and Prediction Using Multi-Omic Data
- URL: http://arxiv.org/abs/2211.16403v2
- Date: Wed, 30 Nov 2022 01:46:33 GMT
- Title: Bayesian Simultaneous Factorization and Prediction Using Multi-Omic Data
- Authors: Sarah Samorodnitsky, Chris H. Wendt, Eric F. Lock
- Abstract summary: We propose a framework for inference on the estimated factorization, simultaneously predict important disease phenotypes or clinical outcomes, and accommodate multiple imputation.
We use BSFP to predict lung function based on the bronchoalveolar lavage metabolome and proteome from a study of HIV-associated OLD.
Our analysis reveals a distinct cluster of patients with OLD driven by shared metabolomic and proteomic expression patterns, as well as multi-omic patterns related to lung function decline.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Understanding of the pathophysiology of obstructive lung disease (OLD) is
limited by available methods to examine the relationship between multi-omic
molecular phenomena and clinical outcomes. Integrative factorization methods
for multi-omic data can reveal latent patterns of variation describing
important biological signal. However, most methods do not provide a framework
for inference on the estimated factorization, simultaneously predict important
disease phenotypes or clinical outcomes, nor accommodate multiple imputation.
To address these gaps, we propose Bayesian Simultaneous Factorization (BSF). We
use conjugate normal priors and show that the posterior mode of this model can
be estimated by solving a structured nuclear norm-penalized objective that also
achieves rank selection and motivates the choice of hyperparameters. We then
extend BSF to simultaneously predict a continuous or binary response, termed
Bayesian Simultaneous Factorization and Prediction (BSFP). BSF and BSFP
accommodate concurrent imputation and full posterior inference for missing
data, including "blockwise" missingness, and BSFP offers prediction of
unobserved outcomes. We show via simulation that BSFP is competitive in
recovering latent variation structure, as well as the importance of propagating
uncertainty from the estimated factorization to prediction. We also study the
imputation performance of BSF via simulation under missing-at-random and
missing-not-at-random assumptions. Lastly, we use BSFP to predict lung function
based on the bronchoalveolar lavage metabolome and proteome from a study of
HIV-associated OLD. Our analysis reveals a distinct cluster of patients with
OLD driven by shared metabolomic and proteomic expression patterns, as well as
multi-omic patterns related to lung function decline. Software is freely
available at https://github.com/sarahsamorodnitsky/BSFP .
Related papers
- Integrating Probabilistic Trees and Causal Networks for Clinical and Epidemiological Data [18.539194412343104]
This study introduces the Probabilistic Causal Fusion (PCF) framework.
PCF integrates Causal Bayesian Networks (CBNs) and Probability Trees (PTrees) to extend beyond predictions.
It was validated on three real-world healthcare datasets.
arXiv Detail & Related papers (2025-01-27T11:34:19Z) - HMF: A Hybrid Multi-Factor Framework for Dynamic Intraoperative Hypotension Prediction [2.7807763048110337]
Intraoperative hypotension (IOH) prediction using Mean Arterial Pressure (MAP) is a critical research area with significant implications for patient outcomes during surgery.
Existing approaches predominantly employ static modeling paradigms that overlook the dynamic nature of physiological signals.
We introduce a novel Hybrid Multi-Factor (HMF) framework that reformulates IOH prediction as a blood pressure forecasting task.
arXiv Detail & Related papers (2024-09-17T10:46:41Z) - Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z) - FUSE: Fast Unified Simulation and Estimation for PDEs [11.991297011923004]
We argue that solving both problems within the same framework can lead to consistent gains in accuracy and robustness.
We present the capabilities of the proposed methodology for predicting continuous and discrete biomarkers in full-body haemodynamics simulations.
arXiv Detail & Related papers (2024-05-23T13:37:26Z) - Conditional Score-Based Diffusion Model for Cortical Thickness
Trajectory Prediction [29.415616701032604]
Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals.
We propose a conditional score-based diffusion model to generate CTh trajectories with the given baseline information.
Our model has a near-zero bias with narrow confidential 95% interval compared to the ground-truth CTh in 6-36 months.
arXiv Detail & Related papers (2024-03-11T17:26:18Z) - Bayesian Networks for the robust and unbiased prediction of depression
and its symptoms utilizing speech and multimodal data [65.28160163774274]
We apply a Bayesian framework to capture the relationships between depression, depression symptoms, and features derived from speech, facial expression and cognitive game data collected at thymia.
arXiv Detail & Related papers (2022-11-09T14:48:13Z) - Survival Analysis for Idiopathic Pulmonary Fibrosis using CT Images and
Incomplete Clinical Data [17.162038700963418]
Idiopathic Pulmonary Fibrosis (IPF) is an inexorably progressive fibrotic lung disease with a variable and unpredictable rate of progression.
CT scans of the lungs inform clinical assessment of IPF patients and contain pertinent information related to disease progression.
We propose a multi-modal method that uses neural networks and memory banks to predict the survival of IPF patients using clinical and imaging data.
arXiv Detail & Related papers (2022-03-21T23:48:47Z) - Flexible Amortized Variational Inference in qBOLD MRI [56.4324135502282]
Oxygen extraction fraction (OEF) and deoxygenated blood volume (DBV) are more ambiguously determined from the data.
Existing inference methods tend to yield very noisy and underestimated OEF maps, while overestimating DBV.
This work describes a novel probabilistic machine learning approach that can infer plausible distributions of OEF and DBV.
arXiv Detail & Related papers (2022-03-11T10:47:16Z) - When in Doubt: Neural Non-Parametric Uncertainty Quantification for
Epidemic Forecasting [70.54920804222031]
Most existing forecasting models disregard uncertainty quantification, resulting in mis-calibrated predictions.
Recent works in deep neural models for uncertainty-aware time-series forecasting also have several limitations.
We model the forecasting task as a probabilistic generative process and propose a functional neural process model called EPIFNP.
arXiv Detail & Related papers (2021-06-07T18:31:47Z) - Bayesian prognostic covariate adjustment [59.75318183140857]
Historical data about disease outcomes can be integrated into the analysis of clinical trials in many ways.
We build on existing literature that uses prognostic scores from a predictive model to increase the efficiency of treatment effect estimates.
arXiv Detail & Related papers (2020-12-24T05:19:03Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.