A Fair Experimental Comparison of Neural Network Architectures for
Latent Representations of Multi-Omics for Drug Response Prediction
- URL: http://arxiv.org/abs/2208.14822v1
- Date: Wed, 31 Aug 2022 12:46:08 GMT
- Title: A Fair Experimental Comparison of Neural Network Architectures for
Latent Representations of Multi-Omics for Drug Response Prediction
- Authors: Tony Hauptmann and Stefan Kramer
- Abstract summary: We train and optimize multi-omics integration methods under equal conditions.
We devised a novel method, Omics Stacking, that combines the advantages of intermediate and late integration.
Experiments were conducted on a public drug response data set with multiple omics data.
- Score: 7.690774882108066
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Recent years have seen a surge of novel neural network architectures for the
integration of multi-omics data for prediction. Most of the architectures
include either encoders alone or encoders and decoders, i.e., autoencoders of
various sorts, to transform multi-omics data into latent representations. One
important parameter is the depth of integration: the point at which the latent
representations are computed or merged, which can be either early,
intermediate, or late. The literature on integration methods is growing
steadily, however, close to nothing is known about the relative performance of
these methods under fair experimental conditions and under consideration of
different use cases. We developed a comparison framework that trains and
optimizes multi-omics integration methods under equal conditions. We
incorporated early integration and four recently published deep learning
methods: MOLI, Super.FELT, OmiEmbed, and MOMA. Further, we devised a novel
method, Omics Stacking, that combines the advantages of intermediate and late
integration. Experiments were conducted on a public drug response data set with
multiple omics data (somatic point mutations, somatic copy number profiles and
gene expression profiles) that was obtained from cell lines, patient-derived
xenografts, and patient samples. Our experiments confirmed that early
integration has the lowest predictive performance. Overall, architectures that
integrate triplet loss achieved the best results. Statistical differences can,
overall, rarely be observed, however, in terms of the average ranks of methods,
Super.FELT is consistently performing best in a cross-validation setting and
Omics Stacking best in an external test set setting. The source code of all
experiments is available under
\url{https://github.com/kramerlab/Multi-Omics_analysis}
Related papers
- Supervised Multi-Modal Fission Learning [19.396207029419813]
Learning from multimodal datasets can leverage complementary information and improve performance in prediction tasks.
We propose a Multi-Modal Fission Learning model that simultaneously identifies globally joint, partially joint, and individual components.
arXiv Detail & Related papers (2024-09-30T17:58:03Z) - Multi-modal Gaussian Process Variational Autoencoders for Neural and
Behavioral Data [0.9622208190558754]
We propose an unsupervised latent variable model which extracts temporally evolving shared and independent latents for distinct, simultaneously recorded experimental modalities.
We validate our model on simulated multi-modal data consisting of Poisson spike counts and MNIST images that scale and rotate smoothly over time.
We show that the multi-modal GP-VAE is able to not only identify the shared and independent latent structure across modalities accurately, but provides good reconstructions of both images and neural rates on held-out trials.
arXiv Detail & Related papers (2023-10-04T19:04:55Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - DCID: Deep Canonical Information Decomposition [84.59396326810085]
We consider the problem of identifying the signal shared between two one-dimensional target variables.
We propose ICM, an evaluation metric which can be used in the presence of ground-truth labels.
We also propose Deep Canonical Information Decomposition (DCID) - a simple, yet effective approach for learning the shared variables.
arXiv Detail & Related papers (2023-06-27T16:59:06Z) - Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z) - Learning with MISELBO: The Mixture Cookbook [62.75516608080322]
We present the first ever mixture of variational approximations for a normalizing flow-based hierarchical variational autoencoder (VAE) with VampPrior and a PixelCNN decoder network.
We explain this cooperative behavior by drawing a novel connection between VI and adaptive importance sampling.
We obtain state-of-the-art results among VAE architectures in terms of negative log-likelihood on the MNIST and FashionMNIST datasets.
arXiv Detail & Related papers (2022-09-30T15:01:35Z) - CustOmics: A versatile deep-learning based strategy for multi-omics
integration [0.0]
This paper presents a novel strategy to build a customizable autoencoder model that adapts to the dataset used in the case of high-dimensional multi-source integration.
We will assess the impact of integration strategies on the latent representation and combine the best strategies to propose a new method, CustOmics.
arXiv Detail & Related papers (2022-09-12T14:20:29Z) - Barcode Method for Generative Model Evaluation driven by Topological
Data Analysis [0.0]
In this study, we propose an algorithm named barcode, which is inspired by the topological data analysis.
In extensive experiments on real-world datasets as well as theoretical approach on high-dimensional normal samples, it was found that the 'usual' normality assumption of embedded vectors has several drawbacks.
arXiv Detail & Related papers (2021-06-04T02:07:07Z) - Brain Image Synthesis with Unsupervised Multivariate Canonical
CSC$\ell_4$Net [122.8907826672382]
We propose to learn dedicated features that cross both intre- and intra-modal variations using a novel CSC$ell_4$Net.
arXiv Detail & Related papers (2021-03-22T05:19:40Z) - Binary Classification of Gaussian Mixtures: Abundance of Support
Vectors, Benign Overfitting and Regularization [39.35822033674126]
We study binary linear classification under a generative Gaussian mixture model.
We derive novel non-asymptotic bounds on the classification error of the latter.
Our results extend to a noisy model with constant probability noise flips.
arXiv Detail & Related papers (2020-11-18T07:59:55Z) - MS-Net: Multi-Site Network for Improving Prostate Segmentation with
Heterogeneous MRI Data [75.73881040581767]
We propose a novel multi-site network (MS-Net) for improving prostate segmentation by learning robust representations.
Our MS-Net improves the performance across all datasets consistently, and outperforms state-of-the-art methods for multi-site learning.
arXiv Detail & Related papers (2020-02-09T14:11:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.