Cancer Subtype Identification through Integrating Inter and Intra
Dataset Relationships in Multi-Omics Data
- URL: http://arxiv.org/abs/2312.02195v1
- Date: Sat, 2 Dec 2023 12:11:47 GMT
- Title: Cancer Subtype Identification through Integrating Inter and Intra
Dataset Relationships in Multi-Omics Data
- Authors: Mark Peelen, Leila Bagheriye, and Johan Kwisthout
- Abstract summary: The integration of multi-omics data has emerged as a promising approach for gaining insights into complex diseases such as cancer.
This paper proposes a novel approach to identify cancer subtypes through the integration of multi-omics data for clustering.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The integration of multi-omics data has emerged as a promising approach for
gaining comprehensive insights into complex diseases such as cancer. This paper
proposes a novel approach to identify cancer subtypes through the integration
of multi-omics data for clustering. The proposed method, named LIDAF utilises
affinity matrices based on linear relationships between and within different
omics datasets (Linear Inter and Intra Dataset Affinity Fusion (LIDAF)).
Canonical Correlation Analysis is in this paper employed to create distance
matrices based on Euclidean distances between canonical variates. The distance
matrices are converted to affinity matrices and those are fused in a three-step
process. The proposed LIDAF addresses the limitations of the existing method
resulting in improvement of clustering performance as measured by the Adjusted
Rand Index and the Normalized Mutual Information score. Moreover, our proposed
LIDAF approach demonstrates a notable enhancement in 50% of the log10 rank
p-values obtained from Cox survival analysis, surpassing the performance of the
best reported method, highlighting its potential of identifying distinct cancer
subtypes.
Related papers
- Empirical Bayes Linked Matrix Decomposition [0.0]
We propose an empirical variational Bayesian approach to this problem.
We describe an associated iterative imputation approach that is novel for the single-matrix context.
We show that the method performs very well under different scenarios with respect to recovering underlying low-rank signal.
arXiv Detail & Related papers (2024-08-01T02:13:11Z) - Regularized Projection Matrix Approximation with Applications to Community Detection [1.3761665705201904]
This paper introduces a regularized projection matrix approximation framework designed to recover cluster information from the affinity matrix.
We investigate three distinct penalty functions, each specifically tailored to address bounded, positive, and sparse scenarios.
Numerical experiments conducted on both synthetic and real-world datasets reveal that our regularized projection matrix approximation approach significantly outperforms state-of-the-art methods in clustering performance.
arXiv Detail & Related papers (2024-05-26T15:18:22Z) - Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification [72.77513633290056]
We present a novel approach that combines the eigenanalysis of a covariance matrix evaluated on a training set with a Hessian matrix evaluated on a deep learning model.
Our method captures intricate patterns and relationships, enhancing classification performance.
arXiv Detail & Related papers (2024-02-14T16:10:42Z) - Weakly supervised covariance matrices alignment through Stiefel matrices
estimation for MEG applications [64.20396555814513]
This paper introduces a novel domain adaptation technique for time series data, called Mixing model Stiefel Adaptation (MSA)
We exploit abundant unlabeled data in the target domain to ensure effective prediction by establishing pairwise correspondence with equivalent signal variances between domains.
MSA outperforms recent methods in brain-age regression with task variations using magnetoencephalography (MEG) signals from the Cam-CAN dataset.
arXiv Detail & Related papers (2024-01-24T19:04:49Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - Orthogonalization of data via Gromov-Wasserstein type feedback for
clustering and visualization [5.44192123671277]
We propose an adaptive approach for clustering and visualization of data by an orthogonalization process.
We prove that the method converges globally to a unique fixpoint for certain parameter values.
We confirm that the method produces biologically meaningful clustering results consistent with human expert classification.
arXiv Detail & Related papers (2022-07-25T15:52:11Z) - Fusion of Physiological and Behavioural Signals on SPD Manifolds with
Application to Stress and Pain Detection [1.9844265130823329]
multimodal stress/pain recognition approaches generally extract features from different modalities independently.
This paper proposes a novel geometric framework for multimodal stress/pain detection utilizing Symmetric Positive Definite (SPD) matrices.
arXiv Detail & Related papers (2022-07-17T16:54:24Z) - Clustering Ensemble Meets Low-rank Tensor Approximation [50.21581880045667]
This paper explores the problem of clustering ensemble, which aims to combine multiple base clusterings to produce better performance than that of the individual one.
We propose a novel low-rank tensor approximation-based method to solve the problem from a global perspective.
Experimental results over 7 benchmark data sets show that the proposed model achieves a breakthrough in clustering performance, compared with 12 state-of-the-art methods.
arXiv Detail & Related papers (2020-12-16T13:01:37Z) - Asymptotic Analysis of an Ensemble of Randomly Projected Linear
Discriminants [94.46276668068327]
In [1], an ensemble of randomly projected linear discriminants is used to classify datasets.
We develop a consistent estimator of the misclassification probability as an alternative to the computationally-costly cross-validation estimator.
We also demonstrate the use of our estimator for tuning the projection dimension on both real and synthetic data.
arXiv Detail & Related papers (2020-04-17T12:47:04Z) - Saliency-based Weighted Multi-label Linear Discriminant Analysis [101.12909759844946]
We propose a new variant of Linear Discriminant Analysis (LDA) to solve multi-label classification tasks.
The proposed method is based on a probabilistic model for defining the weights of individual samples.
The Saliency-based weighted Multi-label LDA approach is shown to lead to performance improvements in various multi-label classification problems.
arXiv Detail & Related papers (2020-04-08T19:40:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.