Directionally Dependent Multi-View Clustering Using Copula Model
- URL: http://arxiv.org/abs/2003.07494v2
- Date: Sat, 22 Aug 2020 15:34:44 GMT
- Title: Directionally Dependent Multi-View Clustering Using Copula Model
- Authors: Kahkashan Afrin, Ashif S. Iquebal, Mostafa Karimi, Allyson Souris, Se
Yoon Lee, and Bani K. Mallick
- Abstract summary: In genomics studies, there is certain directional dependence between DNA expression, DNA methylation, and RNA expression.
Most of the existing multi-view clustering methods either assume an independent structure or pair-wise (non-directional) dependency.
We propose a copula-based multi-view clustering model where a copula enables the model to accommodate the directional dependence existing in the datasets.
- Score: 3.5097082077065
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent biomedical scientific problems, it is a fundamental issue to
integratively cluster a set of objects from multiple sources of datasets. Such
problems are mostly encountered in genomics, where data is collected from
various sources, and typically represent distinct yet complementary
information. Integrating these data sources for multi-source clustering is
challenging due to their complex dependence structure including directional
dependency. Particularly in genomics studies, it is known that there is certain
directional dependence between DNA expression, DNA methylation, and RNA
expression, widely called The Central Dogma.
Most of the existing multi-view clustering methods either assume an
independent structure or pair-wise (non-directional) dependency, thereby
ignoring the directional relationship. Motivated by this, we propose a
copula-based multi-view clustering model where a copula enables the model to
accommodate the directional dependence existing in the datasets. We conduct a
simulation experiment where the simulated datasets exhibiting inherent
directional dependence: it turns out that ignoring the directional dependence
negatively affects the clustering performance. As a real application, we
applied our model to the breast cancer tumor samples collected from The Cancer
Genome Altas (TCGA).
Related papers
- Causal Learning for Heterogeneous Subgroups Based on Nonlinear Causal Kernel Clustering [11.9672224014053]
The nonlinear Causal Kernel Clustering method is introduced for heterogeneous causal learning.
Experimental results indicate that the method performs well in identifying heterogeneous subgroups and enhancing causal learning.
arXiv Detail & Related papers (2025-01-20T17:43:17Z) - Learning Divergence Fields for Shift-Robust Graph Representations [73.11818515795761]
In this work, we propose a geometric diffusion model with learnable divergence fields for the challenging problem with interdependent data.
We derive a new learning objective through causal inference, which can guide the model to learn generalizable patterns of interdependence that are insensitive across domains.
arXiv Detail & Related papers (2024-06-07T14:29:21Z) - UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human
Generation [59.77275587857252]
A holistic human dataset inevitably has insufficient and low-resolution information on local parts.
We propose to use multi-source datasets with various resolution images to jointly learn a high-resolution human generative model.
arXiv Detail & Related papers (2023-09-25T17:58:46Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - Linear Connectivity Reveals Generalization Strategies [54.947772002394736]
Some pairs of finetuned models have large barriers of increasing loss on the linear paths between them.
We find distinct clusters of models which are linearly connected on the test loss surface, but are disconnected from models outside the cluster.
Our work demonstrates how the geometry of the loss surface can guide models towards different functions.
arXiv Detail & Related papers (2022-05-24T23:43:02Z) - MoReL: Multi-omics Relational Learning [26.484803417186384]
We propose a novel deep Bayesian generative model to efficiently infer a multi-partite graph encoding molecular interactions across heterogeneous views.
With such an optimal transport regularization in the deep Bayesian generative model, it not only allows incorporating view-specific side information, but also increases the model flexibility with the distribution-based regularization.
arXiv Detail & Related papers (2022-03-15T02:50:07Z) - Variational Selective Autoencoder: Learning from Partially-Observed
Heterogeneous Data [45.23338389559936]
We propose the variational selective autoencoder (VSAE) to learn representations from partially-observed heterogeneous data.
VSAE learns the latent dependencies in heterogeneous data by modeling the joint distribution of observed data, unobserved data, and the imputation mask.
It results in a unified model for various downstream tasks including data generation and imputation.
arXiv Detail & Related papers (2021-02-25T04:39:13Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z) - Bayesian Sparse Factor Analysis with Kernelized Observations [67.60224656603823]
Multi-view problems can be faced with latent variable models.
High-dimensionality and non-linear issues are traditionally handled by kernel methods.
We propose merging both approaches into single model.
arXiv Detail & Related papers (2020-06-01T14:25:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.