Sparse Generalized Canonical Correlation Analysis: Distributed
Alternating Iteration based Approach
- URL: http://arxiv.org/abs/2004.10981v1
- Date: Thu, 23 Apr 2020 05:53:48 GMT
- Title: Sparse Generalized Canonical Correlation Analysis: Distributed
Alternating Iteration based Approach
- Authors: Jia Cai, Kexin Lv, Junyi Huo, Xiaolin Huang, Jie Yang
- Abstract summary: Sparse canonical correlation analysis (CCA) is a useful statistical tool to detect latent information with sparse structures.
We propose a generalized canonical correlation analysis (GCCA), which could detect the latent relations of multiview data with sparse structures.
- Score: 18.93565942407577
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sparse canonical correlation analysis (CCA) is a useful statistical tool to
detect latent information with sparse structures. However, sparse CCA works
only for two datasets, i.e., there are only two views or two distinct objects.
To overcome this limitation, in this paper, we propose a sparse generalized
canonical correlation analysis (GCCA), which could detect the latent relations
of multiview data with sparse structures. Moreover, the introduced sparsity
could be considered as Laplace prior on the canonical variates. Specifically,
we convert the GCCA into a linear system of equations and impose $\ell_1$
minimization penalty for sparsity pursuit. This results in a nonconvex problem
on Stiefel manifold, which is difficult to solve. Motivated by Boyd's consensus
problem, an algorithm based on distributed alternating iteration approach is
developed and theoretical consistency analysis is investigated elaborately
under mild conditions. Experiments on several synthetic and real world datasets
demonstrate the effectiveness of the proposed algorithm.
Related papers
- On the Stability of a non-hyperbolic nonlinear map with non-bounded set of non-isolated fixed points with applications to Machine Learning [31.263649000946014]
This paper deals with the convergence analysis of the SUCPA (Semi Unsupervised through Prior Adaptation) algorithm.
The convergence analysis is addressed as a dynamical system problem, by studying the local and global stability of the nonlinear map derived from the algorithm.
arXiv Detail & Related papers (2024-01-05T20:04:40Z) - SUnAA: Sparse Unmixing using Archetypal Analysis [62.997667081978825]
This paper introduces a new geological error map technique using archetypal sparse analysis (SUnAA)
First, we design a new model based on archetypal sparse analysis (SUnAA)
arXiv Detail & Related papers (2023-08-09T07:58:33Z) - Exploiting Structure for Optimal Multi-Agent Bayesian Decentralized
Estimation [4.320393382724066]
Key challenge in Bayesian decentralized data fusion is the rumor propagation' or double counting' phenomenon.
We show that by exploiting the probabilistic independence structure in multi-agent decentralized fusion problems a tighter bound can be found.
We then test our new non-monolithic CI algorithm on a large-scale target tracking simulation and show that it achieves a tighter bound and a more accurate estimate.
arXiv Detail & Related papers (2023-07-20T05:16:33Z) - Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - Partial Counterfactual Identification from Observational and
Experimental Data [83.798237968683]
We develop effective Monte Carlo algorithms to approximate the optimal bounds from an arbitrary combination of observational and experimental data.
Our algorithms are validated extensively on synthetic and real-world datasets.
arXiv Detail & Related papers (2021-10-12T02:21:30Z) - Sparse GCA and Thresholded Gradient Descent [9.971356146653973]
Generalized correlation analysis (GCA) is concerned with uncovering linear relationships across datasets.
We study sparse GCA when there are potentially multiple generalized correlations in data.
We propose a thresholded descent algorithm for estimating GCA loading vectors and gradient matrices in high dimensions.
arXiv Detail & Related papers (2021-07-01T11:15:20Z) - Spatially relaxed inference on high-dimensional linear models [48.989769153211995]
We study the properties of ensembled clustered inference algorithms which combine spatially constrained clustering, statistical inference, and ensembling to aggregate several clustered inference solutions.
We show that ensembled clustered inference algorithms control the $delta$-FWER under standard assumptions for $delta$ equal to the largest cluster diameter.
arXiv Detail & Related papers (2021-06-04T16:37:19Z) - Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature
Selection [138.97647716793333]
We propose a simple and efficient unsupervised feature selection method, by combining reconstruction error with $l_2,p$-norm regularization.
We present an efficient optimization algorithm to solve the proposed unsupervised model, and analyse the convergence and computational complexity of the algorithm theoretically.
arXiv Detail & Related papers (2020-12-29T04:08:38Z) - $\ell_0$-based Sparse Canonical Correlation Analysis [7.073210405344709]
Canonical Correlation Analysis (CCA) models are powerful for studying the associations between two sets of variables.
Despite their success, CCA models may break if the number of variables in either of the modalities exceeds the number of samples.
Here, we propose $ell_0$-CCA, a method for learning correlated representations based on sparse subsets of two observed modalities.
arXiv Detail & Related papers (2020-10-12T11:44:15Z) - Computationally efficient sparse clustering [67.95910835079825]
We provide a finite sample analysis of a new clustering algorithm based on PCA.
We show that it achieves the minimax optimal misclustering rate in the regime $|theta infty$.
arXiv Detail & Related papers (2020-05-21T17:51:30Z) - Generalized Canonical Correlation Analysis: A Subspace Intersection
Approach [30.475159163815505]
Generalized Canonical Correlation Analysis (GCCA) is an important tool that finds numerous applications in data mining, machine learning, and artificial intelligence.
This paper offers a fresh algebraic perspective of GCCA based on a (bi-linear) generative model that naturally captures its essence.
A novel GCCA algorithm is proposed based on subspace intersection, which scales up to handle large GCCA tasks.
arXiv Detail & Related papers (2020-03-25T04:04:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.