Robust bilinear factor analysis based on the matrix-variate $t$ distribution
- URL: http://arxiv.org/abs/2401.02203v1
- Date: Thu, 4 Jan 2024 11:15:44 GMT
- Title: Robust bilinear factor analysis based on the matrix-variate $t$ distribution
- Authors: Xuan Ma, Jianhua Zhao, Changchun Shang, Fen Jiang, Philip L.H. Yu
- Abstract summary: Factor Analysis based on the $t$ distribution ($t$fa) is useful for extracting common factors from heavy-tailed or contaminated data.
This paper proposes a novel robust factor analysis model, namely bilinear factor analysis built on the $t$ distribution ($t$bfa).
It can simultaneously extract common factors for both the row and column variables of interest from heavy-tailed or contaminated matrix data.
- Score: 2.6530267536011913
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Factor Analysis based on the multivariate $t$ distribution ($t$fa) is a useful
robust tool for extracting common factors from heavy-tailed or contaminated data.
However, $t$fa is only applicable to vector data. When $t$fa is applied to
matrix data, it is common to first vectorize the matrix observations. This
introduces two challenges for $t$fa: (i) the inherent matrix structure of the
data is broken, and (ii) robustness may be lost, as vectorized matrix data
typically results in a high data dimension, which could easily lead to the
breakdown of $t$fa. To address these issues, starting from the intrinsic matrix
structure of matrix data, a novel robust factor analysis model, namely bilinear
factor analysis built on the matrix-variate $t$ distribution ($t$bfa), is
proposed in this paper. Its novelty is that it can simultaneously extract
common factors for both the row and column variables of interest from
heavy-tailed or contaminated matrix data. Two efficient algorithms for maximum
likelihood estimation of $t$bfa are developed. A closed-form expression for the
Fisher information matrix, used to assess the accuracy of the parameter
estimates, is derived. Empirical studies are conducted to understand the
proposed $t$bfa model and compare it with related competitors. The results demonstrate the
superiority and practicality of $t$bfa. Importantly, $t$bfa exhibits a
significantly higher breakdown point than $t$fa, making it more suitable for
matrix data.
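The building block behind such a model is the matrix-variate $t$ density itself. Below is a rough sketch (my own code in the Gupta-Nagar parameterization, not the authors' implementation) of evaluating its log-density; in a bilinear factor model, the row and column scale matrices would then plausibly take low-rank-plus-diagonal forms induced by the row and column factor loadings.

```python
import numpy as np
from scipy.special import multigammaln

def matrix_t_logpdf(X, M, Sigma, Omega, nu):
    """Log-density of the matrix-variate t distribution T_{n,p}(nu, M, Sigma, Omega):
    Sigma is the n x n row scale matrix, Omega the p x p column scale matrix,
    nu the degrees of freedom (Gupta-Nagar parameterization, assumed here)."""
    n, p = X.shape
    D = X - M
    # Core term |I_n + Sigma^{-1} D Omega^{-1} D^T|, computed via solves, not inverses
    core = np.eye(n) + np.linalg.solve(Sigma, D) @ np.linalg.solve(Omega, D.T)
    _, logdet_core = np.linalg.slogdet(core)
    a = 0.5 * (nu + n + p - 1)
    return (multigammaln(a, n) - multigammaln(0.5 * (nu + p - 1), n)
            - 0.5 * n * p * np.log(np.pi)
            - 0.5 * p * np.linalg.slogdet(Sigma)[1]
            - 0.5 * n * np.linalg.slogdet(Omega)[1]
            - a * logdet_core)
```

Maximum likelihood estimation would maximize the sum of such log-densities over the observed matrices; the paper's two estimation algorithms and its Fisher-information expressions are not reproduced here.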
Related papers
- Large-scale gradient-based training of Mixtures of Factor Analyzers [67.21722742907981]
This article contributes both a theoretical analysis and a new method for efficient high-dimensional training by gradient descent.
We prove that MFA training and inference/sampling can be performed based on precision matrices, which does not require matrix inversions after training is completed.
Beyond the theoretical analysis, we apply MFA to typical image datasets such as SVHN and MNIST, and demonstrate sample generation and outlier detection.
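The precision-matrix point can be made concrete with the standard Woodbury identity for a single factor-analysis component with covariance $WW^T + \mathrm{diag}(d)$: once $W$ and $d$ are trained, evaluating the Gaussian log-density needs only a small $k \times k$ solve, never a $p \times p$ inversion. A minimal sketch in my own notation, not the article's code:

```python
import numpy as np

def fa_logpdf(x, mu, W, d):
    """Gaussian log-density with covariance W W^T + diag(d), via the Woodbury
    identity and the matrix determinant lemma: only a k x k system is solved
    (k = number of factors), no p x p matrix is ever inverted."""
    p, k = W.shape
    r = x - mu
    d_inv = 1.0 / d                           # diagonal inverse is elementwise
    A = np.eye(k) + (W.T * d_inv) @ W         # k x k capacitance matrix
    b = np.linalg.solve(A, W.T @ (d_inv * r))
    quad = r @ (d_inv * r) - (d_inv * r) @ (W @ b)        # r^T Sigma^{-1} r
    logdet = np.linalg.slogdet(A)[1] + np.sum(np.log(d))  # determinant lemma
    return -0.5 * (p * np.log(2 * np.pi) + logdet + quad)
```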
arXiv Detail & Related papers (2023-08-26T06:12:33Z)
- One-sided Matrix Completion from Two Observations Per Row [95.87811229292056]
We propose a natural algorithm that involves imputing the missing values of the matrix $X^T X$.
We evaluate our algorithm on one-sided recovery of synthetic data and low-coverage genome sequencing.
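To convey the flavor of filling in a partially observed Gram matrix, here is a generic hard-impute heuristic under an assumed low rank; this is only a sketch of the general idea, not the paper's algorithm or its guarantees.

```python
import numpy as np

def hard_impute(G_obs, mask, rank, iters=200):
    """Fill the unobserved entries of a partially observed (symmetric) Gram
    matrix with its best rank-r approximation, alternating an SVD projection
    with re-insertion of the observed entries."""
    Z = np.where(mask, G_obs, 0.0)
    for _ in range(iters):
        U, s, Vt = np.linalg.svd(Z, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank]
        Z = np.where(mask, G_obs, low_rank)  # keep observed, impute the rest
    return Z

# Toy usage: a rank-2 Gram matrix with a symmetric observation pattern
rng = np.random.default_rng(0)
B = rng.normal(size=(12, 2))
G = B @ B.T                                  # rank-2 "Gram" matrix
mask = np.triu(rng.random((12, 12)) < 0.6)
mask = mask | mask.T
G_hat = hard_impute(G, mask, rank=2)
print(np.max(np.abs(G_hat - G)))             # small if enough entries observed
```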
arXiv Detail & Related papers (2023-06-06T22:35:16Z)
- Learning Graphical Factor Models with Riemannian Optimization [70.13748170371889]
This paper proposes a flexible algorithmic framework for graph learning under low-rank structural constraints.
The problem is expressed as penalized maximum likelihood estimation of an elliptical distribution.
We leverage geometries of positive definite matrices and positive semi-definite matrices of fixed rank that are well suited to elliptical models.
arXiv Detail & Related papers (2022-10-21T13:19:45Z)
- Unitary Approximate Message Passing for Matrix Factorization [90.84906091118084]
We consider matrix factorization (MF) with certain constraints, which finds wide applications in various areas.
We develop a Bayesian approach to MF with an efficient message passing implementation, called UAMPMF.
We show that UAMPMF significantly outperforms state-of-the-art algorithms in terms of recovery accuracy, robustness and computational complexity.
arXiv Detail & Related papers (2022-07-31T12:09:32Z)
- On the well-spread property and its relation to linear regression [4.619541348328937]
We show that consistent recovery of the parameter vector in a robust linear regression model is information-theoretically impossible.
We show that it is possible to efficiently certify whether a given $n$-by-$d$ matrix is well-spread if the number of observations is quadratic in the ambient dimension.
arXiv Detail & Related papers (2022-06-16T11:17:44Z)
- Robust factored principal component analysis for matrix-valued outlier accommodation and detection [4.228971753938522]
Factored PCA (FPCA) is a probabilistic extension of PCA for matrix data.
We propose a robust extension of FPCA (RFPCA) for matrix data.
RFPCA can adaptively down-weight outliers and yield robust estimates.
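The down-weighting mechanism is easy to sketch under one common construction, in which the matrix-variate $t$ is read as a Gaussian scale mixture with a single gamma weight per observation; the E-step weight below is standard for that construction, though it may differ from RFPCA's exact formulation.

```python
import numpy as np

def t_weights(Xs, M, Sigma, Omega, nu):
    """E-step robustness weights under the scale-mixture reading
    X | u ~ MN(M, Sigma/u, Omega), u ~ Gamma(nu/2, nu/2). Matrices far from
    the center get a large Mahalanobis term and hence a small weight."""
    n, p = M.shape
    weights = []
    for X in Xs:
        D = X - M
        # delta = tr(Sigma^{-1} D Omega^{-1} D^T), the matrix Mahalanobis distance
        delta = np.trace(np.linalg.solve(Sigma, D) @ np.linalg.solve(Omega, D.T))
        weights.append((nu + n * p) / (nu + delta))  # conditional mean of u
    return np.array(weights)
```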
arXiv Detail & Related papers (2021-12-13T16:12:22Z)
- Classification of high-dimensional data with spiked covariance matrix structure [0.2741266294612775]
We study the classification problem for high-dimensional data with $n$ observations on $p$ features.
We propose an adaptive classifier that first performs dimension reduction on the feature vectors prior to classification in the dimensionally reduced space.
We show that the resulting classifier is Bayes optimal whenever $n \rightarrow \infty$ and $s \sqrt{n^{-1} \ln p} \rightarrow 0$.
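A toy version of the two-stage idea (project onto leading principal directions, then classify in the reduced space) is sketched below; the paper's adaptive choice of the dimension $s$ and its optimality analysis are not reproduced, and the Fisher-discriminant second stage is my own stand-in.

```python
import numpy as np

def reduce_then_classify(X_train, y_train, X_test, s):
    """Project onto the top-s principal directions of the training data,
    then apply a linear (Fisher) discriminant in the reduced space."""
    Xc = X_train - X_train.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    P = Vt[:s].T                               # p x s projection matrix
    Z_tr, Z_te = X_train @ P, X_test @ P
    m0 = Z_tr[y_train == 0].mean(axis=0)
    m1 = Z_tr[y_train == 1].mean(axis=0)
    Sw = np.cov(Z_tr[y_train == 0].T) + np.cov(Z_tr[y_train == 1].T)
    w = np.linalg.solve(Sw, m1 - m0)           # Fisher direction
    thresh = w @ (m0 + m1) / 2
    return (Z_te @ w > thresh).astype(int)
```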
arXiv Detail & Related papers (2021-10-05T11:26:53Z)
- Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature Selection [138.97647716793333]
We propose a simple and efficient unsupervised feature selection method, by combining reconstruction error with $l_{2,p}$-norm regularization.
We present an efficient optimization algorithm to solve the proposed unsupervised model, and analyse the convergence and computational complexity of the algorithm theoretically.
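For reference, the $l_{2,p}$ regularizer is the sum of $p$-th powers of row-wise $l_2$ norms; with $p \in (0, 1]$ it drives entire rows of the projection matrix to zero, which is what performs the feature selection. A small sketch using the definition common in this literature (the paper's exact objective may differ):

```python
import numpy as np

def l2p_penalty(W, p):
    """Row-sparsity-promoting regularizer sum_i ||w_i||_2^p, i.e. the p-th
    power of the l_{2,p} norm of W; rows with zero norm correspond to
    features dropped by the selection."""
    return np.sum(np.linalg.norm(W, axis=1) ** p)
```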
arXiv Detail & Related papers (2020-12-29T04:08:38Z)
- Compressed sensing of low-rank plus sparse matrices [3.8073142980733]
This manuscript develops similar guarantees showing that an $m \times n$ matrix that can be expressed as the sum of a rank-$r$ matrix and an $s$-sparse matrix can be recovered by computationally tractable methods.
Results are shown for synthetic problems, dynamic-foreground/static separation, multispectral imaging, and Robust PCA.
arXiv Detail & Related papers (2020-07-18T15:36:11Z)
- Information-Theoretic Limits for the Matrix Tensor Product [8.206394018475708]
This paper studies a high-dimensional inference problem involving the matrix tensor product of random matrices.
On the technical side, this paper introduces some new techniques for the analysis of high-dimensional matrix-preserving signals.
arXiv Detail & Related papers (2020-05-22T17:03:48Z)
- Supervised Quantile Normalization for Low-rank Matrix Approximation [50.445371939523305]
We learn the parameters of quantile normalization operators that can operate row-wise on the values of $X$ and/or of its factorization $UV$ to improve the quality of the low-rank representation of $X$ itself.
We demonstrate the applicability of these techniques on synthetic and genomics datasets.
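As a point of reference, classic (unsupervised) quantile normalization maps each row onto the sorted values of a fixed target vector by rank; the paper's contribution is to learn the target quantiles end-to-end, which this fixed-target sketch does not attempt (helper name is my own):

```python
import numpy as np

def quantile_normalize_rows(X, target):
    """Replace each entry of X by the target quantile matching its rank
    within its row -- the fixed-target operator whose parameters the paper
    learns. `target` is any 1-D reference sample."""
    Xq = np.empty_like(X, dtype=float)
    t = np.sort(target)
    for i, row in enumerate(X):
        ranks = np.argsort(np.argsort(row))   # rank of each entry in its row
        idx = np.floor(ranks * len(t) / len(row)).astype(int)
        Xq[i] = t[np.clip(idx, 0, len(t) - 1)]
    return Xq
```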
arXiv Detail & Related papers (2020-02-08T21:06:02Z)