Online nonnegative CP-dictionary learning for Markovian data
- URL: http://arxiv.org/abs/2009.07612v4
- Date: Sun, 3 Apr 2022 00:07:43 GMT
- Title: Online nonnegative CP-dictionary learning for Markovian data
- Authors: Hanbaek Lyu and Christopher Strohmeier and Deanna Needell
- Abstract summary: We introduce a novel algorithm that learns a CANDECOMP/PARAFAC basis from a given stream of tensor-valued data under general constraints.
We prove that our algorithm converges almost surely to the set of stationary points of the objective function under the hypothesis that the sequence of data tensors is generated by an underlying Markov chain.
- Score: 8.490619842547739
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Online Tensor Factorization (OTF) is a fundamental tool in learning
low-dimensional interpretable features from streaming multi-modal data. While
various algorithmic and theoretical aspects of OTF have been investigated
recently, a general convergence guarantee to stationary points of the objective
function without any incoherence or sparsity assumptions is still lacking even
for the i.i.d. case. In this work, we introduce a novel algorithm that learns a
CANDECOMP/PARAFAC (CP) basis from a given stream of tensor-valued data under
general constraints, including nonnegativity constraints that induce
interpretability of the learned CP basis. We prove that our algorithm converges
almost surely to the set of stationary points of the objective function under
the hypothesis that the sequence of data tensors is generated by an underlying
Markov chain. Our setting covers the classical i.i.d. case as well as a wide
range of application contexts including data streams generated by independent
or MCMC sampling. Our result closes a gap between OTF and Online Matrix
Factorization in global convergence analysis \commHL{for CP-decompositions}.
Experimentally, we show that our algorithm converges much faster than standard
algorithms for nonnegative tensor factorization tasks on both synthetic and
real-world data. Also, we demonstrate the utility of our algorithm on a diverse
set of examples from image, video, and time-series data, illustrating how one
may learn qualitatively different CP-dictionaries from the same tensor data by
exploiting the tensor structure in multiple ways.
Related papers
- Multi-View Clustering via Semi-non-negative Tensor Factorization [120.87318230985653]
We develop a novel multi-view clustering based on semi-non-negative tensor factorization (Semi-NTF)
Our model directly considers the between-view relationship and exploits the between-view complementary information.
In addition, we provide an optimization algorithm for the proposed method and prove mathematically that the algorithm always converges to the stationary KKT point.
arXiv Detail & Related papers (2023-03-29T14:54:19Z) - Faster Adaptive Federated Learning [84.38913517122619]
Federated learning has attracted increasing attention with the emergence of distributed data.
In this paper, we propose an efficient adaptive algorithm (i.e., FAFED) based on momentum-based variance reduced technique in cross-silo FL.
arXiv Detail & Related papers (2022-12-02T05:07:50Z) - Fast and Provable Tensor Robust Principal Component Analysis via Scaled
Gradient Descent [30.299284742925852]
This paper tackles tensor robust principal component analysis (RPCA)
It aims to recover a low-rank tensor from its observations contaminated by sparse corruptions.
We show that the proposed algorithm achieves better and more scalable performance than state-of-the-art matrix and tensor RPCA algorithms.
arXiv Detail & Related papers (2022-06-18T04:01:32Z) - Truncated tensor Schatten p-norm based approach for spatiotemporal
traffic data imputation with complicated missing patterns [77.34726150561087]
We introduce four complicated missing patterns, including missing and three fiber-like missing cases according to the mode-drivenn fibers.
Despite nonity of the objective function in our model, we derive the optimal solutions by integrating alternating data-mputation method of multipliers.
arXiv Detail & Related papers (2022-05-19T08:37:56Z) - A Fast Parallel Tensor Decomposition with Optimal Stochastic Gradient
Descent: an Application in Structural Damage Identification [1.536989504296526]
We propose a novel algorithm, FP-CPD, to parallelize the CANDECOMP/PARAFAC (CP) decomposition of a tensor $mathcalX in mathbbR I_1 times dots times I_N $.
arXiv Detail & Related papers (2021-11-04T05:17:07Z) - Partial Counterfactual Identification from Observational and
Experimental Data [83.798237968683]
We develop effective Monte Carlo algorithms to approximate the optimal bounds from an arbitrary combination of observational and experimental data.
Our algorithms are validated extensively on synthetic and real-world datasets.
arXiv Detail & Related papers (2021-10-12T02:21:30Z) - Fractal Structure and Generalization Properties of Stochastic
Optimization Algorithms [71.62575565990502]
We prove that the generalization error of an optimization algorithm can be bounded on the complexity' of the fractal structure that underlies its generalization measure.
We further specialize our results to specific problems (e.g., linear/logistic regression, one hidden/layered neural networks) and algorithms.
arXiv Detail & Related papers (2021-06-09T08:05:36Z) - Provable Online CP/PARAFAC Decomposition of a Structured Tensor via
Dictionary Learning [18.464203215845373]
We consider the problem of factorizing a structured 3-mutation tensor into its constituent Polyadic (CP) factors.
We develop a provable algorithm for structured tensor factorization.
arXiv Detail & Related papers (2020-06-30T00:31:06Z) - Consistency of Anchor-based Spectral Clustering [0.0]
Anchor-based techniques reduce the computational complexity of spectral clustering algorithms.
We show that it is amenable to rigorous analysis, as well as being effective in practice.
We find that it is competitive with the state-of-the-art LSC method of Chen and Cai.
arXiv Detail & Related papers (2020-06-24T18:34:41Z) - Supervised Learning for Non-Sequential Data: A Canonical Polyadic
Decomposition Approach [85.12934750565971]
Efficient modelling of feature interactions underpins supervised learning for non-sequential tasks.
To alleviate this issue, it has been proposed to implicitly represent the model parameters as a tensor.
For enhanced expressiveness, we generalize the framework to allow feature mapping to arbitrarily high-dimensional feature vectors.
arXiv Detail & Related papers (2020-01-27T22:38:40Z) - A Unified Framework for Coupled Tensor Completion [42.19293115131073]
Coupled tensor decomposition reveals the joint data structure by incorporating priori knowledge that come from the latent coupled factors.
The TR has powerful expression ability and achieves success in some multi-dimensional data processing applications.
The proposed method is validated on numerical experiments on synthetic data, and experimental results on real-world data demonstrate its superiority over the state-of-the-art methods in terms of recovery accuracy.
arXiv Detail & Related papers (2020-01-09T02:15:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.