Data embedding and prediction by sparse tropical matrix factorization
- URL: http://arxiv.org/abs/2012.05210v1
- Date: Wed, 9 Dec 2020 18:09:17 GMT
- Title: Data embedding and prediction by sparse tropical matrix factorization
- Authors: Amra Omanovi\'c, Hilal Kazan, Polona Oblak and Toma\v{z} Curk
- Abstract summary: We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values.
Tests on unique synthetic data showed that STMF approximation achieves a higher correlation than non-negative matrix factorization.
STMF is the first work that uses tropical semiring on sparse data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Matrix factorization methods are linear models, with limited capability to
model complex relations. In our work, we use tropical semiring to introduce
non-linearity into matrix factorization models. We propose a method called
Sparse Tropical Matrix Factorization (STMF) for the estimation of missing
(unknown) values. We evaluate the efficiency of the STMF method on both
synthetic data and biological data in the form of gene expression measurements
downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique
synthetic data showed that STMF approximation achieves a higher correlation
than non-negative matrix factorization (NMF), which is unable to recover
patterns effectively. On real data, STMF outperforms NMF on six out of nine
gene expression datasets. While NMF assumes normal distribution and tends
toward the mean value, STMF can better fit to extreme values and distributions.
STMF is the first work that uses tropical semiring on sparse data. We show that
in certain cases semirings are useful because they consider the structure,
which is different and simpler to understand than it is with standard linear
algebra.
Related papers
- Coseparable Nonnegative Tensor Factorization With T-CUR Decomposition [2.013220890731494]
Nonnegative Matrix Factorization (NMF) is an important unsupervised learning method to extract meaningful features from data.
In this work, we provide an alternating selection method to select the coseparable core.
The results demonstrate the efficiency of coseparable NTF when compared to coseparable NMF.
arXiv Detail & Related papers (2024-01-30T09:22:37Z) - Large-scale gradient-based training of Mixtures of Factor Analyzers [67.21722742907981]
This article contributes both a theoretical analysis as well as a new method for efficient high-dimensional training by gradient descent.
We prove that MFA training and inference/sampling can be performed based on precision matrices, which does not require matrix inversions after training is completed.
Besides the theoretical analysis and matrices, we apply MFA to typical image datasets such as SVHN and MNIST, and demonstrate the ability to perform sample generation and outlier detection.
arXiv Detail & Related papers (2023-08-26T06:12:33Z) - Robust Bayesian Nonnegative Matrix Factorization with Implicit
Regularizers [4.913248451323163]
We introduce a probabilistic model with implicit norm regularization for learning nonnegative matrix factorization (NMF)
We evaluate the model on several real-world datasets including Genomics of Drug Sensitivity in Cancer.
arXiv Detail & Related papers (2022-08-22T04:34:17Z) - FastSTMF: Efficient tropical matrix factorization algorithm for sparse
data [0.0]
Matrix factorization, one of the most popular methods in machine learning, has recently benefited from introducing non-linearity in prediction tasks using tropical semiring.
In our work, we propose a new method FastSTMF based on Sparse Tropical Matrix Factorization (STMF)
We evaluate FastSTMF on synthetic and real gene expression data from the TCGA database, and the results show that FastSTMF outperforms STMF in both accuracy and running time.
arXiv Detail & Related papers (2022-05-13T13:13:06Z) - Co-Separable Nonnegative Matrix Factorization [20.550794776914508]
Nonnegative matrix factorization (NMF) is a popular model in the field of pattern recognition.
We refer to this NMF as a Co-Separable NMF (CoS-NMF)
An optimization model for CoS-NMF is proposed and alternated fast gradient method is employed to solve the model.
arXiv Detail & Related papers (2021-09-02T07:05:04Z) - Feature Weighted Non-negative Matrix Factorization [92.45013716097753]
We propose the Feature weighted Non-negative Matrix Factorization (FNMF) in this paper.
FNMF learns the weights of features adaptively according to their importances.
It can be solved efficiently with the suggested optimization algorithm.
arXiv Detail & Related papers (2021-03-24T21:17:17Z) - Entropy Minimizing Matrix Factorization [102.26446204624885]
Nonnegative Matrix Factorization (NMF) is a widely-used data analysis technique, and has yielded impressive results in many real-world tasks.
In this study, an Entropy Minimizing Matrix Factorization framework (EMMF) is developed to tackle the above problem.
Considering that the outliers are usually much less than the normal samples, a new entropy loss function is established for matrix factorization.
arXiv Detail & Related papers (2021-03-24T21:08:43Z) - Self-supervised Symmetric Nonnegative Matrix Factorization [82.59905231819685]
Symmetric nonnegative factor matrix (SNMF) has demonstrated to be a powerful method for data clustering.
Inspired by ensemble clustering that aims to seek better clustering results, we propose self-supervised SNMF (S$3$NMF)
We take advantage of the sensitivity to code characteristic of SNMF, without relying on any additional information.
arXiv Detail & Related papers (2021-03-02T12:47:40Z) - Understanding Implicit Regularization in Over-Parameterized Single Index
Model [55.41685740015095]
We design regularization-free algorithms for the high-dimensional single index model.
We provide theoretical guarantees for the induced implicit regularization phenomenon.
arXiv Detail & Related papers (2020-07-16T13:27:47Z) - Slice Sampling for General Completely Random Measures [74.24975039689893]
We present a novel Markov chain Monte Carlo algorithm for posterior inference that adaptively sets the truncation level using auxiliary slice variables.
The efficacy of the proposed algorithm is evaluated on several popular nonparametric models.
arXiv Detail & Related papers (2020-06-24T17:53:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.