Random Persistence Diagram Generation
- URL: http://arxiv.org/abs/2104.07737v1
- Date: Thu, 15 Apr 2021 19:33:01 GMT
- Title: Random Persistence Diagram Generation
- Authors: Farzana Nasrin, Theodore Papamarkou, and Vasileios Maroulas
- Abstract summary: Topological data analysis (TDA) studies the shape patterns of data.
Persistent homology (PH) is a widely used method in TDA that summarizes homological features of data at multiple scales and stores this in persistence diagrams (PDs)
We propose random persistence diagram generation (RPDG), a method that generates a sequence of random PDs from the ones produced by the data.
- Score: 4.435094091999926
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Topological data analysis (TDA) studies the shape patterns of data.
Persistent homology (PH) is a widely used method in TDA that summarizes
homological features of data at multiple scales and stores this in persistence
diagrams (PDs). As TDA is commonly used in the analysis of high dimensional
data sets, a sufficiently large amount of PDs that allow performing statistical
analysis is typically unavailable or requires inordinate computational
resources. In this paper, we propose random persistence diagram generation
(RPDG), a method that generates a sequence of random PDs from the ones produced
by the data. RPDG is underpinned (i) by a parametric model based on pairwise
interacting point processes for inference of persistence diagrams and (ii) by a
reversible jump Markov chain Monte Carlo (RJ-MCMC) algorithm for generating
samples of PDs. The parametric model combines a Dirichlet partition to capture
spatial homogeneity of the location of points in a PD and a step function to
capture the pairwise interaction between them. The RJ-MCMC algorithm
incorporates trans-dimensional addition and removal of points and
same-dimensional relocation of points across samples of PDs. The efficacy of
RPDG is demonstrated via an example and a detailed comparison with other
existing methods is presented.
Related papers
- Hyperboloid GPLVM for Discovering Continuous Hierarchies via Nonparametric Estimation [41.13597666007784]
Dimensionality reduction (DR) offers a useful representation of complex high-dimensional data.
Recent DR methods focus on hyperbolic geometry to derive a faithful low-dimensional representation of hierarchical data.
This paper presents hGP-LVMs to embed high-dimensional hierarchical data with implicit continuity via nonparametric estimation.
arXiv Detail & Related papers (2024-10-22T05:07:30Z) - Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis [56.442307356162864]
We study the theoretical aspects of score-based discrete diffusion models under the Continuous Time Markov Chain (CTMC) framework.
We introduce a discrete-time sampling algorithm in the general state space $[S]d$ that utilizes score estimators at predefined time points.
Our convergence analysis employs a Girsanov-based method and establishes key properties of the discrete score function.
arXiv Detail & Related papers (2024-10-03T09:07:13Z) - Multi-Source and Test-Time Domain Adaptation on Multivariate Signals using Spatio-Temporal Monge Alignment [59.75420353684495]
Machine learning applications on signals such as computer vision or biomedical data often face challenges due to the variability that exists across hardware devices or session recordings.
In this work, we propose Spatio-Temporal Monge Alignment (STMA) to mitigate these variabilities.
We show that STMA leads to significant and consistent performance gains between datasets acquired with very different settings.
arXiv Detail & Related papers (2024-07-19T13:33:38Z) - On the Trajectory Regularity of ODE-based Diffusion Sampling [79.17334230868693]
Diffusion-based generative models use differential equations to establish a smooth connection between a complex data distribution and a tractable prior distribution.
In this paper, we identify several intriguing trajectory properties in the ODE-based sampling process of diffusion models.
arXiv Detail & Related papers (2024-05-18T15:59:41Z) - Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data [5.79648227233365]
In economy, politics, and biology, observation data points in the time series are often independently obtained.
Traditional methods for parameter estimation in differential equations have limitations in estimating the shape of parameter distributions.
We introduce a novel method, Estimation of.
EPD, providing accurate distribution of parameters without loss of data information.
arXiv Detail & Related papers (2024-04-23T10:01:43Z) - Heterogeneous Multi-Task Gaussian Cox Processes [61.67344039414193]
We present a novel extension of multi-task Gaussian Cox processes for modeling heterogeneous correlated tasks jointly.
A MOGP prior over the parameters of the dedicated likelihoods for classification, regression and point process tasks can facilitate sharing of information between heterogeneous tasks.
We derive a mean-field approximation to realize closed-form iterative updates for estimating model parameters.
arXiv Detail & Related papers (2023-08-29T15:01:01Z) - Dynamic Mode Decomposition for data-driven analysis and reduced-order
modelling of ExB plasmas: I. Extraction of spatiotemporally coherent patterns [3.203036813451742]
We evaluate the generalability of the Dynamic Mode Decomposition (DMD) algorithm for data-driven analysis and reduced-order modelling of plasma dynamics.
arXiv Detail & Related papers (2023-08-26T01:37:52Z) - Multi-Linear Kernel Regression and Imputation in Data Manifolds [12.15802365851407]
This paper introduces an efficient multi-linear nonparametric approximation framework for data regression and imputation, and its application to dynamic magnetic-resonance imaging (dMRI)
Data features are assumed to reside in or close to a smooth manifold embedded in a kernel reproducing Hilbert space. Landmark points are identified to describe the point cloud of features by linear approximating patches which mimic the concept of tangent spaces to smooth.
The multi-linear model effects dimensionality reduction, enables efficient computations, and extracts data patterns and their geometry without any training data or additional information.
arXiv Detail & Related papers (2023-04-06T12:58:52Z) - Probabilistic Registration for Gaussian Process 3D shape modelling in
the presence of extensive missing data [63.8376359764052]
We propose a shape fitting/registration method based on a Gaussian Processes formulation, suitable for shapes with extensive regions of missing data.
Experiments are conducted both for a 2D small dataset with diverse transformations and a 3D dataset of ears.
arXiv Detail & Related papers (2022-03-26T16:48:27Z) - Improving Metric Dimensionality Reduction with Distributed Topology [68.8204255655161]
DIPOLE is a dimensionality-reduction post-processing step that corrects an initial embedding by minimizing a loss functional with both a local, metric term and a global, topological term.
We observe that DIPOLE outperforms popular methods like UMAP, t-SNE, and Isomap on a number of popular datasets.
arXiv Detail & Related papers (2021-06-14T17:19:44Z) - Estimation and Quantization of Expected Persistence Diagrams [0.0]
We study two such summaries, the Persistence Diagram (EPD) and its quantization.
EPD is a measure supported on R 2.
We prove that this estimator is optimal from a minimax standpoint on a large class of models with a parametric rate of convergence.
arXiv Detail & Related papers (2021-05-11T08:12:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.