Topological data analysis and machine learning
- URL: http://arxiv.org/abs/2206.15075v3
- Date: Tue, 25 Jul 2023 13:47:05 GMT
- Title: Topological data analysis and machine learning
- Authors: Daniel Leykam and Dimitris G. Angelakis
- Abstract summary: Topological data analysis refers to approaches for systematically and reliably computing abstract shapes'' of complex data sets.
There are various applications of topological data analysis in life and data sciences, with growing interest among physicists.
We present a concise yet (we hope) comprehensive review of applications of topological data analysis to physics and machine learning problems in physics.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Topological data analysis refers to approaches for systematically and
reliably computing abstract ``shapes'' of complex data sets. There are various
applications of topological data analysis in life and data sciences, with
growing interest among physicists. We present a concise yet (we hope)
comprehensive review of applications of topological data analysis to physics
and machine learning problems in physics including the detection of phase
transitions. We finish with a preview of anticipated directions for future
research.
Related papers
- Ontology Embedding: A Survey of Methods, Applications and Resources [54.3453925775069]
Ontologies are widely used for representing domain knowledge and meta data.
One straightforward solution is to integrate statistical analysis and machine learning.
Numerous papers have been published on embedding, but a lack of systematic reviews hinders researchers from gaining a comprehensive understanding of this field.
arXiv Detail & Related papers (2024-06-16T14:49:19Z) - Machine Learning and Data Analysis Using Posets: A Survey [0.0]
Posets are discrete mathematical structures ubiquitous in a broad range of data analysis and machine learning applications.
Research connecting posets to the data science domain has been ongoing for many years.
arXiv Detail & Related papers (2024-04-03T21:47:44Z) - Linguistics from a topological viewpoint [2.4238741865874363]
In this paper, we describe a workflow to analyze the topological shapes of South American languages.
As a result, it is difficult to have a clear visualization of the data.
arXiv Detail & Related papers (2024-03-16T23:10:42Z) - Discovering Interpretable Physical Models using Symbolic Regression and
Discrete Exterior Calculus [55.2480439325792]
We propose a framework that combines Symbolic Regression (SR) and Discrete Exterior Calculus (DEC) for the automated discovery of physical models.
DEC provides building blocks for the discrete analogue of field theories, which are beyond the state-of-the-art applications of SR to physical problems.
We prove the effectiveness of our methodology by re-discovering three models of Continuum Physics from synthetic experimental data.
arXiv Detail & Related papers (2023-10-10T13:23:05Z) - An analysis of Universal Differential Equations for data-driven
discovery of Ordinary Differential Equations [7.48176340790825]
We make a contribution by testing the UDE framework in the context of Ordinary Differential Equations (ODEs) discovery.
We highlight some of the issues arising when combining data-driven approaches and numerical solvers.
We believe that our analysis represents a significant contribution in investigating the capabilities and limitations of Physics-informed Machine Learning frameworks.
arXiv Detail & Related papers (2023-06-17T12:26:50Z) - Shining light on data: Geometric data analysis through quantum dynamics [14.315501760755604]
We show how fine-scale features of a structure in data can be extracted from approximations to quantum mechanical processes given by data-driven graph Laplacians and localized wavepackets.
This data-driven quantization procedure leads to a novel, yet natural uncertainty principle for data analysis induced by limited data.
We illustrate the new approach with algorithms and several applications to real-world data, including the learning of patterns and anomalies in social distancing and mobility behavior during the COVID-19 pandemic.
arXiv Detail & Related papers (2022-12-01T17:38:01Z) - Towards a mathematical understanding of learning from few examples with
nonlinear feature maps [68.8204255655161]
We consider the problem of data classification where the training set consists of just a few data points.
We reveal key relationships between the geometry of an AI model's feature space, the structure of the underlying data distributions, and the model's generalisation capabilities.
arXiv Detail & Related papers (2022-11-07T14:52:58Z) - On topological data analysis for structural dynamics: an introduction to
persistent homology [0.0]
Topological data analysis is a method of quantifying the shape of data over a range of length scales.
Persistent homology is a method of quantifying the shape of data over a range of length scales.
arXiv Detail & Related papers (2022-09-12T10:39:38Z) - Analytical Modelling of Exoplanet Transit Specroscopy with Dimensional
Analysis and Symbolic Regression [68.8204255655161]
The deep learning revolution has opened the door for deriving such analytical results directly with a computer algorithm fitting to the data.
We successfully demonstrate the use of symbolic regression on synthetic data for the transit radii of generic hot Jupiter exoplanets.
As a preprocessing step, we use dimensional analysis to identify the relevant dimensionless combinations of variables.
arXiv Detail & Related papers (2021-12-22T00:52:56Z) - Unsupervised machine learning of topological phase transitions from
experimental data [52.77024349608834]
We apply unsupervised machine learning techniques to experimental data from ultracold atoms.
We obtain the topological phase diagram of the Haldane model in a completely unbiased fashion.
Our work provides a benchmark for unsupervised detection of new exotic phases in complex many-body systems.
arXiv Detail & Related papers (2021-01-14T16:38:21Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.