Geomstats: A Python Package for Riemannian Geometry in Machine Learning
- URL: http://arxiv.org/abs/2004.04667v1
- Date: Tue, 7 Apr 2020 20:41:50 GMT
- Title: Geomstats: A Python Package for Riemannian Geometry in Machine Learning
- Authors: Nina Miolane, Alice Le Brigant, Johan Mathe, Benjamin Hou, Nicolas
Guigui, Yann Thanwerdas, Stefan Heyder, Olivier Peltre, Niklas Koep, Hadi
Zaatiti, Hatem Hajri, Yann Cabanes, Thomas Gerald, Paul Chauchat, Christian
Shewmake, Bernhard Kainz, Claire Donnat, Susan Holmes, Xavier Pennec
- Abstract summary: We introduce Geomstats, an open-source Python toolbox for computations and statistics on nonlinear equations.
We provide object-oriented and extensively unit-tested implementations.
We show that Geomstats provides reliable building blocks to foster research in differential geometry and statistics.
The source code is freely available under the MIT license at urlgeomstats.ai.
- Score: 5.449970675406181
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce Geomstats, an open-source Python toolbox for computations and
statistics on nonlinear manifolds, such as hyperbolic spaces, spaces of
symmetric positive definite matrices, Lie groups of transformations, and many
more. We provide object-oriented and extensively unit-tested implementations.
Among others, manifolds come equipped with families of Riemannian metrics, with
associated exponential and logarithmic maps, geodesics and parallel transport.
Statistics and learning algorithms provide methods for estimation, clustering
and dimension reduction on manifolds. All associated operations are vectorized
for batch computation and provide support for different execution backends,
namely NumPy, PyTorch and TensorFlow, enabling GPU acceleration. This paper
presents the package, compares it with related libraries and provides relevant
code examples. We show that Geomstats provides reliable building blocks to
foster research in differential geometry and statistics, and to democratize the
use of Riemannian geometry in machine learning applications. The source code is
freely available under the MIT license at \url{geomstats.ai}.
Related papers
- Statistical investigations into the geometry and homology of random programs [0.2302001830524133]
We show how the relation between random Python programs generated from chatGPT can be described geometrically and topologically.
We compare publicly available models: ChatGPT-4 and TinyLlama, on a simple problem related to image processing.
We speculate that our approach may in the future give new insights into the structure of programming languages.
arXiv Detail & Related papers (2024-07-05T20:25:02Z) - YASTN: Yet another symmetric tensor networks; A Python library for abelian symmetric tensor network calculations [0.0]
We present an open-source tensor network Python library for quantum many-body simulations.
At its core is an abelian-symmetric tensor, implemented as a sparse block structure managed by logical layer on top of dense multi-dimensional array backend.
We show the library performance in simulations with infinite projected entangled-pair states, such as finding the ground states with AD, or simulating thermal states of the Hubbard model via imaginary time evolution.
arXiv Detail & Related papers (2024-05-20T17:32:16Z) - Intrinsic Gaussian Process on Unknown Manifolds with Probabilistic
Metrics [5.582101184758529]
This article presents a novel approach to construct Intrinsic Gaussian Processes for regression on unknown manifold with probabilistic metrics in point clouds.
The geometry of manifold is in general different from the usual Euclidean geometry.
The applications of GPUM are illustrated in the simulation studies on the Swiss roll, high dimensional real datasets of WiFi signals and image data examples.
arXiv Detail & Related papers (2023-01-16T17:42:40Z) - Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras
from First Principles [55.41644538483948]
We design a deep-learning algorithm for the discovery and identification of the continuous group of symmetries present in a labeled dataset.
We use fully connected neural networks to model the transformations symmetry and the corresponding generators.
Our study also opens the door for using a machine learning approach in the mathematical study of Lie groups and their properties.
arXiv Detail & Related papers (2023-01-13T16:25:25Z) - Parametric information geometry with the package Geomstats [4.205692673448206]
We introduce the information geometry module of the Python package Geomstats.
The module gives the Fisher-Rao Riemannian geometry of any parametric family of distributions of interest.
Importantly, such capabilities open the door to statistics and machine learning on probability distributions.
arXiv Detail & Related papers (2022-11-21T16:56:45Z) - DADApy: Distance-based Analysis of DAta-manifolds in Python [51.37841707191944]
DADApy is a python software package for analysing and characterising high-dimensional data.
It provides methods for estimating the intrinsic dimension and the probability density, for performing density-based clustering and for comparing different distance metrics.
arXiv Detail & Related papers (2022-05-04T08:41:59Z) - Geometry-aware Bayesian Optimization in Robotics using Riemannian
Mat\'ern Kernels [64.62221198500467]
We show how to implement geometry-aware kernels for Bayesian optimization.
This technique can be used for control parameter tuning, parametric policy adaptation, and structure design in robotics.
arXiv Detail & Related papers (2021-11-02T09:47:22Z) - Scikit-dimension: a Python package for intrinsic dimension estimation [58.8599521537]
This technical note introduces textttscikit-dimension, an open-source Python package for intrinsic dimension estimation.
textttscikit-dimension package provides a uniform implementation of most of the known ID estimators based on scikit-learn application programming interface.
We briefly describe the package and demonstrate its use in a large-scale (more than 500 datasets) benchmarking of methods for ID estimation in real-life and synthetic data.
arXiv Detail & Related papers (2021-09-06T16:46:38Z) - Bayesian Quadrature on Riemannian Data Manifolds [79.71142807798284]
A principled way to model nonlinear geometric structure inherent in data is provided.
However, these operations are typically computationally demanding.
In particular, we focus on Bayesian quadrature (BQ) to numerically compute integrals over normal laws.
We show that by leveraging both prior knowledge and an active exploration scheme, BQ significantly reduces the number of required evaluations.
arXiv Detail & Related papers (2021-02-12T17:38:04Z) - Fewer is More: A Deep Graph Metric Learning Perspective Using Fewer
Proxies [65.92826041406802]
We propose a Proxy-based deep Graph Metric Learning approach from the perspective of graph classification.
Multiple global proxies are leveraged to collectively approximate the original data points for each class.
We design a novel reverse label propagation algorithm, by which the neighbor relationships are adjusted according to ground-truth labels.
arXiv Detail & Related papers (2020-10-26T14:52:42Z) - Tensor Relational Algebra for Machine Learning System Design [7.764107702934616]
We present an alternative implementation abstraction called the relational tensor algebra (TRA)
TRA is a set-based algebra based on the relational algebra.
Our empirical study shows that the optimized TRA-based back-end can significantly outperform alternatives for running ML in distributed clusters.
arXiv Detail & Related papers (2020-09-01T15:51:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.