Neural-Network-Directed Genetic Programmer for Discovery of Governing
Equations
- URL: http://arxiv.org/abs/2203.08808v1
- Date: Tue, 15 Mar 2022 21:28:05 GMT
- Title: Neural-Network-Directed Genetic Programmer for Discovery of Governing
Equations
- Authors: Shahab Razavi, Eric R. Gamazon
- Abstract summary: faiGP is designed to leverage the properties of a function algebra that have been encoded into a grammar.
We quantify the impact of different types of regularizers, including a diversity metric adapted from studies of the transcriptome.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We develop a symbolic regression framework for extracting the governing
mathematical expressions from observed data. The evolutionary approach, faiGP,
is designed to leverage the properties of a function algebra that have been
encoded into a grammar, providing a theoretical guarantee of universal
approximation and a way to minimize bloat. In this framework, the choice of
operators of the grammar may be informed by a physical theory or symmetry
considerations. Since there is currently no theory that can derive the
'constants of nature', an empirical investigation on extracting these
coefficients from an evolutionary process is of methodological interest. We
quantify the impact of different types of regularizers, including a diversity
metric adapted from studies of the transcriptome and a complexity measure, on
the performance of the framework. Our implementation, which leverages neural
networks and a genetic programmer, generates non-trivial symbolically
equivalent expressions ("Ramanujan expressions") or approximations with
potentially interesting numerical applications. To illustrate the framework, a
model of ligand-receptor binding kinetics, including an account of gene
regulation by transcription factors, and a model of the regulatory range of the
cistrome from omics data are presented. This study has important implications
on the development of data-driven methodologies for the discovery of governing
equations in experimental data derived from new sensing systems and
high-throughput screening technologies.
Related papers
- Causal Representation Learning from Multimodal Biological Observations [57.00712157758845]
We aim to develop flexible identification conditions for multimodal data.
We establish identifiability guarantees for each latent component, extending the subspace identification results from prior work.
Our key theoretical ingredient is the structural sparsity of the causal connections among distinct modalities.
arXiv Detail & Related papers (2024-11-10T16:40:27Z) - Network-based Neighborhood regression [0.0]
Current statistical analysis on biological modules focuses on either detecting the functional modules in biological networks or sub-group regression on the biological features without using the network data.
This paper proposes a novel network-based neighborhood regression framework whose regression functions depend on both the global community-level information and local connectivity structures among entities.
arXiv Detail & Related papers (2024-07-04T18:08:40Z) - Semantically Rich Local Dataset Generation for Explainable AI in Genomics [0.716879432974126]
Black box deep learning models trained on genomic sequences excel at predicting the outcomes of different gene regulatory mechanisms.
We propose using Genetic Programming to generate datasets by evolving perturbations in sequences that contribute to their semantic diversity.
arXiv Detail & Related papers (2024-07-03T10:31:30Z) - Nonlinear classification of neural manifolds with contextual information [6.292933471495322]
manifold capacity has emerged as a promising framework linking population geometry to the separability of neural manifold.
We propose a theoretical framework that overcomes this limitation by leveraging contextual input information.
Our framework's increased expressivity captures representation untanglement in deep networks at early stages of the layer hierarchy, previously inaccessible to analysis.
arXiv Detail & Related papers (2024-05-10T23:37:31Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Discovering Interpretable Physical Models using Symbolic Regression and
Discrete Exterior Calculus [55.2480439325792]
We propose a framework that combines Symbolic Regression (SR) and Discrete Exterior Calculus (DEC) for the automated discovery of physical models.
DEC provides building blocks for the discrete analogue of field theories, which are beyond the state-of-the-art applications of SR to physical problems.
We prove the effectiveness of our methodology by re-discovering three models of Continuum Physics from synthetic experimental data.
arXiv Detail & Related papers (2023-10-10T13:23:05Z) - A Generative Modeling Framework for Inferring Families of Biomechanical
Constitutive Laws in Data-Sparse Regimes [0.15658704610960567]
We propose a novel approach to efficiently infer families of relationships in data-sparse regimes.
Inspired by the concept of functional priors, we develop a generative network (GAN) that incorporates a neural operator as the generator and a fully-connected network as the adversarial discriminator.
arXiv Detail & Related papers (2023-05-04T22:07:27Z) - Controllable Neural Symbolic Regression [10.128755371375572]
In symbolic regression, the goal is to find an analytical expression that fits experimental data with the minimal use of mathematical symbols.
We propose a novel neural symbolic regression method, named Neural Symbolic Regression with Hypothesis (NSRwH)
Our experiments demonstrate that the proposed conditioned deep learning model outperforms its unconditioned counterparts in terms of accuracy.
arXiv Detail & Related papers (2023-04-20T14:20:48Z) - Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
We show how fully-connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs.
By carefully designing data models, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs.
arXiv Detail & Related papers (2022-02-01T17:11:13Z) - Fractal Structure and Generalization Properties of Stochastic
Optimization Algorithms [71.62575565990502]
We prove that the generalization error of an optimization algorithm can be bounded on the complexity' of the fractal structure that underlies its generalization measure.
We further specialize our results to specific problems (e.g., linear/logistic regression, one hidden/layered neural networks) and algorithms.
arXiv Detail & Related papers (2021-06-09T08:05:36Z) - Complexity-based speciation and genotype representation for
neuroevolution [81.21462458089142]
This paper introduces a speciation principle for neuroevolution where evolving networks are grouped into species based on the number of hidden neurons.
The proposed speciation principle is employed in several techniques designed to promote and preserve diversity within species and in the ecosystem as a whole.
arXiv Detail & Related papers (2020-10-11T06:26:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.