Racing Control Variable Genetic Programming for Symbolic Regression
- URL: http://arxiv.org/abs/2309.07934v1
- Date: Wed, 13 Sep 2023 21:38:06 GMT
- Title: Racing Control Variable Genetic Programming for Symbolic Regression
- Authors: Nan Jiang, Yexiang Xue
- Abstract summary: Symbolic regression is one of the most crucial tasks in AI for science.
We propose Racing Control Variable Genetic Programming (Racing-CVGP), which carries out multiple experiment schedules simultaneously.
A selection scheme similar to that used in selecting good symbolic equations in the genetic programming process is implemented to ensure that promising experiment schedules eventually win over the average ones.
- Score: 22.101494818629963
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Symbolic regression, as one of the most crucial tasks in AI for science,
discovers governing equations from experimental data. Popular approaches based
on genetic programming, Monte Carlo tree search, or deep reinforcement learning
learn symbolic regression from a fixed dataset. They require massive datasets
and long training time especially when learning complex equations involving
many variables. Recently, Control Variable Genetic Programming (CVGP) has been
introduced which accelerates the regression process by discovering equations
from designed control variable experiments. However, the set of experiments is
fixed a-priori in CVGP and we observe that sub-optimal selection of experiment
schedules delay the discovery process significantly. To overcome this
limitation, we propose Racing Control Variable Genetic Programming
(Racing-CVGP), which carries out multiple experiment schedules simultaneously.
A selection scheme similar to that used in selecting good symbolic equations in
the genetic programming process is implemented to ensure that promising
experiment schedules eventually win over the average ones. The unfavorable
schedules are terminated early to save time for the promising ones. We evaluate
Racing-CVGP on several synthetic and real-world datasets corresponding to true
physics laws. We demonstrate that Racing-CVGP outperforms CVGP and a series of
symbolic regressors which discover equations from fixed datasets.
Related papers
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search [29.392036559507755]
Symbolic regression is a problem of learning a symbolic expression from numerical data.
Deep neural models trained on procedurally-generated synthetic datasets showed competitive performance.
We propose a novel method which provides the best of both worlds, based on a Monte-Carlo Tree Search procedure.
arXiv Detail & Related papers (2023-02-22T09:10:20Z) - Fast variable selection makes scalable Gaussian process BSS-ANOVA a
speedy and accurate choice for tabular and time series regression [0.0]
Gaussian processes (GPs) are non-parametric regression engines with a long history.
One of a number of scalable GP approaches is the Karhunen-Lo'eve (KL) decomposed kernel BSS-ANOVA, developed in 2009.
A new method of forward variable selection, quickly and effectively limits the number of terms, yielding a method with competitive accuracies.
arXiv Detail & Related papers (2022-05-26T23:41:43Z) - Taylor Genetic Programming for Symbolic Regression [5.371028373792346]
Genetic programming (GP) is a commonly used approach to solve symbolic regression (SR) problems.
We propose Taylor genetic programming (TaylorGP) to approximate the symbolic equation that fits the dataset.
TaylorGP not only has higher accuracy than the nine baseline methods, but also is faster in finding stable results.
arXiv Detail & Related papers (2022-04-28T13:43:39Z) - Scaling Gaussian Process Optimization by Evaluating a Few Unique
Candidates Multiple Times [119.41129787351092]
We show that sequential black-box optimization based on GPs can be made efficient by sticking to a candidate solution for multiple evaluation steps.
We modify two well-established GP-Opt algorithms, GP-UCB and GP-EI to adapt rules from batched GP-Opt.
arXiv Detail & Related papers (2022-01-30T20:42:14Z) - Non-Gaussian Gaussian Processes for Few-Shot Regression [71.33730039795921]
We propose an invertible ODE-based mapping that operates on each component of the random variable vectors and shares the parameters across all of them.
NGGPs outperform the competing state-of-the-art approaches on a diversified set of benchmarks and applications.
arXiv Detail & Related papers (2021-10-26T10:45:25Z) - Incremental Ensemble Gaussian Processes [53.3291389385672]
We propose an incremental ensemble (IE-) GP framework, where an EGP meta-learner employs an it ensemble of GP learners, each having a unique kernel belonging to a prescribed kernel dictionary.
With each GP expert leveraging the random feature-based approximation to perform online prediction and model update with it scalability, the EGP meta-learner capitalizes on data-adaptive weights to synthesize the per-expert predictions.
The novel IE-GP is generalized to accommodate time-varying functions by modeling structured dynamics at the EGP meta-learner and within each GP learner.
arXiv Detail & Related papers (2021-10-13T15:11:25Z) - SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale
Artificial Life Applications [62.997667081978825]
Event-driven genetic programming representations have been shown to outperform traditional imperative representations on interaction-intensive problems.
Event-driven approach organizes genome content into modules that are triggered in response to environmental signals.
SignalGP library caters to traditional program synthesis applications.
arXiv Detail & Related papers (2021-08-01T07:20:49Z) - Deep Transformer Networks for Time Series Classification: The NPP Safety
Case [59.20947681019466]
An advanced temporal neural network referred to as the Transformer is used within a supervised learning fashion to model the time-dependent NPP simulation data.
The Transformer can learn the characteristics of the sequential data and yield promising performance with approximately 99% classification accuracy on the testing dataset.
arXiv Detail & Related papers (2021-04-09T14:26:25Z) - Zoetrope Genetic Programming for Regression [2.642406403099596]
The Zoetrope Genetic Programming (ZGP) algorithm is based on an original representation for mathematical expressions.
ZGP is validated using a large number of public domain regression datasets.
arXiv Detail & Related papers (2021-02-26T10:47:10Z) - Robust Gaussian Process Regression Based on Iterative Trimming [6.912744078749024]
This paper presents a new robust GP regression algorithm that iteratively trims the most extreme data points.
It can greatly improve the model accuracy for contaminated data even in the presence of extreme or abundant outliers.
As a practical example in the astrophysical study, we show that this method can precisely determine the main-sequence ridge line in the color-magnitude diagram of star clusters.
arXiv Detail & Related papers (2020-11-22T16:43:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.