DISCOVER: A Physics-Informed, GPU-Accelerated Symbolic Regression Framework
- URL: http://arxiv.org/abs/2602.06986v1
- Date: Tue, 27 Jan 2026 16:33:35 GMT
- Title: DISCOVER: A Physics-Informed, GPU-Accelerated Symbolic Regression Framework
- Authors: Udaykumar Gajera, Mohsen Sotoudeh, Kanchan Sarkar, Axel Groß,
- Abstract summary: Symbolic Regression (SR) enables the discovery of interpretable mathematical relationships from experimental and simulation data.<n>This paper introduces DISCOVER, an open-source symbolic regression package developed to address these challenges through a modular, physics-motivated design.<n>The software is intended for applications in computational physics, computational chemistry, and materials science, where interpretability, physical consistency, and execution time are important.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Symbolic Regression (SR) enables the discovery of interpretable mathematical relationships from experimental and simulation data. These relationships are often coined descriptors which are defined as a fundamental materials property that is directly correlated to a desired or undesired functional property of the material. Although established approaches such as Sure Independence Screening and Sparsifying Operator (SISSO) have successfully identified low-dimensional descriptors within large feature spaces many existing SR tools integrate poorly with modern Python workflows, offer limited control over the symbolic search space, or struggle with the computational demands of large-scale studies. This paper introduces DISCOVER (Data-Informed Symbolic Combination of Operators for Variable Equation Regression), an open-source symbolic regression package developed to address these challenges through a modular, physics-motivated design. DISCOVER allows users to guide the symbolic search using domain knowledge, constrain the feature space explicitly, and take advantage of optional GPU acceleration to improve computational efficiency in data-intensive workflows, enabling reproducible and scalable SR workflows. The software is intended for applications in computational physics, computational chemistry, and materials science, where interpretability, physical consistency, and execution time are especially important, and it complements general-purpose SR frameworks by emphasizing the discovery of physically meaningful models.
Related papers
- Introduction to Symbolic Regression in the Physical Sciences [0.41998444721319217]
Symbolic regression has emerged as a powerful method for uncovering interpretable mathematical relationships from data.<n>This article introduces the Special Issue on Symbolic Regression for the Physical Sciences, motivated by the Royal Society discussion meeting held in April 2025.
arXiv Detail & Related papers (2025-12-17T19:32:50Z) - ASP-Assisted Symbolic Regression: Uncovering Hidden Physics in Fluid Mechanics [0.34952465649465553]
This study applies Symbolic Regression to model a fundamental 3D incompressible flow in a rectangular channel.<n>By employing the PySR library, compact symbolic equations were derived directly from numerical simulation data.<n>We propose an innovative approach that integrates SR with the knowledge-representation framework of Answer Set Programming.
arXiv Detail & Related papers (2025-07-22T15:16:20Z) - DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience [14.093206703519103]
DrSR is a framework that combines data-driven insight with reflective learning to enhance both robustness and discovery capability.<n> Experiments across interdisciplinary datasets in physics, chemistry, biology, and materials science demonstrate that DrSR substantially improves the valid equation rate.
arXiv Detail & Related papers (2025-06-04T04:52:34Z) - Interpretable Robotic Friction Learning via Symbolic Regression [52.41267112707149]
Accurately modeling the friction torque in robotic joints has long been challenging due to the request for a robust mathematical description.<n>Traditional model-based approaches are often labor-intensive, requiring extensive experiments and expert knowledge.<n>Data-driven methods based on neural networks are easier to implement but often lack robustness.
arXiv Detail & Related papers (2025-05-19T14:44:02Z) - RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library [58.404895570822184]
RV-Syn is a novel mathematical Synthesis approach.<n>It generates graphs as solutions by combining Python-formatted functions from this library.<n>Based on the constructed graph, we achieve solution-guided logic-aware problem generation.
arXiv Detail & Related papers (2025-04-29T04:42:02Z) - GausSim: Foreseeing Reality by Gaussian Simulator for Elastic Objects [55.02281855589641]
GausSim is a novel neural network-based simulator designed to capture the dynamic behaviors of real-world elastic objects represented through Gaussian kernels.<n>We leverage continuum mechanics and treat each kernel as a Center of Mass System (CMS) that represents continuous piece of matter.<n>In addition, GausSim incorporates explicit physics constraints, such as mass and momentum conservation, ensuring interpretable results and robust, physically plausible simulations.
arXiv Detail & Related papers (2024-12-23T18:58:17Z) - TorchSISSO: A PyTorch-Based Implementation of the Sure Independence Screening and Sparsifying Operator for Efficient and Interpretable Model Discovery [0.0]
Symbolic regression (SR) is a powerful machine learning approach that searches for both the structure and parameters of algebraic models.<n>In this work, we introduce TorchSISSO, a native Python implementation built in the PyTorch framework.<n>We demonstrate that TorchSISSO matches or exceeds the performance of the original SISSO across a range of tasks.
arXiv Detail & Related papers (2024-10-02T17:02:17Z) - Discovering physical laws with parallel symbolic enumeration [67.36739393470869]
We introduce parallel symbolic enumeration (PSE) to efficiently distill generic mathematical expressions from limited data.<n>Experiments show that PSE achieves higher accuracy and faster computation compared to the state-of-the-art baseline algorithms.<n> PSE represents an advance in accurate and efficient data-driven discovery of symbolic, interpretable models.
arXiv Detail & Related papers (2024-07-05T10:41:15Z) - The Role of Foundation Models in Neuro-Symbolic Learning and Reasoning [54.56905063752427]
Neuro-Symbolic AI (NeSy) holds promise to ensure the safe deployment of AI systems.
Existing pipelines that train the neural and symbolic components sequentially require extensive labelling.
New architecture, NeSyGPT, fine-tunes a vision-language foundation model to extract symbolic features from raw data.
arXiv Detail & Related papers (2024-02-02T20:33:14Z) - Deep Generative Symbolic Regression [83.04219479605801]
Symbolic regression aims to discover concise closed-form mathematical equations from data.
Existing methods, ranging from search to reinforcement learning, fail to scale with the number of input variables.
We propose an instantiation of our framework, Deep Generative Symbolic Regression.
arXiv Detail & Related papers (2023-12-30T17:05:31Z) - Discovering Interpretable Physical Models using Symbolic Regression and
Discrete Exterior Calculus [55.2480439325792]
We propose a framework that combines Symbolic Regression (SR) and Discrete Exterior Calculus (DEC) for the automated discovery of physical models.
DEC provides building blocks for the discrete analogue of field theories, which are beyond the state-of-the-art applications of SR to physical problems.
We prove the effectiveness of our methodology by re-discovering three models of Continuum Physics from synthetic experimental data.
arXiv Detail & Related papers (2023-10-10T13:23:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.