ChemAlgebra: Algebraic Reasoning on Chemical Reactions
- URL: http://arxiv.org/abs/2210.02095v1
- Date: Wed, 5 Oct 2022 08:34:44 GMT
- Title: ChemAlgebra: Algebraic Reasoning on Chemical Reactions
- Authors: Andrea Valenti, Davide Bacciu, Antonio Vergari
- Abstract summary: It is unclear whether deep learning models have the ability to tackle reasoning tasks.
ChemAlgebra is a benchmark for measuring the reasoning capabilities of deep learning models.
- Score: 16.93639996082923
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While showing impressive performance on various kinds of learning tasks, it
is yet unclear whether deep learning models have the ability to robustly tackle
reasoning tasks. than by learning the underlying reasoning process that is
actually required to solve the tasks. Measuring the robustness of reasoning in
machine learning models is challenging as one needs to provide a task that
cannot be easily shortcut by exploiting spurious statistical correlations in
the data, while operating on complex objects and constraints. reasoning task.
To address this issue, we propose ChemAlgebra, a benchmark for measuring the
reasoning capabilities of deep learning models through the prediction of
stoichiometrically-balanced chemical reactions. ChemAlgebra requires
manipulating sets of complex discrete objects -- molecules represented as
formulas or graphs -- under algebraic constraints such as the mass preservation
principle. We believe that ChemAlgebra can serve as a useful test bed for the
next generation of machine reasoning models and as a promoter of their
development.
Related papers
- QCBench: Evaluating Large Language Models on Domain-Specific Quantitative Chemistry [12.18966912295507]
QCBench is a benchmark comprising 350 computational chemistry problems across 7 chemistry subfields.<n>Each problem focuses on pure calculations rooted in real-world chemical vertical fields.<n> Evaluations on 19 LLMs demonstrate a consistent performance degradation with increasing task complexity.
arXiv Detail & Related papers (2025-08-03T08:55:42Z) - ChemGraph: An Agentic Framework for Computational Chemistry Workflows [0.0]
ChemGraph is an agentic framework powered by artificial intelligence and state-of-the-art simulation tools.<n>Users can perform tasks such as molecular structure generation, single-point energy, geometry optimization, vibrational analysis, and thermochemistry calculations.
arXiv Detail & Related papers (2025-06-03T21:11:56Z) - Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations [43.623140005091535]
We introduce ChemCoTBench, a reasoning framework that bridges molecular structure understanding with arithmetic-inspired operations.<n>ChemCoTBench formalizes chemical problem-solving into transparent, step-by-step reasoning.<n>We evaluate models on two high-impact tasks: Molecular Property Optimization and Chemical Reaction Prediction.
arXiv Detail & Related papers (2025-05-27T15:15:44Z) - ChemKANs for Combustion Chemistry Modeling and Acceleration [0.0]
Machine learning techniques have been proposed to streamline chemical kinetic model inference.
ChemKAN can accurately represent hydrogen combustion chemistry, providing a 2x acceleration over the detailed chemistry in a solver.
These demonstrations indicate potential for ChemKANs in combustion physics and chemical kinetics.
arXiv Detail & Related papers (2025-04-17T01:53:28Z) - ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning [64.2106664137118]
ChemAgent is a novel framework designed to improve the performance of large language models (LLMs)
It is developed by decomposing chemical tasks into sub-tasks and compiling these sub-tasks into a structured collection that can be referenced for future queries.
When presented with a new problem, ChemAgent retrieves and refines pertinent information from the library, which we call memory.
arXiv Detail & Related papers (2025-01-11T17:10:30Z) - Constructing and explaining machine learning models for chemistry: example of the exploration and design of boron-based Lewis acids [0.0]
We leverage explainable AI techniques to explore the rational design of boron-based Lewis acids.
Lewis acids play a pivotal role in organic reactions due to their electron-ccepting properties.
By constraining the chemical space to well-defined molecular scaffolds, we achieved highly accurate predictions.
arXiv Detail & Related papers (2025-01-02T23:47:54Z) - A Gaussian Process Model for Ordinal Data with Applications to Chemoinformatics [0.0]
We present conditional Gaussian process models to predict ordinal outcomes from chemical experiments.
A novel aspect of our model is that the kernel contains a scaling parameter, that controls the strength of the correlation between elements of the chemical space.
We present a genetic algorithm for the facilitation of chemical discovery and identification of important features to the compound's efficacy.
arXiv Detail & Related papers (2024-05-16T11:18:32Z) - Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis [57.70772230913099]
Chemist-X automates the reaction condition recommendation (RCR) task in chemical synthesis with retrieval-augmented generation (RAG) technology.
Chemist-X interrogates online molecular databases and distills critical data from the latest literature database.
Chemist-X considerably reduces chemists' workload and allows them to focus on more fundamental and creative problems.
arXiv Detail & Related papers (2023-11-16T01:21:33Z) - Machine learning for accuracy in density functional approximations [0.0]
Recent progress in applying machine learning to improve the accuracy of density functional approximations is reviewed.
Promises and challenges in devising machine learning models transferable between different chemistries and materials classes are discussed.
arXiv Detail & Related papers (2023-11-01T00:02:09Z) - Discovering Interpretable Physical Models using Symbolic Regression and
Discrete Exterior Calculus [55.2480439325792]
We propose a framework that combines Symbolic Regression (SR) and Discrete Exterior Calculus (DEC) for the automated discovery of physical models.
DEC provides building blocks for the discrete analogue of field theories, which are beyond the state-of-the-art applications of SR to physical problems.
We prove the effectiveness of our methodology by re-discovering three models of Continuum Physics from synthetic experimental data.
arXiv Detail & Related papers (2023-10-10T13:23:05Z) - Generative Models as a Complex Systems Science: How can we make sense of
large language model behavior? [75.79305790453654]
Coaxing out desired behavior from pretrained models, while avoiding undesirable ones, has redefined NLP.
We argue for a systematic effort to decompose language model behavior into categories that explain cross-task performance.
arXiv Detail & Related papers (2023-07-31T22:58:41Z) - Learning Homogenization for Elliptic Operators [5.151892549395954]
Multiscale partial differential equations (PDEs) arise in various applications, and several schemes have been developed to solve them efficiently.
Homogenization theory is a powerful methodology that eliminates the small-scale dependence, resulting in simplified equations that are tractable.
This paper investigates the learnability of homogenized laws for elliptic operators in the presence of such complexities.
arXiv Detail & Related papers (2023-06-21T04:05:10Z) - Faith and Fate: Limits of Transformers on Compositionality [109.79516190693415]
We investigate the limits of transformer large language models across three representative compositional tasks.
These tasks require breaking problems down into sub-steps and synthesizing these steps into a precise answer.
Our empirical findings suggest that transformer LLMs solve compositional tasks by reducing multi-step compositional reasoning into linearized subgraph matching.
arXiv Detail & Related papers (2023-05-29T23:24:14Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Improving Molecular Representation Learning with Metric
Learning-enhanced Optimal Transport [49.237577649802034]
We develop a novel optimal transport-based algorithm termed MROT to enhance their generalization capability for molecular regression problems.
MROT significantly outperforms state-of-the-art models, showing promising potential in accelerating the discovery of new substances.
arXiv Detail & Related papers (2022-02-13T04:56:18Z) - Learning Transport Processes with Machine Intelligence [0.0]
We present a machine learning based approach to address the study of transport processes.
Our model is capable of learning latent representations of the transport process substantially closer to the ground truth than expected.
arXiv Detail & Related papers (2021-09-27T14:49:22Z) - Do Large Scale Molecular Language Representations Capture Important
Structural Information? [31.76876206167457]
We present molecular embeddings obtained by training an efficient transformer encoder model, referred to as MoLFormer.
Experiments show that the learned molecular representation performs competitively, when compared to graph-based and fingerprint-based supervised learning baselines.
arXiv Detail & Related papers (2021-06-17T14:33:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.