Related papers: Modeling non-uniform uncertainty in Reaction Prediction via Boosting and Dropout

Modeling non-uniform uncertainty in Reaction Prediction via Boosting and Dropout

URL: http://arxiv.org/abs/2310.04674v1
Date: Sat, 7 Oct 2023 03:18:26 GMT
Title: Modeling non-uniform uncertainty in Reaction Prediction via Boosting and Dropout
Authors: Taicheng Guo, Changsheng Ma, Xiuying Chen, Bozhao Nan, Kehan Guo, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang
Abstract summary: Variational Autoencoder(VAE) framework has typically been employed to tackle challenges in reaction prediction. We introduce randomness into product generation via boosting to ensemble diverse models and cover the range of potential outcomes. We design a ranking method to union the predictions from boosting and dropout, prioritizing the most plausible products.
Score: 44.5946975612778
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Reaction prediction has been recognized as a critical task in synthetic chemistry, where the goal is to predict the outcome of a reaction based on the given reactants. With the widespread adoption of generative models, the Variational Autoencoder(VAE) framework has typically been employed to tackle challenges in reaction prediction, where the reactants are encoded as a condition for the decoder, which then generates the product. Despite effectiveness, these conditional VAE (CVAE) models still fail to adequately account for the inherent uncertainty in reaction prediction, which primarily stems from the stochastic reaction process. The principal limitations are twofold. Firstly, in these CVAE models, the prior is independent of the reactants, leading to a default wide and assumed uniform distribution variance of the generated product. Secondly, reactants with analogous molecular representations are presumed to undergo similar electronic transition processes, thereby producing similar products. This hinders the ability to model diverse reaction mechanisms effectively. Since the variance in outcomes is inherently non-uniform, we are thus motivated to develop a framework that generates reaction products with non-uniform uncertainty. Firstly, we eliminate the latent variable in previous CVAE models to mitigate uncontrol-label noise. Instead, we introduce randomness into product generation via boosting to ensemble diverse models and cover the range of potential outcomes, and through dropout to secure models with minor variations. Additionally, we design a ranking method to union the predictions from boosting and dropout, prioritizing the most plausible products. Experimental results on the largest reaction prediction benchmark USPTO-MIT show the superior performance of our proposed method in modeling the non-uniform uncertainty compared to baselines.

Related papers

Predicting Chemical Reaction Outcomes Based on Electron Movements Using Machine Learning [5.191954242696695]
We present Reactron, the first electron-based machine learning model for general reaction prediction. We demonstrate the high predictive performance of Reactron over existing product-only models. With robust performance in both in- and out-of-distribution, Reactron embodies human-like reasoning in chemistry.
arXiv Detail & Related papers (2025-03-13T09:31:51Z)
Electron flow matching for generative reaction mechanism prediction obeying conservation laws [8.136277960071032]
This work recasts the problem of reaction prediction as a problem of electron redistribution using the modern deep generative framework of flow matching. Our model, FlowER, overcomes limitations by enforcing exact mass conservation, thereby resolving hallucinatory failure modes. FlowER additionally enables estimation of thermodynamic or kinetic feasibility and manifests a degree of chemical intuition in reaction prediction tasks.
arXiv Detail & Related papers (2025-02-18T16:01:17Z)
Challenging reaction prediction models to generalize to novel chemistry [12.33727805025678]
We report a series of evaluations of a prototypical SMILES-based deep learning model. First, we illustrate how performance on randomly sampled datasets is overly optimistic compared to performance when generalizing to new patents or new authors. Second, we conduct time splits that evaluate how models perform when tested on reactions published in years after those in their training set, mimicking real-world deployment.
arXiv Detail & Related papers (2025-01-11T23:49:14Z)
Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
This paper introduces modelname, a novel chemical reaction representation learning model tailored for a variety of organic-reaction-related tasks. By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction, thereby enhancing the comprehension of the reaction mechanism. We have designed an adapter structure to incorporate reaction conditions into the chemical reaction representation, allowing the model to handle diverse reaction conditions and adapt to various datasets and downstream tasks, e.g., reaction performance prediction.
arXiv Detail & Related papers (2024-11-26T17:41:44Z)
Beyond Major Product Prediction: Reproducing Reaction Mechanisms with Machine Learning Models Trained on a Large-Scale Mechanistic Dataset [10.968137261042715]
Mechanistic understanding of organic reactions can facilitate reaction development, impurity prediction, and in principle, reaction discovery. While several machine learning models have sought to address the task of predicting reaction products, their extension to predicting reaction mechanisms has been impeded by the lack of a corresponding mechanistic dataset. We construct such a dataset by imputing intermediates between experimentally reported reactants and products using expert reaction templates and train several machine learning models on the resulting dataset of 5,184,184 elementary steps.
arXiv Detail & Related papers (2024-03-07T15:26:23Z)
AI for Interpretable Chemistry: Predicting Radical Mechanistic Pathways via Contrastive Learning [45.379791270351184]
RMechRP is a new deep learning-based reaction predictor system. We develop and train models using RMechDB, a public database of radical reactions. Our results demonstrate the effectiveness of RMechRP in providing accurate and interpretable predictions.
arXiv Detail & Related papers (2023-11-02T09:47:27Z)
Doubly Stochastic Graph-based Non-autoregressive Reaction Prediction [59.41636061300571]
We propose a new framework called that combines two doubly self-attention mappings to obtain electron redistribution predictions. We show that our approach consistently improves the predictive performance of non-autoregressive models.
arXiv Detail & Related papers (2023-06-05T14:15:39Z)
MARS: A Motif-based Autoregressive Model for Retrosynthesis Prediction [54.75583184356392]
We propose a novel end-to-end graph generation model for retrosynthesis prediction. It sequentially identifies the reaction center, generates the synthons, and adds motifs to the synthons to generate reactants. Experiments on a benchmark dataset show that the proposed model significantly outperforms previous state-of-the-art algorithms.
arXiv Detail & Related papers (2022-09-27T06:29:35Z)
Non-Autoregressive Electron Redistribution Modeling for Reaction Prediction [26.007965383304864]
We devise a non-autoregressive learning paradigm that predicts reaction in one shot. We formulate a reaction as an arbitrary electron flow and predict it with a novel multi-pointer decoding network. Experiments on the USPTO-MIT dataset show that our approach has established a new state-of-the-art top-1 accuracy.
arXiv Detail & Related papers (2021-06-08T16:39:08Z)
Non-autoregressive electron flow generation for reaction prediction [15.98143959075733]
We devise a novel decoder that avoids such sequential generating and predicts the reaction in a Non-Autoregressive manner. Inspired by physical-chemistry insights, we represent edge edits in a molecule graph as electron flows, which can then be predicted in parallel. Our model achieves both an order of magnitude lower inference latency, with state-of-the-art top-1 accuracy and comparable performance on Top-K sampling.
arXiv Detail & Related papers (2020-12-16T10:01:26Z)
RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion. Our method disassembles retrosynthesis into two steps. While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z)

This list is automatically generated from the titles and abstracts of the papers in this site.