Modeling non-uniform uncertainty in Reaction Prediction via Boosting and
Dropout
- URL: http://arxiv.org/abs/2310.04674v1
- Date: Sat, 7 Oct 2023 03:18:26 GMT
- Title: Modeling non-uniform uncertainty in Reaction Prediction via Boosting and
Dropout
- Authors: Taicheng Guo, Changsheng Ma, Xiuying Chen, Bozhao Nan, Kehan Guo,
Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang
- Abstract summary: Variational Autoencoder(VAE) framework has typically been employed to tackle challenges in reaction prediction.
We introduce randomness into product generation via boosting to ensemble diverse models and cover the range of potential outcomes.
We design a ranking method to union the predictions from boosting and dropout, prioritizing the most plausible products.
- Score: 44.5946975612778
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reaction prediction has been recognized as a critical task in synthetic
chemistry, where the goal is to predict the outcome of a reaction based on the
given reactants. With the widespread adoption of generative models, the
Variational Autoencoder(VAE) framework has typically been employed to tackle
challenges in reaction prediction, where the reactants are encoded as a
condition for the decoder, which then generates the product. Despite
effectiveness, these conditional VAE (CVAE) models still fail to adequately
account for the inherent uncertainty in reaction prediction, which primarily
stems from the stochastic reaction process. The principal limitations are
twofold. Firstly, in these CVAE models, the prior is independent of the
reactants, leading to a default wide and assumed uniform distribution variance
of the generated product. Secondly, reactants with analogous molecular
representations are presumed to undergo similar electronic transition
processes, thereby producing similar products. This hinders the ability to
model diverse reaction mechanisms effectively. Since the variance in outcomes
is inherently non-uniform, we are thus motivated to develop a framework that
generates reaction products with non-uniform uncertainty. Firstly, we eliminate
the latent variable in previous CVAE models to mitigate uncontrol-label noise.
Instead, we introduce randomness into product generation via boosting to
ensemble diverse models and cover the range of potential outcomes, and through
dropout to secure models with minor variations. Additionally, we design a
ranking method to union the predictions from boosting and dropout, prioritizing
the most plausible products. Experimental results on the largest reaction
prediction benchmark USPTO-MIT show the superior performance of our proposed
method in modeling the non-uniform uncertainty compared to baselines.
Related papers
- Beyond Major Product Prediction: Reproducing Reaction Mechanisms with
Machine Learning Models Trained on a Large-Scale Mechanistic Dataset [10.968137261042715]
Mechanistic understanding of organic reactions can facilitate reaction development, impurity prediction, and in principle, reaction discovery.
While several machine learning models have sought to address the task of predicting reaction products, their extension to predicting reaction mechanisms has been impeded by the lack of a corresponding mechanistic dataset.
We construct such a dataset by imputing intermediates between experimentally reported reactants and products using expert reaction templates and train several machine learning models on the resulting dataset of 5,184,184 elementary steps.
arXiv Detail & Related papers (2024-03-07T15:26:23Z) - Retrosynthesis prediction enhanced by in-silico reaction data
augmentation [66.5643280109899]
We present RetroWISE, a framework that employs a base model inferred from real paired data to perform in-silico reaction generation and augmentation.
On three benchmark datasets, RetroWISE achieves the best overall performance against state-of-the-art models.
arXiv Detail & Related papers (2024-01-31T07:40:37Z) - Simultaneous inference for generalized linear models with unmeasured confounders [0.0]
We propose a unified statistical estimation and inference framework that harnesses structures and integrates linear projections into three key stages.
We show effective Type-I error control of $z$-tests as sample and response sizes approach infinity.
arXiv Detail & Related papers (2023-09-13T18:53:11Z) - Doubly Stochastic Graph-based Non-autoregressive Reaction Prediction [59.41636061300571]
We propose a new framework called that combines two doubly self-attention mappings to obtain electron redistribution predictions.
We show that our approach consistently improves the predictive performance of non-autoregressive models.
arXiv Detail & Related papers (2023-06-05T14:15:39Z) - MARS: A Motif-based Autoregressive Model for Retrosynthesis Prediction [54.75583184356392]
We propose a novel end-to-end graph generation model for retrosynthesis prediction.
It sequentially identifies the reaction center, generates the synthons, and adds motifs to the synthons to generate reactants.
Experiments on a benchmark dataset show that the proposed model significantly outperforms previous state-of-the-art algorithms.
arXiv Detail & Related papers (2022-09-27T06:29:35Z) - Loss function based second-order Jensen inequality and its application
to particle variational inference [112.58907653042317]
Particle variational inference (PVI) uses an ensemble of models as an empirical approximation for the posterior distribution.
PVI iteratively updates each model with a repulsion force to ensure the diversity of the optimized models.
We derive a novel generalization error bound and show that it can be reduced by enhancing the diversity of models.
arXiv Detail & Related papers (2021-06-09T12:13:51Z) - Non-Autoregressive Electron Redistribution Modeling for Reaction
Prediction [26.007965383304864]
We devise a non-autoregressive learning paradigm that predicts reaction in one shot.
We formulate a reaction as an arbitrary electron flow and predict it with a novel multi-pointer decoding network.
Experiments on the USPTO-MIT dataset show that our approach has established a new state-of-the-art top-1 accuracy.
arXiv Detail & Related papers (2021-06-08T16:39:08Z) - Non-autoregressive electron flow generation for reaction prediction [15.98143959075733]
We devise a novel decoder that avoids such sequential generating and predicts the reaction in a Non-Autoregressive manner.
Inspired by physical-chemistry insights, we represent edge edits in a molecule graph as electron flows, which can then be predicted in parallel.
Our model achieves both an order of magnitude lower inference latency, with state-of-the-art top-1 accuracy and comparable performance on Top-K sampling.
arXiv Detail & Related papers (2020-12-16T10:01:26Z) - Decision-Making with Auto-Encoding Variational Bayes [71.44735417472043]
We show that a posterior approximation distinct from the variational distribution should be used for making decisions.
Motivated by these theoretical results, we propose learning several approximate proposals for the best model.
In addition to toy examples, we present a full-fledged case study of single-cell RNA sequencing.
arXiv Detail & Related papers (2020-02-17T19:23:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.