Beyond the Typical: Modeling Rare Plausible Patterns in Chemical Reactions by Leveraging Sequential Mixture-of-Experts
- URL: http://arxiv.org/abs/2310.04674v2
- Date: Tue, 20 Aug 2024 18:52:56 GMT
- Title: Beyond the Typical: Modeling Rare Plausible Patterns in Chemical Reactions by Leveraging Sequential Mixture-of-Experts
- Authors: Taicheng Guo, Changsheng Ma, Xiuying Chen, Bozhao Nan, Kehan Guo, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang,
- Abstract summary: Generative models like Transformer and VAE have typically been employed to predict the reaction product.
We propose organizing the mapping space between reactants and electron redistribution patterns in a divide-and-conquer manner.
- Score: 42.9784548283531
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reaction prediction, a critical task in synthetic chemistry, is to predict the outcome of a reaction based on given reactants. Generative models like Transformer and VAE have typically been employed to predict the reaction product. However, these likelihood-maximization models overlooked the inherent stochastic nature of chemical reactions, such as the multiple ways electrons can be redistributed among atoms during the reaction process. In scenarios where similar reactants could follow different electron redistribution patterns, these models typically predict the most common outcomes, neglecting less frequent but potentially crucial reaction patterns. These overlooked patterns, though rare, can lead to innovative methods for designing synthetic routes and significantly advance synthesis techniques. To break the limits of previous approaches, we propose organizing the mapping space between reactants and electron redistribution patterns in a divide-and-conquer manner. We address the reaction problem by training multiple expert models, each specializing in capturing a type of electron redistribution pattern in reaction. These experts enhance the prediction process by considering both typical and other less common electron redistribution manners. In the inference stage, a dropout strategy is applied to each expert to improve the electron redistribution diversity. The most plausible products are finally predicted through a ranking stage designed to integrate the predictions from multiple experts. Experimental results on the largest reaction prediction benchmark USPTO-MIT show the superior performance of our proposed method compared to baselines.
Related papers
- Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
This paper introduces modelname, a novel chemical reaction representation learning model tailored for a variety of organic-reaction-related tasks.
By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction, thereby enhancing the comprehension of the reaction mechanism.
We have designed an adapter structure to incorporate reaction conditions into the chemical reaction representation, allowing the model to handle diverse reaction conditions and adapt to various datasets and downstream tasks, e.g., reaction performance prediction.
arXiv Detail & Related papers (2024-11-26T17:41:44Z) - Beyond Major Product Prediction: Reproducing Reaction Mechanisms with
Machine Learning Models Trained on a Large-Scale Mechanistic Dataset [10.968137261042715]
Mechanistic understanding of organic reactions can facilitate reaction development, impurity prediction, and in principle, reaction discovery.
While several machine learning models have sought to address the task of predicting reaction products, their extension to predicting reaction mechanisms has been impeded by the lack of a corresponding mechanistic dataset.
We construct such a dataset by imputing intermediates between experimentally reported reactants and products using expert reaction templates and train several machine learning models on the resulting dataset of 5,184,184 elementary steps.
arXiv Detail & Related papers (2024-03-07T15:26:23Z) - AI for Interpretable Chemistry: Predicting Radical Mechanistic Pathways
via Contrastive Learning [45.379791270351184]
RMechRP is a new deep learning-based reaction predictor system.
We develop and train models using RMechDB, a public database of radical reactions.
Our results demonstrate the effectiveness of RMechRP in providing accurate and interpretable predictions.
arXiv Detail & Related papers (2023-11-02T09:47:27Z) - Doubly Stochastic Graph-based Non-autoregressive Reaction Prediction [59.41636061300571]
We propose a new framework called that combines two doubly self-attention mappings to obtain electron redistribution predictions.
We show that our approach consistently improves the predictive performance of non-autoregressive models.
arXiv Detail & Related papers (2023-06-05T14:15:39Z) - MARS: A Motif-based Autoregressive Model for Retrosynthesis Prediction [54.75583184356392]
We propose a novel end-to-end graph generation model for retrosynthesis prediction.
It sequentially identifies the reaction center, generates the synthons, and adds motifs to the synthons to generate reactants.
Experiments on a benchmark dataset show that the proposed model significantly outperforms previous state-of-the-art algorithms.
arXiv Detail & Related papers (2022-09-27T06:29:35Z) - Non-Autoregressive Electron Redistribution Modeling for Reaction
Prediction [26.007965383304864]
We devise a non-autoregressive learning paradigm that predicts reaction in one shot.
We formulate a reaction as an arbitrary electron flow and predict it with a novel multi-pointer decoding network.
Experiments on the USPTO-MIT dataset show that our approach has established a new state-of-the-art top-1 accuracy.
arXiv Detail & Related papers (2021-06-08T16:39:08Z) - Non-autoregressive electron flow generation for reaction prediction [15.98143959075733]
We devise a novel decoder that avoids such sequential generating and predicts the reaction in a Non-Autoregressive manner.
Inspired by physical-chemistry insights, we represent edge edits in a molecule graph as electron flows, which can then be predicted in parallel.
Our model achieves both an order of magnitude lower inference latency, with state-of-the-art top-1 accuracy and comparable performance on Top-K sampling.
arXiv Detail & Related papers (2020-12-16T10:01:26Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.