A Bayesian algorithm for retrosynthesis
- URL: http://arxiv.org/abs/2003.03190v1
- Date: Fri, 6 Mar 2020 13:30:30 GMT
- Title: A Bayesian algorithm for retrosynthesis
- Authors: Zhongliang Guo and Stephen Wu and Mitsuru Ohno and Ryo Yoshida
- Abstract summary: This study is to discover synthetic routes backwardly from a given desired molecule to commercially available compounds.
A deep neural network is trained that forwardly predicts a product of the given reactants with a high level of accuracy.
Using the backward model, a diverse set of highly probable reaction sequences ending with a given synthetic target is exhaustively explored using a Monte Carlo search.
- Score: 3.1092085121563526
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The identification of synthetic routes that end with a desired product has
been an inherently time-consuming process that is largely dependent on expert
knowledge regarding a limited fraction of the entire reaction space. At
present, emerging machine-learning technologies are overturning the process of
retrosynthetic planning. The objective of this study is to discover synthetic
routes backwardly from a given desired molecule to commercially available
compounds. The problem is reduced to a combinatorial optimization task with the
solution space subject to the combinatorial complexity of all possible pairs of
purchasable reactants. We address this issue within the framework of Bayesian
inference and computation. The workflow consists of two steps: a deep neural
network is trained that forwardly predicts a product of the given reactants
with a high level of accuracy, following which this forward model is inverted
into the backward one via Bayes' law of conditional probability. Using the
backward model, a diverse set of highly probable reaction sequences ending with
a given synthetic target is exhaustively explored using a Monte Carlo search
algorithm. The Bayesian retrosynthesis algorithm could successfully rediscover
80.3% and 50.0% of known synthetic routes of single-step and two-step reactions
within top-10 accuracy, respectively, thereby outperforming state-of-the-art
algorithms in terms of the overall accuracy. Remarkably, the Monte Carlo
method, which was specifically designed for the presence of diverse multiple
routes, often revealed a ranked list of hundreds of reaction routes to the same
synthetic target. We investigated the potential applicability of such diverse
candidates based on expert knowledge from synthetic organic chemistry.
Related papers
- BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction [65.93303145891628]
BatGPT-Chem is a large language model with 15 billion parameters, tailored for enhanced retrosynthesis prediction.
Our model captures a broad spectrum of chemical knowledge, enabling precise prediction of reaction conditions.
This development empowers chemists to adeptly address novel compounds, potentially expediting the innovation cycle in drug manufacturing and materials science.
arXiv Detail & Related papers (2024-08-19T05:17:40Z) - Retro-prob: Retrosynthetic Planning Based on a Probabilistic Model [5.044138778500218]
Retrosynthesis is a fundamental but challenging task in organic chemistry.
Given a target molecule, the goal of retrosynthesis is to find out a series of reactions which could be assembled into a synthetic route.
We propose a new retrosynthetic planning algorithm called retro-prob to maximize the successful synthesis probability of target molecules.
arXiv Detail & Related papers (2024-05-25T08:23:40Z) - FusionRetro: Molecule Representation Fusion via In-Context Learning for
Retrosynthetic Planning [58.47265392465442]
Retrosynthetic planning aims to devise a complete multi-step synthetic route from starting materials to a target molecule.
Current strategies use a decoupled approach of single-step retrosynthesis models and search algorithms.
We propose a novel framework that utilizes context information for improved retrosynthetic planning.
arXiv Detail & Related papers (2022-09-30T08:44:58Z) - Bayesian Sequential Stacking Algorithm for Concurrently Designing
Molecules and Synthetic Reaction Networks [11.787915018281897]
We present a powerful sequential Monte Carlo algorithm that sequentially designs a synthetic reaction network by building up single-step reactions.
In a case study of designing drug-like molecules based on commercially available compounds, the proposed method shows overwhelming performance in terms of computational efficiency and coverage.
arXiv Detail & Related papers (2022-03-01T14:55:32Z) - ChemiRise: a data-driven retrosynthesis engine [19.52621175562223]
ChemiRise can propose complete retrosynthesis routes for organic compounds rapidly and reliably.
System was trained on a processed patent database of over 3 million organic reactions.
arXiv Detail & Related papers (2021-08-09T05:13:14Z) - RetCL: A Selection-based Approach for Retrosynthesis via Contrastive
Learning [107.64562550844146]
Retrosynthesis is an emerging research area of deep learning.
We propose a new approach that reformulating retrosynthesis into a selection problem of reactants from a candidate set of commercially available molecules.
For learning the score functions, we also propose a novel contrastive training scheme with hard negative mining.
arXiv Detail & Related papers (2021-05-03T12:47:57Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z) - Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search [83.22850633478302]
Retrosynthetic planning identifies a series of reactions that can lead to the synthesis of a target product.
Existing methods either require expensive return estimation by rollout with high variance, or optimize for search speed rather than the quality.
We propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently.
arXiv Detail & Related papers (2020-06-29T05:53:33Z) - Retrosynthesis Prediction with Conditional Graph Logic Network [118.70437805407728]
Computer-aided retrosynthesis is finding renewed interest from both chemistry and computer science communities.
We propose a new approach to this task using the Conditional Graph Logic Network, a conditional graphical model built upon graph neural networks.
arXiv Detail & Related papers (2020-01-06T05:36:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.