Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis
Planning
- URL: http://arxiv.org/abs/2308.05522v1
- Date: Thu, 10 Aug 2023 12:04:47 GMT
- Title: Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis
Planning
- Authors: Paula Torren-Peraire, Alan Kai Hassen, Samuel Genheden, Jonas
Verhoeven, Djork-Arne Clevert, Mike Preuss, Igor Tetko
- Abstract summary: Retrosynthesis consists of breaking down a chemical compound step-by-step into molecular precursors.
Its two primary research directions, single-step retrosynthesis prediction and multi-step synthesis planning, are inherently intertwined.
We show that the choice of the single-step model can improve the overall success rate of synthesis planning by up to +28%.
- Score: 0.8620335948752805
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Retrosynthesis consists of breaking down a chemical compound recursively
step-by-step into molecular precursors until a set of commercially available
molecules is found with the goal to provide a synthesis route. Its two primary
research directions, single-step retrosynthesis prediction, which models the
chemical reaction logic, and multi-step synthesis planning, which tries to find
the correct sequence of reactions, are inherently intertwined. Still, this
connection is not reflected in contemporary research. In this work, we combine
these two major research directions by applying multiple single-step
retrosynthesis models within multi-step synthesis planning and analyzing their
impact using public and proprietary reaction data. We find a disconnection
between high single-step performance and potential route-finding success,
suggesting that single-step models must be evaluated within synthesis planning
in the future. Furthermore, we show that the commonly used single-step
retrosynthesis benchmark dataset USPTO-50k is insufficient as this evaluation
task does not represent model performance and scalability on larger and more
diverse datasets. For multi-step synthesis planning, we show that the choice of
the single-step model can improve the overall success rate of synthesis
planning by up to +28% compared to the commonly used baseline model. Finally,
we show that each single-step model finds unique synthesis routes, and differs
in aspects such as route-finding success, the number of found synthesis routes,
and chemical validity, making the combination of single-step retrosynthesis
prediction and multi-step synthesis planning a crucial aspect when developing
future methods.
Related papers
- BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction [65.93303145891628]
BatGPT-Chem is a large language model with 15 billion parameters, tailored for enhanced retrosynthesis prediction.
Our model captures a broad spectrum of chemical knowledge, enabling precise prediction of reaction conditions.
This development empowers chemists to adeptly address novel compounds, potentially expediting the innovation cycle in drug manufacturing and materials science.
arXiv Detail & Related papers (2024-08-19T05:17:40Z) - Retro-prob: Retrosynthetic Planning Based on a Probabilistic Model [5.044138778500218]
Retrosynthesis is a fundamental but challenging task in organic chemistry.
Given a target molecule, the goal of retrosynthesis is to find out a series of reactions which could be assembled into a synthetic route.
We propose a new retrosynthetic planning algorithm called retro-prob to maximize the successful synthesis probability of target molecules.
arXiv Detail & Related papers (2024-05-25T08:23:40Z) - UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment [51.49238426241974]
This paper introduces UAlign, a template-free graph-to-sequence pipeline for retrosynthesis prediction.
By combining graph neural networks and Transformers, our method can more effectively leverage the inherent graph structure of molecules.
arXiv Detail & Related papers (2024-03-25T03:23:03Z) - RetroBridge: Modeling Retrosynthesis with Markov Bridges [2.256703675017117]
Retrosynthesis planning aims at designing reaction pathways from commercially available starting materials to a target molecule.
We introduce the Markov Bridge Model, a generative framework aimed to approximate the dependency between two discrete distributions.
We then address the retrosynthesis planning problem with our novel framework and introduce RetroBridge, a template-free retrosynthesis modeling approach.
arXiv Detail & Related papers (2023-08-30T15:09:22Z) - Mind the Retrosynthesis Gap: Bridging the divide between Single-step and
Multi-step Retrosynthesis Prediction [0.9134244356393664]
Multi-step approaches repeatedly apply the chemical information stored in single-step retrosynthesis models.
We show that models designed for single-step retrosynthesis, when extended to multi-step, can have a tremendous impact on the route finding capabilities of current multi-step methods.
arXiv Detail & Related papers (2022-12-12T18:06:24Z) - FusionRetro: Molecule Representation Fusion via In-Context Learning for
Retrosynthetic Planning [58.47265392465442]
Retrosynthetic planning aims to devise a complete multi-step synthetic route from starting materials to a target molecule.
Current strategies use a decoupled approach of single-step retrosynthesis models and search algorithms.
We propose a novel framework that utilizes context information for improved retrosynthetic planning.
arXiv Detail & Related papers (2022-09-30T08:44:58Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z) - Learning Graph Models for Retrosynthesis Prediction [90.15523831087269]
Retrosynthesis prediction is a fundamental problem in organic synthesis.
This paper introduces a graph-based approach that capitalizes on the idea that the graph topology of precursor molecules is largely unaltered during a chemical reaction.
Our model achieves a top-1 accuracy of $53.7%$, outperforming previous template-free and semi-template-based methods.
arXiv Detail & Related papers (2020-06-12T09:40:42Z) - Retrosynthesis Prediction with Conditional Graph Logic Network [118.70437805407728]
Computer-aided retrosynthesis is finding renewed interest from both chemistry and computer science communities.
We propose a new approach to this task using the Conditional Graph Logic Network, a conditional graphical model built upon graph neural networks.
arXiv Detail & Related papers (2020-01-06T05:36:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.