Re-evaluating Retrosynthesis Algorithms with Syntheseus
- URL: http://arxiv.org/abs/2310.19796v3
- Date: Fri, 6 Sep 2024 17:55:53 GMT
- Title: Re-evaluating Retrosynthesis Algorithms with Syntheseus
- Authors: Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr GaiĆski, Philipp Seidl, Marwin Segler,
- Abstract summary: We present a synthesis planning library with an extensive benchmarking framework, called syntheseus.
We demonstrate the capabilities of syntheseus by re-evaluating several previous retrosynthesis algorithms.
We end with guidance for future works in this area, and call the community to engage in the discussion on how to improve benchmarks for synthesis planning.
- Score: 13.384695742156152
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated Synthesis Planning has recently re-emerged as a research area at the intersection of chemistry and machine learning. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques, and unnecessarily hamper progress. To remedy this, we present a synthesis planning library with an extensive benchmarking framework, called syntheseus, which promotes best practice by default, enabling consistent meaningful evaluation of single-step models and multi-step planning algorithms. We demonstrate the capabilities of syntheseus by re-evaluating several previous retrosynthesis algorithms, and find that the ranking of state-of-the-art models changes in controlled evaluation experiments. We end with guidance for future works in this area, and call the community to engage in the discussion on how to improve benchmarks for synthesis planning.
Related papers
- Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search [27.09693306892583]
We present a formulation of synthesis planning with starting material constraints.
We propose Double-Ended Synthesis Planning (DESP), a novel CASP algorithm under a bidirectional graph search scheme.
DESP can make use of existing one-step retrosynthesis models, and we anticipate its performance to scale as these one-step model capabilities improve.
arXiv Detail & Related papers (2024-07-08T18:56:00Z) - SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation [55.2480439325792]
We study the synthesis of six datasets, covering topic classification, sentiment analysis, tone detection, and humor.
We find that SynthesizRR greatly improves lexical and semantic diversity, similarity to human-written text, and distillation performance.
arXiv Detail & Related papers (2024-05-16T12:22:41Z) - Systematic Assessment of Tabular Data Synthesis Algorithms [9.08530697055844]
We present a systematic evaluation framework for assessing data synthesis algorithms.
We introduce a set of new metrics in terms of fidelity, privacy, and utility to address their limitations.
Based on the proposed metrics, we also devise a unified objective for tuning, which can consistently improve the quality of synthetic data.
arXiv Detail & Related papers (2024-02-09T22:07:59Z) - Locally Optimal Descent for Dynamic Stepsize Scheduling [45.6809308002043]
We introduce a novel dynamic learning scheduling scheme grounded in theory with the goal of simplifying the manual and time-consuming tuning of schedules in phases.
Our approach is based on estimating a locally-optimal practice tuning-rate in the direction of a smooth gradient.
Our findings indicate that our method needs minimal tuning when compared to existing approaches.
arXiv Detail & Related papers (2023-11-23T09:57:35Z) - Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis
Planning [0.8620335948752805]
Retrosynthesis consists of breaking down a chemical compound step-by-step into molecular precursors.
Its two primary research directions, single-step retrosynthesis prediction and multi-step synthesis planning, are inherently intertwined.
We show that the choice of the single-step model can improve the overall success rate of synthesis planning by up to +28%.
arXiv Detail & Related papers (2023-08-10T12:04:47Z) - ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis [54.18659323181771]
We characterize several different forms of compositional generalization that are desirable in program synthesis.
We propose ExeDec, a novel decomposition-based strategy that predicts execution subgoals to solve problems step-by-step informed by program execution at each step.
arXiv Detail & Related papers (2023-07-26T01:07:52Z) - FusionRetro: Molecule Representation Fusion via In-Context Learning for
Retrosynthetic Planning [58.47265392465442]
Retrosynthetic planning aims to devise a complete multi-step synthetic route from starting materials to a target molecule.
Current strategies use a decoupled approach of single-step retrosynthesis models and search algorithms.
We propose a novel framework that utilizes context information for improved retrosynthetic planning.
arXiv Detail & Related papers (2022-09-30T08:44:58Z) - Compositional Generalization and Decomposition in Neural Program
Synthesis [59.356261137313275]
In this paper, we focus on measuring the ability of learned program synthesizers to compositionally generalize.
We first characterize several different axes along which program synthesis methods would be desired to generalize.
We introduce a benchmark suite of tasks to assess these abilities based on two popular existing datasets.
arXiv Detail & Related papers (2022-04-07T22:16:05Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z) - Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search [83.22850633478302]
Retrosynthetic planning identifies a series of reactions that can lead to the synthesis of a target product.
Existing methods either require expensive return estimation by rollout with high variance, or optimize for search speed rather than the quality.
We propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently.
arXiv Detail & Related papers (2020-06-29T05:53:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.