RetCL: A Selection-based Approach for Retrosynthesis via Contrastive
Learning
- URL: http://arxiv.org/abs/2105.00795v1
- Date: Mon, 3 May 2021 12:47:57 GMT
- Title: RetCL: A Selection-based Approach for Retrosynthesis via Contrastive
Learning
- Authors: Hankook Lee, Sungsoo Ahn, Seung-Woo Seo, You Young Song, Sung-Ju
Hwang, Eunho Yang, Jinwoo Shin
- Abstract summary: Retrosynthesis is an emerging research area of deep learning.
We propose a new approach that reformulating retrosynthesis into a selection problem of reactants from a candidate set of commercially available molecules.
For learning the score functions, we also propose a novel contrastive training scheme with hard negative mining.
- Score: 107.64562550844146
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Retrosynthesis, of which the goal is to find a set of reactants for
synthesizing a target product, is an emerging research area of deep learning.
While the existing approaches have shown promising results, they currently lack
the ability to consider availability (e.g., stability or purchasability) of the
reactants or generalize to unseen reaction templates (i.e., chemical reaction
rules). In this paper, we propose a new approach that mitigates the issues by
reformulating retrosynthesis into a selection problem of reactants from a
candidate set of commercially available molecules. To this end, we design an
efficient reactant selection framework, named RetCL (retrosynthesis via
contrastive learning), for enumerating all of the candidate molecules based on
selection scores computed by graph neural networks. For learning the score
functions, we also propose a novel contrastive training scheme with hard
negative mining. Extensive experiments demonstrate the benefits of the proposed
selection-based approach. For example, when all 671k reactants in the USPTO
{database} are given as candidates, our RetCL achieves top-1 exact match
accuracy of $71.3\%$ for the USPTO-50k benchmark, while a recent
transformer-based approach achieves $59.6\%$. We also demonstrate that RetCL
generalizes well to unseen templates in various settings in contrast to
template-based approaches.
Related papers
- RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets [8.308430428140413]
Single-step retrosynthesis aims to predict a set of reactions that lead to the creation of a target molecule.
We propose a novel model, RetroGFN, that can explore outside the limited dataset and return a diverse set of feasible reactions.
We show that RetroGFN achieves competitive results on standard top-k accuracy while outperforming existing methods on round-trip accuracy.
arXiv Detail & Related papers (2024-06-26T20:10:03Z) - Retrosynthesis prediction enhanced by in-silico reaction data
augmentation [66.5643280109899]
We present RetroWISE, a framework that employs a base model inferred from real paired data to perform in-silico reaction generation and augmentation.
On three benchmark datasets, RetroWISE achieves the best overall performance against state-of-the-art models.
arXiv Detail & Related papers (2024-01-31T07:40:37Z) - Retrosynthesis Prediction with Local Template Retrieval [112.23386062396622]
Retrosynthesis, which predicts the reactants of a given target molecule, is an essential task for drug discovery.
In this work, we introduce RetroKNN, a local reaction template retrieval method.
We conduct comprehensive experiments on two widely used benchmarks, the USPTO-50K and USPTO-MIT.
arXiv Detail & Related papers (2023-06-07T03:38:03Z) - MetaRF: Differentiable Random Forest for Reaction Yield Prediction with
a Few Trails [58.47364143304643]
In this paper, we focus on the reaction yield prediction problem.
We first put forth MetaRF, an attention-based differentiable random forest model specially designed for the few-shot yield prediction.
To improve the few-shot learning performance, we further introduce a dimension-reduction based sampling method.
arXiv Detail & Related papers (2022-08-22T06:40:13Z) - RetroComposer: Discovering Novel Reactions by Composing Templates for
Retrosynthesis Prediction [63.14937611038264]
We propose an innovative retrosynthesis prediction framework that can compose novel templates beyond training templates.
Experimental results show that our method can produce novel templates for 328 test reactions in the USPTO-50K dataset.
arXiv Detail & Related papers (2021-12-20T05:57:07Z) - Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis [1.6449390849183363]
Retrosynthesis is a problem to infer reactant compounds to synthesize a given product compound through chemical reactions.
Recent studies on retrosynthesis focus on proposing more sophisticated prediction models.
The dataset to feed the models also plays an essential role in achieving the best generalizing models.
arXiv Detail & Related papers (2020-10-02T05:27:51Z) - A Bayesian algorithm for retrosynthesis [3.1092085121563526]
This study is to discover synthetic routes backwardly from a given desired molecule to commercially available compounds.
A deep neural network is trained that forwardly predicts a product of the given reactants with a high level of accuracy.
Using the backward model, a diverse set of highly probable reaction sequences ending with a given synthetic target is exhaustively explored using a Monte Carlo search.
arXiv Detail & Related papers (2020-03-06T13:30:30Z) - Retrosynthesis Prediction with Conditional Graph Logic Network [118.70437805407728]
Computer-aided retrosynthesis is finding renewed interest from both chemistry and computer science communities.
We propose a new approach to this task using the Conditional Graph Logic Network, a conditional graphical model built upon graph neural networks.
arXiv Detail & Related papers (2020-01-06T05:36:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.