SemiRetro: Semi-template framework boosts deep retrosynthesis prediction
- URL: http://arxiv.org/abs/2202.08205v1
- Date: Sat, 12 Feb 2022 00:38:11 GMT
- Title: SemiRetro: Semi-template framework boosts deep retrosynthesis prediction
- Authors: Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li
- Abstract summary: template-based (TB) and template-free (TF) molecule graph learning methods have shown promising results to retrosynthesis.
We propose breaking a full-template into several semi-templates and embedding them into the two-step TF framework.
Experimental results show that SemiRetro significantly outperforms both existing TB and TF methods.
- Score: 38.42917984016527
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, template-based (TB) and template-free (TF) molecule graph learning
methods have shown promising results to retrosynthesis. TB methods are more
accurate using pre-encoded reaction templates, and TF methods are more scalable
by decomposing retrosynthesis into subproblems, i.e., center identification and
synthon completion. To combine both advantages of TB and TF, we suggest
breaking a full-template into several semi-templates and embedding them into
the two-step TF framework. Since many semi-templates are reduplicative, the
template redundancy can be reduced while the essential chemical knowledge is
still preserved to facilitate synthon completion. We call our method SemiRetro,
introduce a new GNN layer (DRGAT) to enhance center identification, and propose
a novel self-correcting module to improve semi-template classification.
Experimental results show that SemiRetro significantly outperforms both
existing TB and TF methods. In scalability, SemiRetro covers 98.9\% data using
150 semi-templates, while previous template-based GLN requires 11,647 templates
to cover 93.3\% data. In top-1 accuracy, SemiRetro exceeds template-free G2G
4.8\% (class known) and 6.0\% (class unknown). Besides, SemiRetro has better
training efficiency than existing methods.
Related papers
- UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment [51.49238426241974]
This paper introduces UAlign, a template-free graph-to-sequence pipeline for retrosynthesis prediction.
By combining graph neural networks and Transformers, our method can more effectively leverage the inherent graph structure of molecules.
arXiv Detail & Related papers (2024-03-25T03:23:03Z) - Minusformer: Improving Time Series Forecasting by Progressively Learning Residuals [14.741951369068877]
We find that ubiquitous time series (TS) forecasting models are prone to severe overfitting.
We introduce a dual-stream and subtraction mechanism, which is a deep Boosting ensemble learning method.
The proposed method outperform existing state-of-the-art methods, yielding an average performance improvement of 11.9% across various datasets.
arXiv Detail & Related papers (2024-02-04T03:54:31Z) - Retrosynthesis Prediction with Local Template Retrieval [112.23386062396622]
Retrosynthesis, which predicts the reactants of a given target molecule, is an essential task for drug discovery.
In this work, we introduce RetroKNN, a local reaction template retrieval method.
We conduct comprehensive experiments on two widely used benchmarks, the USPTO-50K and USPTO-MIT.
arXiv Detail & Related papers (2023-06-07T03:38:03Z) - RetroComposer: Discovering Novel Reactions by Composing Templates for
Retrosynthesis Prediction [63.14937611038264]
We propose an innovative retrosynthesis prediction framework that can compose novel templates beyond training templates.
Experimental results show that our method can produce novel templates for 328 test reactions in the USPTO-50K dataset.
arXiv Detail & Related papers (2021-12-20T05:57:07Z) - MTC: Multiresolution Tensor Completion from Partial and Coarse
Observations [49.931849672492305]
Existing completion formulation mostly relies on partial observations from a single tensor.
We propose an efficient Multi-resolution Completion model (MTC) to solve the problem.
arXiv Detail & Related papers (2021-06-14T02:20:03Z) - RetCL: A Selection-based Approach for Retrosynthesis via Contrastive
Learning [107.64562550844146]
Retrosynthesis is an emerging research area of deep learning.
We propose a new approach that reformulating retrosynthesis into a selection problem of reactants from a candidate set of commercially available molecules.
For learning the score functions, we also propose a novel contrastive training scheme with hard negative mining.
arXiv Detail & Related papers (2021-05-03T12:47:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.