ChemiRise: a data-driven retrosynthesis engine
- URL: http://arxiv.org/abs/2108.04682v1
- Date: Mon, 9 Aug 2021 05:13:14 GMT
- Title: ChemiRise: a data-driven retrosynthesis engine
- Authors: Xiangyan Sun, Ke Liu, Yuquan Lin, Lingjie Wu, Haoming Xing, Minghong
Gao, Ji Liu, Suocheng Tan, Zekun Ni, Qi Han, Junqiu Wu, Jie Fan
- Abstract summary: ChemiRise can propose complete retrosynthesis routes for organic compounds rapidly and reliably.
System was trained on a processed patent database of over 3 million organic reactions.
- Score: 19.52621175562223
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We have developed an end-to-end, retrosynthesis system, named ChemiRise, that
can propose complete retrosynthesis routes for organic compounds rapidly and
reliably. The system was trained on a processed patent database of over 3
million organic reactions. Experimental reactions were atom-mapped, clustered,
and extracted into reaction templates. We then trained a graph convolutional
neural network-based one-step reaction proposer using template embeddings and
developed a guiding algorithm on the directed acyclic graph (DAG) of chemical
compounds to find the best candidate to explore. The atom-mapping algorithm and
the one-step reaction proposer were benchmarked against previous studies and
showed better results. The final product was demonstrated by retrosynthesis
routes reviewed and rated by human experts, showing satisfying functionality
and a potential productivity boost in real-life use cases.
Related papers
- Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
This paper introduces modelname, a novel chemical reaction representation learning model tailored for a variety of organic-reaction-related tasks.
By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction, thereby enhancing the comprehension of the reaction mechanism.
We have designed an adapter structure to incorporate reaction conditions into the chemical reaction representation, allowing the model to handle diverse reaction conditions and adapt to various datasets and downstream tasks, e.g., reaction performance prediction.
arXiv Detail & Related papers (2024-11-26T17:41:44Z) - BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction [65.93303145891628]
BatGPT-Chem is a large language model with 15 billion parameters, tailored for enhanced retrosynthesis prediction.
Our model captures a broad spectrum of chemical knowledge, enabling precise prediction of reaction conditions.
This development empowers chemists to adeptly address novel compounds, potentially expediting the innovation cycle in drug manufacturing and materials science.
arXiv Detail & Related papers (2024-08-19T05:17:40Z) - Retro-prob: Retrosynthetic Planning Based on a Probabilistic Model [5.044138778500218]
Retrosynthesis is a fundamental but challenging task in organic chemistry.
Given a target molecule, the goal of retrosynthesis is to find out a series of reactions which could be assembled into a synthetic route.
We propose a new retrosynthetic planning algorithm called retro-prob to maximize the successful synthesis probability of target molecules.
arXiv Detail & Related papers (2024-05-25T08:23:40Z) - Retrosynthesis prediction enhanced by in-silico reaction data
augmentation [66.5643280109899]
We present RetroWISE, a framework that employs a base model inferred from real paired data to perform in-silico reaction generation and augmentation.
On three benchmark datasets, RetroWISE achieves the best overall performance against state-of-the-art models.
arXiv Detail & Related papers (2024-01-31T07:40:37Z) - Retroformer: Pushing the Limits of Interpretable End-to-end
Retrosynthesis Transformer [15.722719721123054]
Retrosynthesis prediction is one of the fundamental challenges in organic synthesis.
We propose Retroformer, a novel Transformer-based architecture for retrosynthesis prediction.
Retroformer reaches the new state-of-the-art accuracy for the end-to-end template-free retrosynthesis.
arXiv Detail & Related papers (2022-01-29T02:03:55Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z) - Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search [83.22850633478302]
Retrosynthetic planning identifies a series of reactions that can lead to the synthesis of a target product.
Existing methods either require expensive return estimation by rollout with high variance, or optimize for search speed rather than the quality.
We propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently.
arXiv Detail & Related papers (2020-06-29T05:53:33Z) - A Bayesian algorithm for retrosynthesis [3.1092085121563526]
This study is to discover synthetic routes backwardly from a given desired molecule to commercially available compounds.
A deep neural network is trained that forwardly predicts a product of the given reactants with a high level of accuracy.
Using the backward model, a diverse set of highly probable reaction sequences ending with a given synthetic target is exhaustively explored using a Monte Carlo search.
arXiv Detail & Related papers (2020-03-06T13:30:30Z) - Retrosynthesis Prediction with Conditional Graph Logic Network [118.70437805407728]
Computer-aided retrosynthesis is finding renewed interest from both chemistry and computer science communities.
We propose a new approach to this task using the Conditional Graph Logic Network, a conditional graphical model built upon graph neural networks.
arXiv Detail & Related papers (2020-01-06T05:36:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.