ReactZyme: A Benchmark for Enzyme-Reaction Prediction
- URL: http://arxiv.org/abs/2408.13659v3
- Date: Tue, 1 Oct 2024 02:12:41 GMT
- Title: ReactZyme: A Benchmark for Enzyme-Reaction Prediction
- Authors: Chenqing Hua, Bozitao Zhong, Sitao Luan, Liang Hong, Guy Wolf, Doina Precup, Shuangjia Zheng,
- Abstract summary: We introduce a new approach to annotating enzymes based on their catalyzed reactions.
We employ machine learning algorithms to analyze enzyme reaction datasets.
We frame the enzyme-reaction prediction as a retrieval problem, aiming to rank enzymes by their catalytic ability for specific reactions.
- Score: 41.33939896203491
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Enzymes, with their specific catalyzed reactions, are necessary for all aspects of life, enabling diverse biological processes and adaptations. Predicting enzyme functions is essential for understanding biological pathways, guiding drug development, enhancing bioproduct yields, and facilitating evolutionary studies. Addressing the inherent complexities, we introduce a new approach to annotating enzymes based on their catalyzed reactions. This method provides detailed insights into specific reactions and is adaptable to newly discovered reactions, diverging from traditional classifications by protein family or expert-derived reaction classes. We employ machine learning algorithms to analyze enzyme reaction datasets, delivering a much more refined view on the functionality of enzymes. Our evaluation leverages the largest enzyme-reaction dataset to date, derived from the SwissProt and Rhea databases with entries up to January 8, 2024. We frame the enzyme-reaction prediction as a retrieval problem, aiming to rank enzymes by their catalytic ability for specific reactions. With our model, we can recruit proteins for novel reactions and predict reactions in novel proteins, facilitating enzyme discovery and function annotation (https://github.com/WillHua127/ReactZyme).
Related papers
- UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site Knowledge [10.678089839728889]
We introduce a unified protein cleavage site predictor named UniZyme, which can generalize across diverse enzymes.
Experiments demonstrate that UniZyme achieves high accuracy in predicting cleavage sites across a range of proteolytic enzymes.
arXiv Detail & Related papers (2025-02-10T09:46:26Z) - Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
RAlign is a novel chemical reaction representation learning model for various organic reaction-related tasks.
By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction.
We introduce a reaction-center-aware attention mechanism that enables the model to concentrate on key functional groups.
arXiv Detail & Related papers (2024-11-26T17:41:44Z) - Reaction-conditioned De Novo Enzyme Design with GENzyme [64.14088142258498]
textscGENzyme is a textitde novo enzyme design model that takes a catalytic reaction as input and generates the catalytic pocket, full enzyme structure, and enzyme-substrate binding complex.
textscGENzyme is an end-to-end, three-staged model that integrates (1) a catalytic pocket generation and sequence co-design module, (2) a pocket inpainting and enzyme inverse folding module, and (3) a binding and screening module to optimize and predict enzyme-substrate complexes.
arXiv Detail & Related papers (2024-11-10T00:37:26Z) - EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics [51.47520281819253]
Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology.
Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions.
We introduce EnzymeFlow, a generative model that employs flow matching with hierarchical pre-training and enzyme-reaction co-evolution to generate catalytic pockets.
arXiv Detail & Related papers (2024-10-01T02:04:01Z) - Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates [16.5169461287914]
We propose EnzyGen, an approach to learn a unified model to design enzymes across all functional families.
Our key idea is to generate an enzyme's amino acid sequence and their 3D coordinates based on functionally important sites and substrates corresponding to a desired catalytic function.
arXiv Detail & Related papers (2024-05-13T21:48:48Z) - React-OT: Optimal Transport for Generating Transition State in Chemical Reactions [45.99250641377074]
We develop React-OT, an optimal transport approach for generating unique Transition State structures from reactants and products.
Re React-OT generates highly accurate TS structures with a median structural root mean square deviation (RMSD) of 0.053AA and median barrier height error of 1.06 kcal/mol requiring only 0.4 second per reaction.
arXiv Detail & Related papers (2024-04-20T17:31:45Z) - Mapping the Space of Chemical Reactions Using Attention-Based Neural
Networks [0.3848364262836075]
This work shows that transformer-based models can infer reaction classes from non-annotated, simple text-based representations of chemical reactions.
Our best model reaches a classification accuracy of 98.2%.
The insights into chemical reaction space enabled by our learned fingerprints are illustrated by an interactive reaction atlas.
arXiv Detail & Related papers (2020-12-09T10:25:30Z) - RetroXpert: Decompose Retrosynthesis Prediction like a Chemist [60.463900712314754]
We devise a novel template-free algorithm for automatic retrosynthetic expansion.
Our method disassembles retrosynthesis into two steps.
While outperforming the state-of-the-art baselines, our model also provides chemically reasonable interpretation.
arXiv Detail & Related papers (2020-11-04T04:35:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.