Multimodal Regression for Enzyme Turnover Rates Prediction
- URL: http://arxiv.org/abs/2509.11782v1
- Date: Mon, 15 Sep 2025 11:07:26 GMT
- Title: Multimodal Regression for Enzyme Turnover Rates Prediction
- Authors: Bozhen Hu, Cheng Tan, Siyuan Li, Jiangbin Zheng, Sizhe Qiu, Jun Xia, Stan Z. Li,
- Abstract summary: We propose a framework for predicting the enzyme turnover rate by integrating enzyme sequences, substrate structures, and environmental factors.<n>Our model combines a pre-trained language model and a convolutional neural network to extract features from protein sequences.<n>We leverage symbolic regression via Kolmogorov-Arnold Networks to explicitly learn mathematical formulas that govern the enzyme turnover rate.
- Score: 57.60697333734054
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The enzyme turnover rate is a fundamental parameter in enzyme kinetics, reflecting the catalytic efficiency of enzymes. However, enzyme turnover rates remain scarce across most organisms due to the high cost and complexity of experimental measurements. To address this gap, we propose a multimodal framework for predicting the enzyme turnover rate by integrating enzyme sequences, substrate structures, and environmental factors. Our model combines a pre-trained language model and a convolutional neural network to extract features from protein sequences, while a graph neural network captures informative representations from substrate molecules. An attention mechanism is incorporated to enhance interactions between enzyme and substrate representations. Furthermore, we leverage symbolic regression via Kolmogorov-Arnold Networks to explicitly learn mathematical formulas that govern the enzyme turnover rate, enabling interpretable and accurate predictions. Extensive experiments demonstrate that our framework outperforms both traditional and state-of-the-art deep learning approaches. This work provides a robust tool for studying enzyme kinetics and holds promise for applications in enzyme engineering, biotechnology, and industrial biocatalysis.
Related papers
- Improving Enzyme Prediction with Chemical Reaction Equations by Hypergraph-Enhanced Knowledge Graph Embeddings [7.848535217281907]
Predicting enzyme-substrate interactions has long been a fundamental problem in biochemistry and metabolic engineering.<n>Existing methods could leverage databases of expert-curated enzyme-substrate pairs for models to learn from known pair interactions.<n>This lack of sufficient training data significantly hinders the ability of traditional enzyme prediction models to generalize to unseen interactions.
arXiv Detail & Related papers (2026-01-08T19:17:18Z) - Fused Gromov-Wasserstein Contrastive Learning for Effective Enzyme-Reaction Screening [32.25999474073762]
FGW-CLIP is a contrastive learning framework based on optimizing the Gromov-Wasserstein distance.<n>FGW-CLIP consistently outperforms across all three splits of ReactZyme, the largest enzyme-reaction benchmark.<n>These results position FGW-CLIP as a promising framework for enzyme discovery in complex biochemical settings.
arXiv Detail & Related papers (2025-12-09T11:49:24Z) - OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learning [46.402707495664174]
We introduce a two-stage progressive framework, OmniESI, for enzyme-substrate interaction prediction through conditional deep learning.<n>We show that OmniESI consistently delivered superior performance than state-of-the-art specialized methods.<n>Overall, OmniESI represents a unified predictive approach for enzyme-substrate interactions.
arXiv Detail & Related papers (2025-06-22T09:40:40Z) - Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
RAlign is a novel chemical reaction representation learning model for various organic reaction-related tasks.<n>By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction.<n>We introduce a reaction-center-aware attention mechanism that enables the model to concentrate on key functional groups.
arXiv Detail & Related papers (2024-11-26T17:41:44Z) - Reaction-conditioned De Novo Enzyme Design with GENzyme [64.14088142258498]
textscGENzyme is a textitde novo enzyme design model that takes a catalytic reaction as input and generates the catalytic pocket, full enzyme structure, and enzyme-substrate binding complex.<n>textscGENzyme is an end-to-end, three-staged model that integrates (1) a catalytic pocket generation and sequence co-design module, (2) a pocket inpainting and enzyme inverse folding module, and (3) a binding and screening module to optimize and predict enzyme-substrate complexes.
arXiv Detail & Related papers (2024-11-10T00:37:26Z) - EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics [51.47520281819253]
Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology.
Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions.
We introduce EnzymeFlow, a generative model that employs flow matching with hierarchical pre-training and enzyme-reaction co-evolution to generate catalytic pockets.
arXiv Detail & Related papers (2024-10-01T02:04:01Z) - ReactZyme: A Benchmark for Enzyme-Reaction Prediction [41.33939896203491]
We introduce a new approach to annotating enzymes based on their catalyzed reactions.
We employ machine learning algorithms to analyze enzyme reaction datasets.
We frame the enzyme-reaction prediction as a retrieval problem, aiming to rank enzymes by their catalytic ability for specific reactions.
arXiv Detail & Related papers (2024-08-24T19:19:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.