Accelerating High-Throughput Catalyst Screening by Direct Generation of Equilibrium Adsorption Structures
- URL: http://arxiv.org/abs/2512.15228v1
- Date: Wed, 17 Dec 2025 09:26:58 GMT
- Title: Accelerating High-Throughput Catalyst Screening by Direct Generation of Equilibrium Adsorption Structures
- Authors: Songze Huo, Xiao-Ming Cao,
- Abstract summary: We present DBCata, a deep generative model that integrates a periodic Brownian-bridge framework with an equivariant graph neural network to establish a low-dimensional transition manifold between unrelaxed and DFT-relaxed structures.<n>Upon training, DBCata effectively generates high-fidelity, interatomic distance mean absolute error (DMAE) of 0.035 text on the Catalysis-Hub dataset.<n>The corresponding DFT accuracy can be improved within 0.1 eV in 94% of instances by identifying and refining anomalous predictions.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The adsorption energy serves as a crucial descriptor for the large-scale screening of catalysts. Nevertheless, the limited distribution of training data for the extensively utilised machine learning interatomic potential (MLIP), predominantly sourced from near-equilibrium structures, results in unreliable adsorption structures and consequent adsorption energy predictions. In this context, we present DBCata, a deep generative model that integrates a periodic Brownian-bridge framework with an equivariant graph neural network to establish a low-dimensional transition manifold between unrelaxed and DFT-relaxed structures, without requiring explicit energy or force information. Upon training, DBCata effectively generates high-fidelity adsorption geometries, achieving an interatomic distance mean absolute error (DMAE) of 0.035 \textÃ… on the Catalysis-Hub dataset, which is nearly three times superior to that of the current state-of-the-art machine learning potential models. Moreover, the corresponding DFT accuracy can be improved within 0.1 eV in 94\% of instances by identifying and refining anomalous predictions through a hybrid chemical-heuristic and self-supervised outlier detection approach. We demonstrate that the remarkable performance of DBCata facilitates accelerated high-throughput computational screening for efficient alloy catalysts in the oxygen reduction reaction, highlighting the potential of DBCata as a powerful tool for catalyst design and optimisation.
Related papers
- Equivariant Evidential Deep Learning for Interatomic Potentials [55.6997213490859]
Uncertainty quantification is critical for assessing the reliability of machine learning interatomic potentials in molecular dynamics simulations.<n>Existing UQ approaches for MLIPs are often limited by high computational cost or suboptimal performance.<n>We propose textitEquivariant Evidential Deep Learning for Interatomic Potentials ($texte2$IP), a backbone-agnostic framework that models atomic forces and their uncertainty jointly.
arXiv Detail & Related papers (2026-02-11T02:00:25Z) - QE-Catalytic: A Graph-Language Multimodal Base Model for Relaxed-Energy Prediction in Catalytic Adsorption [44.77883047868218]
We propose QE-Catalytic, a multimodal framework that couples a large language model with an E(3)-equivariant graph Transformer.<n>During prediction, QE-Catalytic jointly leverages three-dimensional structures and structured configuration text, and injects 3D geometric information'' into the language channel.<n>On OC20, QE-Catalytic reduces the MAE of relaxed adsorb energy from 0.713eV to 0.486eV, and consistently outperforms baseline models.
arXiv Detail & Related papers (2025-12-23T06:27:30Z) - Accurate, transferable, and verifiable machine-learned interatomic potentials for layered materials [0.0]
Twisted layered van-der-Waals materials often exhibit unique electronic and optical properties absent in their non-twisted counterparts.<n>Here, we introduce a split machine-learned interatomic potential and dataset curation approach that separates intralayer and interlayer interactions.<n>Our approach integrates seamlessly with various intralayer and interlayer interaction models, enabling computationally tractable relaxation of moir'e materials.
arXiv Detail & Related papers (2025-03-19T17:14:02Z) - Inorganic Catalyst Efficiency Prediction Based on EAPCR Model: A Deep Learning Solution for Multi-Source Heterogeneous Data [9.022023762759641]
This study introduces the Embedding-Attention-Permutated CNN-Residual (EAPCR) deep learning model.<n>EAPCR constructs a feature association matrix using embedding and attention mechanisms and enhances predictive performance.<n>We evaluate EAPCR on datasets from heterogeneous photocatalysis, thermal, and electrocatalysis.
arXiv Detail & Related papers (2025-03-10T15:10:22Z) - Adsorb-Agent: Autonomous Identification of Stable Adsorption Configurations via Large Language Model Agent [5.417632175667162]
Adsorb-Agent is a Large Language Model (LLM) agent designed to efficiently identify stable adsorbate-catalyst configurations.<n>Tested on twenty diverse systems, Adsorb-Agent identifies comparable adsorbate-catalyst energies for 84% of cases and achieves lower energies for 35%.
arXiv Detail & Related papers (2024-10-22T03:19:16Z) - Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure [53.76752789814785]
DumplingGNN is a hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure.
We evaluate it on a comprehensive ADC payload dataset focusing on DNA Topoisomerase I inhibitors.
It demonstrates exceptional accuracy (91.48%), sensitivity (95.08%), and specificity (97.54%) on our specialized ADC payload dataset.
arXiv Detail & Related papers (2024-09-23T17:11:04Z) - Manifold-Constrained Nucleus-Level Denoising Diffusion Model for Structure-Based Drug Design [81.95343363178662]
atoms must maintain a minimum pairwise distance to avoid separation violations.
NucleusDiff models the interactions between atomic nuclei and their surrounding electron clouds by enforcing the distance constraint.
It reduces violation rate by up to 1000% and enhances binding affinity by up to 22.16%, surpassing state-of-the-art models for structure-based drug design.
arXiv Detail & Related papers (2024-09-16T08:42:46Z) - Adaptive Catalyst Discovery Using Multicriteria Bayesian Optimization with Representation Learning [17.00084254889438]
High-performance catalysts are crucial for sustainable energy conversion and human health.
The discovery of catalysts faces challenges due to the absence of efficient approaches to navigating vast and high-dimensional structure and composition spaces.
arXiv Detail & Related papers (2024-04-18T18:11:06Z) - On the importance of catalyst-adsorbate 3D interactions for relaxed
energy predictions [98.70797778496366]
We investigate whether it is possible to predict a system's relaxed energy in the OC20 dataset while ignoring the relative position of the adsorbate.
We find that while removing binding site information impairs accuracy as expected, modified models are able to predict relaxed energies with remarkably decent MAE.
arXiv Detail & Related papers (2023-10-10T14:57:04Z) - Enhanced physics-constrained deep neural networks for modeling vanadium
redox flow battery [62.997667081978825]
We propose an enhanced version of the physics-constrained deep neural network (PCDNN) approach to provide high-accuracy voltage predictions.
The ePCDNN can accurately capture the voltage response throughout the charge--discharge cycle, including the tail region of the voltage discharge curve.
arXiv Detail & Related papers (2022-03-03T19:56:24Z) - Machine learning models predict calculation outcomes with the
transferability necessary for computational catalysis [0.4063872661554894]
Virtual high throughput screening (VHTS) and machine learning (ML) have greatly accelerated the design of single-site transition-metal catalysts.
We show that a convolutional neural network that monitors geometry optimization on the fly can exploit its good performance and transferability for catalyst design.
We rationalize this superior model transferability to the use of on-the-fly electronic structure and geometric information generated from density functional theory calculations.
arXiv Detail & Related papers (2022-03-02T18:02:12Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.