Boltzmann Graph Ensemble Embeddings for Aptamer Libraries
- URL: http://arxiv.org/abs/2510.21980v1
- Date: Fri, 24 Oct 2025 19:13:36 GMT
- Title: Boltzmann Graph Ensemble Embeddings for Aptamer Libraries
- Authors: Starlika Bauskar, Jade Jiao, Narayanan Kannan, Alexander Kimm, Justin M. Baker, Matthew J. Tyler, Andrea L. Bertozzi, Anne M. Andrews,
- Abstract summary: Machine-learning methods in biochemistry commonly represent molecules as graphs of pairwise intermolecular interactions for property and structure predictions.<n>We introduce a thermodynamically parameterized exponential-family random graph (ERGM) embedding that models molecules as Boltzmann-weighted ensembles of interaction graphs.<n>We show that the proposed embedding enables robust community detection and subgraph-level explanations for aptamer affinity, even in the presence of biased observations.
- Score: 37.52407391187203
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine-learning methods in biochemistry commonly represent molecules as graphs of pairwise intermolecular interactions for property and structure predictions. Most methods operate on a single graph, typically the minimal free energy (MFE) structure, for low-energy ensembles (conformations) representative of structures at thermodynamic equilibrium. We introduce a thermodynamically parameterized exponential-family random graph (ERGM) embedding that models molecules as Boltzmann-weighted ensembles of interaction graphs. We evaluate this embedding on SELEX datasets, where experimental biases (e.g., PCR amplification or sequencing noise) can obscure true aptamer-ligand affinity, producing anomalous candidates whose observed abundance diverges from their actual binding strength. We show that the proposed embedding enables robust community detection and subgraph-level explanations for aptamer ligand affinity, even in the presence of biased observations. This approach may be used to identify low-abundance aptamer candidates for further experimental evaluation.
Related papers
- M-GLC: Motif-Driven Global-Local Context Graphs for Few-shot Molecular Property Prediction [17.236893652698196]
Few-shot Molecular property prediction (FSMPP) is a cornerstone of drug discovery and materials science.<n>We propose a comprehensive solution: Motif Driven Global-Local Context Graph.<n>At the global level, chemically meaningful motif nodes representing shared substructures are introduced to form a global tri-partite heterogeneous graph.<n>At the local level, we build a subgraph for each node in the molecule-property pair and encode them separately to concentrate the model's attention on the most informative neighboring molecules.
arXiv Detail & Related papers (2025-10-24T02:00:41Z) - Torsional-GFN: a conditional conformation generator for small molecules [75.91029322687771]
We introduce a conditional GFlowNet specifically designed to sample conformations of molecules proportionally to their Boltzmann distribution.<n>Our work presents a promising avenue for scaling the proposed approach to larger molecular systems.
arXiv Detail & Related papers (2025-07-15T21:53:25Z) - Graph Multi-Similarity Learning for Molecular Property Prediction [13.214794693837785]
We introduce Graph Multi-Similarity Learning for Molecular Property Prediction (GraphMSL) framework.
We first define a self-similarity metric (i.e., similarity between an anchor molecule and another molecule), and then transform it into a generalized multi-similarity metric for the anchor through a pair weighting function.
GraphMSL validates the efficacy of the multi-similarity metric across MoleculeNet datasets.
arXiv Detail & Related papers (2024-01-31T05:59:38Z) - Graph Diffusion Transformers for Multi-Conditional Molecular Generation [16.58392955245203]
We present the Graph Diffusion Transformer (Graph DiT) for multi-conditional molecular generation.
Graph DiT integrates an encoder to learn numerical and categorical property representations with the Transformer-based denoiser.
Results demonstrate the superiority of Graph DiT across nine metrics from distribution learning to condition control for molecular properties.
arXiv Detail & Related papers (2024-01-24T23:45:31Z) - SE(3)-Invariant Multiparameter Persistent Homology for Chiral-Sensitive
Molecular Property Prediction [1.534667887016089]
We present a novel method for generating molecular fingerprints using multi parameter persistent homology (MPPH)
This technique holds considerable significance for drug discovery and materials science, where precise molecular property prediction is vital.
We demonstrate its superior performance over existing state-of-the-art methods in predicting molecular properties through extensive evaluations on the MoleculeNet benchmark.
arXiv Detail & Related papers (2023-12-12T09:33:54Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations [68.32093648671496]
We introduce GODE, which accounts for the dual-level structure inherent in molecules.<n> Molecules possess an intrinsic graph structure and simultaneously function as nodes within a broader molecular knowledge graph.<n>By pre-training two GNNs on different graph structures, GODE effectively fuses molecular structures with their corresponding knowledge graph substructures.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - Conditional Graph Information Bottleneck for Molecular Relational
Learning [9.56625683182106]
We propose a novel relational learning framework, CGIB, that predicts the interaction behavior between a pair of graphs by detecting core subgraphs therein.
Our proposed method mimics the nature of chemical reactions, i.e., the core substructure of a molecule varies depending on which other molecule it interacts with.
arXiv Detail & Related papers (2023-04-29T01:17:43Z) - ASGN: An Active Semi-supervised Graph Neural Network for Molecular
Property Prediction [61.33144688400446]
We propose a novel framework called Active Semi-supervised Graph Neural Network (ASGN) by incorporating both labeled and unlabeled molecules.
In the teacher model, we propose a novel semi-supervised learning method to learn general representation that jointly exploits information from molecular structure and molecular distribution.
At last, we proposed a novel active learning strategy in terms of molecular diversities to select informative data during the whole framework learning.
arXiv Detail & Related papers (2020-07-07T04:22:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.