Point transformer for protein structural heterogeneity analysis using CryoEM
- URL: http://arxiv.org/abs/2601.18713v1
- Date: Mon, 26 Jan 2026 17:38:52 GMT
- Title: Point transformer for protein structural heterogeneity analysis using CryoEM
- Authors: Muyuan Chen, Muchen Li, Renjie Liao,
- Abstract summary: Point Transformer is a self-attention network designed for point cloud analysis.<n>We characterize the dynamics of highly complex protein systems in a more human-interpretable way.
- Score: 14.1714744852828
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Structural dynamics of macromolecules is critical to their structural-function relationship. Cryogenic electron microscopy (CryoEM) provides snapshots of vitrified protein at different compositional and conformational states, and the structural heterogeneity of proteins can be characterized through computational analysis of the images. For protein systems with multiple degrees of freedom, it is still challenging to disentangle and interpret the different modes of dynamics. Here, by implementing Point Transformer, a self-attention network designed for point cloud analysis, we are able to improve the performance of heterogeneity analysis on CryoEM data, and characterize the dynamics of highly complex protein systems in a more human-interpretable way.
Related papers
- Mixture of neural fields for heterogeneous reconstruction in cryo-EM [29.837972881181102]
We present Hydra, an approach that models both conformational and compositional heterogeneity fully ab initio.<n>We employ a new likelihood-based loss function and demonstrate the effectiveness of our approach on synthetic datasets composed of mixtures of proteins.
arXiv Detail & Related papers (2024-12-12T16:26:38Z) - SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers [76.95505296417866]
Self-supervised training of language models (LMs) has seen great success for protein sequences in learning meaningful representations and for generative drug design.<n>Most protein LMs are based on the Transformer architecture trained on individual proteins with short context lengths.<n>In this work, we propose LC-PLM based on an alternative protein LM architecture, BiMamba-S, built upon selective structured state-space models.
arXiv Detail & Related papers (2024-10-29T16:43:28Z) - ProtSCAPE: Mapping the landscape of protein conformations in molecular dynamics [3.6181745062052495]
We introduce a novel deep learning architecture, Protein Transformer with Scattering, Attention, and Positional Embedding (ProtSCAPE)
ProtSCAPE utilizes the multi-scale nature of the geometric scattering transform to extract features from protein structures conceptualized as graphs.
It integrates these features with dual attention structures that focus on residues and amino acid signals, generating latent representations of protein trajectories.
arXiv Detail & Related papers (2024-10-27T02:59:48Z) - cryoSPHERE: Single-particle heterogeneous reconstruction from cryo EM [9.887133861477233]
Single-particle cryo-electron microscopy (cryo-EM) is a powerful tool for determining the structures of large protein complexes.<n>In this paper, we introduce cryoSPHERE, which is a deep learning method that uses a nominal protein structure as input.<n>We show that cryoSPHERE is very resilient to the high levels of noise typically encountered in experiments.
arXiv Detail & Related papers (2024-05-29T15:12:19Z) - Machine Learning Coarse-Grained Potentials of Protein Thermodynamics [1.3543803103181613]
We build coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics.
For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins.
The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems.
arXiv Detail & Related papers (2022-12-14T20:23:11Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Transfer Learning for Protein Structure Classification at Low Resolution [124.5573289131546]
We show that it is possible to make accurate ($geq$80%) predictions of protein class and architecture from structures determined at low ($leq$3A) resolution.
We provide proof of concept for high-speed, low-cost protein structure classification at low resolution, and a basis for extension to prediction of function.
arXiv Detail & Related papers (2020-08-11T15:01:32Z) - BERTology Meets Biology: Interpreting Attention in Protein Language
Models [124.8966298974842]
We demonstrate methods for analyzing protein Transformer models through the lens of attention.
We show that attention captures the folding structure of proteins, connecting amino acids that are far apart in the underlying sequence, but spatially close in the three-dimensional structure.
We also present a three-dimensional visualization of the interaction between attention and protein structure.
arXiv Detail & Related papers (2020-06-26T21:50:17Z) - Energy-based models for atomic-resolution protein conformations [88.68597850243138]
We propose an energy-based model (EBM) of protein conformations that operates at atomic scale.
The model is trained solely on crystallized protein data.
An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy.
arXiv Detail & Related papers (2020-04-27T20:45:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.