Completion of partial structures using Patterson maps with the CrysFormer machine learning model
- URL: http://arxiv.org/abs/2511.10440v1
- Date: Fri, 14 Nov 2025 01:51:31 GMT
- Title: Completion of partial structures using Patterson maps with the CrysFormer machine learning model
- Authors: Tom Pan, Evan Dramko, Mitchell D. Miller, Anastasios Kyrillidis, George N. Phillips,
- Abstract summary: We introduce an initial dataset of small protein fragments taken from Protein Data Bank entries.<n>We demonstrate that our method is effective at both improving the phases of the crystallographic structure factors and completing the regions missing from partial structure templates.
- Score: 7.123623187944631
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Protein structure determination has long been one of the primary challenges of structural biology, to which deep machine learning (ML)-based approaches have increasingly been applied. However, these ML models generally do not incorporate the experimental measurements directly, such as X-ray crystallographic diffraction data. To this end, we explore an approach that more tightly couples these traditional crystallographic and recent ML-based methods, by training a hybrid 3-d vision transformer and convolutional network on inputs from both domains. We make use of two distinct input constructs / Patterson maps, which are directly obtainable from crystallographic data, and ``partial structure'' template maps derived from predicted structures deposited in the AlphaFold Protein Structure Database with subsequently omitted residues. With these, we predict electron density maps that are then post-processed into atomic models through standard crystallographic refinement processes. Introducing an initial dataset of small protein fragments taken from Protein Data Bank entries and placing them in hypothetical crystal settings, we demonstrate that our method is effective at both improving the phases of the crystallographic structure factors and completing the regions missing from partial structure templates, as well as improving the agreement of the electron density maps with the ground truth atomic structures.
Related papers
- DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models [68.19129717255053]
We present DiffSpectra, a generative framework that formulates molecular structure elucidation as a conditional generation process.<n>Our experiments demonstrate that DiffSpectra accurately elucidates molecular structures, achieving 40.76% top-1 and 99.49% top-10 accuracy.
arXiv Detail & Related papers (2025-07-09T13:57:20Z) - STEM Diffraction Pattern Analysis with Deep Learning Networks [0.0]
This work presents a machine learning-based approach for predicting Euler angles directly from scanning transmission electron microscopy (STEM) diffraction patterns (DPs)<n>It enables the automated generation of high-resolution crystal orientation maps, facilitating the analysis of internal microstructures at the nanoscale.<n>Three deep learning architectures--convolutional neural networks (CNNs), Dense Convolutional Networks (DenseNets), and Shifted Windows (Swin) Transformers--are evaluated, using an experimentally acquired dataset labelled via a commercial TM algorithm.
arXiv Detail & Related papers (2025-07-02T16:58:09Z) - Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data [33.562685684224995]
cryo-electron microscopy (cryo-EM) has emerged as a powerful tool for imaging near-native structural heterogeneity.<n>Here, we combine cryo-EM density maps with the rich sequence and biophysical priors learned by protein structure prediction models.<n>Our method, CryoBoltz, guides the sampling trajectory of a pretrained protein structure prediction model using both global and local structural constraints.
arXiv Detail & Related papers (2025-06-04T22:16:27Z) - 3D variational autoencoder for fingerprinting microstructure volume elements [0.5892638927736115]
We present a 3D variational autoencoder (VAE) for encoding microstructure volume elements (VEs)<n>Crystal symmetries in the orientation space are accounted for by mapping to the crystallographic fundamental zone as a preprocessing step.<n>VAE is then used to encode a training set of VEs with an equiaxed polycrystalline microstructure with random texture.<n>We show that the model generalises well to microstructures with textures, grain sizes and aspect ratios outside the training distribution.
arXiv Detail & Related papers (2025-03-21T11:17:10Z) - RecCrysFormer: Refined Protein Structural Prediction from 3D Patterson Maps via Recycling Training Runs [7.642939155349805]
$textttRecCrysFormer$ is a hybrid model that exploits the strengths of transformers to integrate experimental and ML approaches to protein structure determination from crystallographic data.<n>We show that $textttRecCrysFormer$ achieves good accuracy in structural predictions and shows robustness against variations in crystal parameters, such as unit cell dimensions and angles.
arXiv Detail & Related papers (2025-02-28T19:40:09Z) - CrysFormer: Protein Structure Prediction via 3d Patterson Maps and
Partial Structure Attention [7.716601082662128]
A protein's three-dimensional structure often poses nontrivial computation costs.
We propose the first transformer-based model that directly utilizes protein crystallography and partial structure information.
We demonstrate our method, dubbed textttCrysFormer, can achieve accurate predictions, based on a much smaller dataset size and with reduced computation costs.
arXiv Detail & Related papers (2023-10-05T21:10:22Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Tracking perovskite crystallization via deep learning-based feature
detection on 2D X-ray scattering data [137.47124933818066]
We propose an automated pipeline for the analysis of X-ray diffraction images based on the Faster R-CNN deep learning architecture.
We demonstrate our method on real-time tracking of organic-inorganic perovskite structure crystallization and test it on two applications.
arXiv Detail & Related papers (2022-02-22T15:39:00Z) - Disentangling multiple scattering with deep learning: application to
strain mapping from electron diffraction patterns [48.53244254413104]
We implement a deep neural network called FCU-Net to invert highly nonlinear electron diffraction patterns into quantitative structure factor images.
We trained the FCU-Net using over 200,000 unique dynamical diffraction patterns which include many different combinations of crystal structures.
Our simulated diffraction pattern library, implementation of FCU-Net, and trained model weights are freely available in open source repositories.
arXiv Detail & Related papers (2022-02-01T03:53:39Z) - An End-to-End Framework for Molecular Conformation Generation via
Bilevel Programming [71.82571553927619]
We propose an end-to-end solution for molecular conformation prediction called ConfVAE.
Specifically, the molecular graph is first encoded in a latent space, and then the 3D structures are generated by solving a principled bilevel optimization program.
arXiv Detail & Related papers (2021-05-15T15:22:29Z) - tFold-TR: Combining Deep Learning Enhanced Hybrid Potential Energy for
Template-Based Modelling Structure Refinement [53.98034511648985]
The current template-based modeling approach suffers from two important problems.
The accuracy of the distance pairs from different regions of the template varies, and this information is not well introduced into the modeling.
Two neural network models predict the distance information of the missing regions and the accuracy of the distance pairs of different regions in the template modeling structure.
arXiv Detail & Related papers (2021-05-10T13:32:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.