Artificial Intelligence Advances for De Novo Molecular Structure
Modeling in Cryo-EM
- URL: http://arxiv.org/abs/2102.06125v1
- Date: Thu, 11 Feb 2021 17:06:20 GMT
- Title: Artificial Intelligence Advances for De Novo Molecular Structure
Modeling in Cryo-EM
- Authors: Dong Si, Andrew Nakamura, Runbang Tang, Haowen Guan, Jie Hou, Ammaar
Firozi, Renzhi Cao, Kyle Hippe, Minglei Zhao
- Abstract summary: cryo-electron microscopy (cryo-EM) has become a major experimental technology to determine the structures of large protein complexes and molecular assemblies.
Machine learning (ML) and deep learning (DL) has ranked among the top-performing methods in macromolecular structure modeling.
This review provides an introductory guide to modern research on artificial intelligence for de novo molecular structure modeling.
- Score: 1.6301630538569722
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Cryo-electron microscopy (cryo-EM) has become a major experimental technology
to determine the structures of large protein complexes and molecular
assemblies, as evidenced by the 2017 Nobel Prize. Although cryo-EM has been
drastically improved to generate high-resolution three-dimensional (3D) maps
that contain detailed structural information about macromolecules, the
computational methods for using the data to automatically build structure
models are lagging far behind. Traditional cryo-EM model building approach is
template-based homology modeling. Manual de novo modeling is very
time-consuming when no template model could be found in the database. In recent
years, de novo cryo-EM modeling using machine learning (ML) and deep learning
(DL) has ranked among the top-performing methods in macromolecular structure
modeling. Deep-learning-based de novo cryo-EM modeling is an important
application of artificial intelligence, with impressive results and great
potential for the next generation of molecular biomedicine. Accordingly, we
systematically review the representative ML/DL-based de novo cryo-EM modeling
methods. And their significances are discussed from both practical and
methodological viewpoints. We also briefly describe the background of cryo-EM
data processing workflow. Overall, this review provides an introductory guide
to modern research on artificial intelligence (AI) for de novo molecular
structure modeling and future directions in this emerging field.
Related papers
- Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - CryoFM: A Flow-based Foundation Model for Cryo-EM Densities [50.291974465864364]
We present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps.
Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps.
arXiv Detail & Related papers (2024-10-11T08:53:58Z) - Generative Modeling of Molecular Dynamics Trajectories [12.255021091552441]
We introduce generative modeling of molecular trajectories as a paradigm for learning flexible multi-task surrogate models of MD from data.
We show such generative models can be adapted to diverse tasks such as forward simulation, transition path sampling, and trajectory upsampling.
arXiv Detail & Related papers (2024-09-26T13:02:28Z) - LLamol: A Dynamic Multi-Conditional Generative Transformer for De Novo
Molecular Design [0.0]
"LLamol" is a single novel generative transformer model based on the LLama 2 architecture.
We demonstrate that the model adeptly handles single- and multi-conditional organic molecule generation with up to four conditions.
In detail, we showcase the model's capability to utilize token sequences for conditioning, either individually or in combination with numerical properties.
arXiv Detail & Related papers (2023-11-24T10:59:12Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Deep Generative Modeling for Volume Reconstruction in Cryo-Electron
Microscopy [4.014524824655106]
New breakthroughs in high resolution imaging of biomolecules in solution with cryo-electron microscopy (cryo-EM) have unlocked new doors for the reconstruction of molecular volumes.
Despite significant headway, the immense challenges in cryo-EM data analysis remain legion and intricately inter-disciplinary in nature.
Next-generation volume reconstruction algorithms that combine generative modeling with end-to-end unsupervised deep learning techniques have shown promising results on simulated data.
arXiv Detail & Related papers (2022-01-08T17:45:18Z) - Augmenting Molecular Deep Generative Models with Topological Data
Analysis Representations [21.237758981760784]
We present a SMILES Variational Auto-Encoder (VAE) augmented with topological data analysis (TDA) representations of molecules.
Our experiments show that this TDA augmentation enables a SMILES VAE to capture the complex relation between 3D geometry and electronic properties.
arXiv Detail & Related papers (2021-06-08T15:49:21Z) - Learning Neural Generative Dynamics for Molecular Conformation
Generation [89.03173504444415]
We study how to generate molecule conformations (textiti.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z) - Conditional Constrained Graph Variational Autoencoders for Molecule
Design [70.59828655929194]
We present Conditional Constrained Graph Variational Autoencoder (CCGVAE), a model that implements this key-idea in a state-of-the-art model.
We show improved results on several evaluation metrics on two commonly adopted datasets for molecule generation.
arXiv Detail & Related papers (2020-09-01T21:58:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.