Direct Molecular Conformation Generation
- URL: http://arxiv.org/abs/2202.01356v1
- Date: Thu, 3 Feb 2022 01:01:58 GMT
- Title: Direct Molecular Conformation Generation
- Authors: Jinhua Zhu, Yingce Xia, Chang Liu, Lijun Wu, Shufang Xie, Tong Wang,
Yusong Wang, Wengang Zhou, Tao Qin, Houqiang Li, Tie-Yan Liu
- Abstract summary: We propose a method that directly predicts the coordinates of atoms.
Our method achieves state-of-the-art results on four public benchmarks.
- Score: 217.4815525740703
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Molecular conformation generation aims to generate three-dimensional
coordinates of all the atoms in a molecule and is an important task in
bioinformatics and pharmacology. Previous distance-based methods first predict
interatomic distances and then generate conformations based on them, which
could result in conflicting distances. In this work, we propose a method that
directly predicts the coordinates of atoms. We design a dedicated loss function
for conformation generation, which is invariant to roto-translation of
coordinates of conformations and permutation of symmetric atoms in molecules.
We further design a backbone model that stacks multiple blocks, where each
block refines the conformation generated by its preceding block. Our method
achieves state-of-the-art results on four public benchmarks: on small-scale
GEOM-QM9 and GEOM-Drugs which have $200$K training data, we can improve the
previous best matching score by $3.5\%$ and $28.9\%$; on large-scale GEOM-QM9
and GEOM-Drugs which have millions of training data, those two improvements are
$47.1\%$ and $36.3\%$. This shows the effectiveness of our method and the great
potential of the direct approach. Our code is released at
\url{https://github.com/DirectMolecularConfGen/DMCG}.
Related papers
- A Group Symmetric Stochastic Differential Equation Model for Molecule
Multi-modal Pretraining [36.48602272037559]
molecule pretraining has quickly become the go-to schema to boost the performance of AI-based drug discovery.
Here, we propose MoleculeSDE to generate the 3D reflection from 2D topologies, and vice versa, directly in the input space.
By comparing with 17 pretraining baselines, we empirically verify that MoleculeSDE can learn an expressive representation with state-of-the-art performance on 26 out of 32 downstream tasks.
arXiv Detail & Related papers (2023-05-28T15:56:02Z) - MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation [47.15291538945242]
This work introduces MiDi, a novel diffusion model for jointly generating molecular graphs and their corresponding 3D arrangement of atoms.
Unlike existing methods that rely on predefined rules to determine molecular bonds based on the 3D conformation, MiDi offers an end-to-end differentiable approach that streamlines the molecule generation process.
arXiv Detail & Related papers (2023-02-17T18:27:14Z) - Heterogenous Ensemble of Models for Molecular Property Prediction [55.91865861896012]
We propose a method for considering different modalities on molecules.
We ensemble these models with a HuberRegressor.
This yields a winning solution to the 2textsuperscriptnd edition of the OGB Large-Scale Challenge (2022)
arXiv Detail & Related papers (2022-11-20T17:25:26Z) - Unified 2D and 3D Pre-Training of Molecular Representations [237.36667670201473]
We propose a new representation learning method based on a unified 2D and 3D pre-training.
Atom coordinates and interatomic distances are encoded and then fused with atomic representations through graph neural networks.
Our method achieves state-of-the-art results on 10 tasks, and the average improvement on 2D-only tasks is 8.3%.
arXiv Detail & Related papers (2022-07-14T11:36:56Z) - Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular
Graphs [79.06686274377009]
We develop a benchmark, known as Molecule3D, that includes a dataset with precise ground-state geometries of approximately 4 million molecules.
We implement two baseline methods that either predict the pairwise distance between atoms or atom coordinates in 3D space.
Our method can achieve comparable prediction accuracy but with much smaller computational costs.
arXiv Detail & Related papers (2021-09-30T22:09:28Z) - GeoMol: Torsional Geometric Generation of Molecular 3D Conformer
Ensembles [60.12186997181117]
Prediction of a molecule's 3D conformer ensemble from the molecular graph holds a key role in areas of cheminformatics and drug discovery.
Existing generative models have several drawbacks including lack of modeling important molecular geometry elements.
We propose GeoMol, an end-to-end, non-autoregressive and SE(3)-invariant machine learning approach to generate 3D conformers.
arXiv Detail & Related papers (2021-06-08T14:17:59Z) - An End-to-End Framework for Molecular Conformation Generation via
Bilevel Programming [71.82571553927619]
We propose an end-to-end solution for molecular conformation prediction called ConfVAE.
Specifically, the molecular graph is first encoded in a latent space, and then the 3D structures are generated by solving a principled bilevel optimization program.
arXiv Detail & Related papers (2021-05-15T15:22:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.