Automated 3D Pre-Training for Molecular Property Prediction
- URL: http://arxiv.org/abs/2306.07812v2
- Date: Sun, 2 Jul 2023 13:03:27 GMT
- Title: Automated 3D Pre-Training for Molecular Property Prediction
- Authors: Xu Wang and Huan Zhao and Weiwei Tu and Quanming Yao
- Abstract summary: We propose a novel 3D pre-training framework (dubbed 3D PGT)
It pre-trains a model on 3D molecular graphs, and then fine-tunes it on molecular graphs without 3D structures.
Extensive experiments on 2D molecular graphs are conducted to demonstrate the accuracy, efficiency and generalization ability of the proposed 3D PGT.
- Score: 54.15788181794094
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Molecular property prediction is an important problem in drug discovery and
materials science. As geometric structures have been demonstrated necessary for
molecular property prediction, 3D information has been combined with various
graph learning methods to boost prediction performance. However, obtaining the
geometric structure of molecules is not feasible in many real-world
applications due to the high computational cost. In this work, we propose a
novel 3D pre-training framework (dubbed 3D PGT), which pre-trains a model on 3D
molecular graphs, and then fine-tunes it on molecular graphs without 3D
structures. Based on fact that bond length, bond angle, and dihedral angle are
three basic geometric descriptors corresponding to a complete molecular 3D
conformer, we first develop a multi-task generative pre-train framework based
on these three attributes. Next, to automatically fuse these three generative
tasks, we design a surrogate metric using the \textit{total energy} to search
for weight distribution of the three pretext task since total energy
corresponding to the quality of 3D conformer.Extensive experiments on 2D
molecular graphs are conducted to demonstrate the accuracy, efficiency and
generalization ability of the proposed 3D PGT compared to various pre-training
baselines.
Related papers
- A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems [87.30652640973317]
Recent advances in computational modelling of atomic systems represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space.
Geometric Graph Neural Networks have emerged as the preferred machine learning architecture powering applications ranging from protein structure prediction to molecular simulations and material generation.
This paper provides a comprehensive and self-contained overview of the field of Geometric GNNs for 3D atomic systems.
arXiv Detail & Related papers (2023-12-12T18:44:19Z) - Geometry-aware Line Graph Transformer Pre-training for Molecular
Property Prediction [4.598522704308923]
Geometry-aware line graph transformer (Galformer) pre-training is a novel self-supervised learning framework.
Galformer consistently outperforms all baselines on both classification and regression tasks.
arXiv Detail & Related papers (2023-09-01T14:20:48Z) - 3D Molecular Geometry Analysis with 2D Graphs [79.47097907673877]
Ground-state 3D geometries of molecules are essential for many molecular analysis tasks.
Modern quantum mechanical methods can compute accurate 3D geometries but are computationally prohibitive.
We propose a novel deep learning framework to predict 3D geometries from molecular graphs.
arXiv Detail & Related papers (2023-05-01T19:00:46Z) - Geometry-Complete Diffusion for 3D Molecule Generation and Optimization [3.8366697175402225]
We introduce the Geometry-Complete Diffusion Model (GCDM) for 3D molecule generation.
GCDM outperforms existing 3D molecular diffusion models by significant margins across conditional and unconditional settings.
We also show that GCDM's geometric features can be repurposed to consistently optimize the geometry and chemical composition of existing 3D molecules.
arXiv Detail & Related papers (2023-02-08T20:01:51Z) - Unified 2D and 3D Pre-Training of Molecular Representations [237.36667670201473]
We propose a new representation learning method based on a unified 2D and 3D pre-training.
Atom coordinates and interatomic distances are encoded and then fused with atomic representations through graph neural networks.
Our method achieves state-of-the-art results on 10 tasks, and the average improvement on 2D-only tasks is 8.3%.
arXiv Detail & Related papers (2022-07-14T11:36:56Z) - Non-equilibrium molecular geometries in graph neural networks [2.6040244706888998]
Graph neural networks have become a powerful framework for learning complex structure-property relationships.
Recently proposed methods have demonstrated that using 3D geometry information of the molecule along with the bonding structure can lead to more accurate prediction on a wide range of properties.
arXiv Detail & Related papers (2022-03-07T20:20:52Z) - 3D Infomax improves GNNs for Molecular Property Prediction [1.9703625025720701]
We propose pre-training a model to reason about the geometry of molecules given only their 2D molecular graphs.
We show that 3D pre-training provides significant improvements for a wide range of properties.
arXiv Detail & Related papers (2021-10-08T13:30:49Z) - Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular
Graphs [79.06686274377009]
We develop a benchmark, known as Molecule3D, that includes a dataset with precise ground-state geometries of approximately 4 million molecules.
We implement two baseline methods that either predict the pairwise distance between atoms or atom coordinates in 3D space.
Our method can achieve comparable prediction accuracy but with much smaller computational costs.
arXiv Detail & Related papers (2021-09-30T22:09:28Z) - GeoMol: Torsional Geometric Generation of Molecular 3D Conformer
Ensembles [60.12186997181117]
Prediction of a molecule's 3D conformer ensemble from the molecular graph holds a key role in areas of cheminformatics and drug discovery.
Existing generative models have several drawbacks including lack of modeling important molecular geometry elements.
We propose GeoMol, an end-to-end, non-autoregressive and SE(3)-invariant machine learning approach to generate 3D conformers.
arXiv Detail & Related papers (2021-06-08T14:17:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.