Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning
- URL: http://arxiv.org/abs/2202.00658v1
- Date: Tue, 1 Feb 2022 18:54:24 GMT
- Title: Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning
- Authors: Daniel Flam-Shepherd, Alexander Zhigalin and Al\'an Aspuru-Guzik
- Abstract summary: We introduce a novel framework for scalable 3D design that uses a hierarchical agent to build molecules.
In a variety of experiments, we show that our agent, guided only by energy considerations, can efficiently learn to produce molecules with over 100 atoms.
- Score: 68.8204255655161
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning has the potential to automate molecular design and
drastically accelerate the discovery of new functional compounds. Towards this
goal, generative models and reinforcement learning (RL) using string and graph
representations have been successfully used to search for novel molecules.
However, these approaches are limited since their representations ignore the
three-dimensional (3D) structure of molecules. In fact, geometry plays an
important role in many applications in inverse molecular design, especially in
drug discovery. Thus, it is important to build models that can generate
molecular structures in 3D space based on property-oriented geometric
constraints. To address this, one approach is to generate molecules as 3D point
clouds by sequentially placing atoms at locations in space -- this allows the
process to be guided by physical quantities such as energy or other properties.
However, this approach is inefficient as placing individual atoms makes the
exploration unnecessarily deep, limiting the complexity of molecules that can
be generated. Moreover, when optimizing a molecule, organic and medicinal
chemists use known fragments and functional groups, not single atoms. We
introduce a novel RL framework for scalable 3D design that uses a hierarchical
agent to build molecules by placing molecular substructures sequentially in 3D
space, thus attempting to build on the existing human knowledge in the field of
molecular design. In a variety of experiments with different substructures, we
show that our agent, guided only by energy considerations, can efficiently
learn to produce molecules with over 100 atoms from many distributions
including drug-like molecules, organic LED molecules, and biomolecules.
Related papers
- UniIF: Unified Molecule Inverse Folding [67.60267592514381]
We propose a unified model UniIF for inverse folding of all molecules.
Our proposed method surpasses state-of-the-art methods on all tasks.
arXiv Detail & Related papers (2024-05-29T10:26:16Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding [51.970607704953096]
Previous works usually generate atoms in an auto-regressive way, where element types and 3D coordinates of atoms are generated one by one.
In real-world molecular systems, the interactions among atoms in an entire molecule are global, leading to the energy function pair-coupled among atoms.
In this work, a generative diffusion model for molecular 3D structures based on target proteins is established, at a full-atom level in a non-autoregressive way.
arXiv Detail & Related papers (2022-11-21T07:02:15Z) - A 3D Molecule Generative Model for Structure-Based Drug Design [18.29582138009123]
We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites.
We propose a 3D generative model that generates molecules given a designated 3D protein binding site.
arXiv Detail & Related papers (2022-03-20T03:54:47Z) - Learning to design drug-like molecules in three-dimensional space using
deep generative models [0.0]
Ligand Neural Network (L-Net) is a novel graph generative model for designing drug-like molecules with high-quality 3D structures.
L-Net is capable of generating chemically correct, conformationally valid, and highly druglike molecules.
arXiv Detail & Related papers (2021-04-17T07:30:23Z) - Learning a Continuous Representation of 3D Molecular Structures with
Deep Generative Models [0.0]
Generative models are an entirely different approach that learn to represent and optimize molecules in a continuous latent space.
We describe deep generative models of three dimensional molecular structures using atomic density grids.
We are also able to sample diverse sets of molecules based on a given input compound to increase the probability of creating valid, drug-like molecules.
arXiv Detail & Related papers (2020-10-17T01:15:47Z) - Reinforcement Learning for Molecular Design Guided by Quantum Mechanics [10.112779201155005]
We present a novel RL formulation for molecular design in coordinates, thereby extending the class of molecules that can be built.
Our reward function is directly based on fundamental physical properties such as the energy, which we approximate via fast quantum-chemical methods.
In our experiments, we show that our agent can efficiently learn to solve these tasks from scratch by working in a translation and rotation invariant state-action space.
arXiv Detail & Related papers (2020-02-18T16:43:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.