Dual-Space Optimization: Improved Molecule Sequence Design by Latent
Prompt Transformer
- URL: http://arxiv.org/abs/2402.17179v1
- Date: Tue, 27 Feb 2024 03:33:23 GMT
- Title: Dual-Space Optimization: Improved Molecule Sequence Design by Latent
Prompt Transformer
- Authors: Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu,
Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian
Wu
- Abstract summary: We propose the Dual-Space Optimization (DSO) method that integrates latent space sampling and data space selection to solve this problem.
DSO iteratively updates a latent space generative model and a synthetic dataset in an optimization process that shifts the generative model and the synthetic data towards regions of desired property values.
Our experiments demonstrate effectiveness of the proposed method, which sets new performance benchmarks across single-objective, multi-objective and constrained molecule design tasks.
- Score: 78.47949363282868
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Designing molecules with desirable properties, such as drug-likeliness and
high binding affinities towards protein targets, is a challenging problem. In
this paper, we propose the Dual-Space Optimization (DSO) method that integrates
latent space sampling and data space selection to solve this problem. DSO
iteratively updates a latent space generative model and a synthetic dataset in
an optimization process that gradually shifts the generative model and the
synthetic data towards regions of desired property values. Our generative model
takes the form of a Latent Prompt Transformer (LPT) where the latent vector
serves as the prompt of a causal transformer. Our extensive experiments
demonstrate effectiveness of the proposed method, which sets new performance
benchmarks across single-objective, multi-objective and constrained molecule
design tasks.
Related papers
- Conditional Synthesis of 3D Molecules with Time Correction Sampler [58.0834973489875]
Time-Aware Conditional Synthesis (TACS) is a novel approach to conditional generation on diffusion models.
It integrates adaptively controlled plug-and-play "online" guidance into a diffusion model, driving samples toward the desired properties.
arXiv Detail & Related papers (2024-11-01T12:59:25Z) - Molecule Design by Latent Prompt Transformer [61.68502207071992]
This paper proposes a latent prompt Transformer model for solving challenging optimization problems such as molecule design.
The goal is to find molecules with optimal values of a target chemical or biological property that can be computed by an existing software.
arXiv Detail & Related papers (2023-10-05T02:09:51Z) - Variational Autoencoding Molecular Graphs with Denoising Diffusion
Probabilistic Model [0.0]
We propose a novel deep generative model that incorporates a hierarchical structure into the probabilistic latent vectors.
We demonstrate that our model can design effective molecular latent vectors for molecular property prediction from some experiments by small datasets on physical properties and activity.
arXiv Detail & Related papers (2023-07-02T17:29:41Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Controlled Molecule Generator for Optimizing Multiple Chemical
Properties [9.10095508718581]
We propose a new optimized molecule generator model based on the Transformer with two constraint networks.
Experiments demonstrate that our proposed model outperforms state-of-the-art models by a significant margin for optimizing multiple properties simultaneously.
arXiv Detail & Related papers (2020-10-26T21:26:14Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z) - Goal-directed Generation of Discrete Structures with Conditional
Generative Models [85.51463588099556]
We introduce a novel approach to directly optimize a reinforcement learning objective, maximizing an expected reward.
We test our methodology on two tasks: generating molecules with user-defined properties and identifying short python expressions which evaluate to a given target value.
arXiv Detail & Related papers (2020-10-05T20:03:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.