Phenotypic Profile-Informed Generation of Drug-Like Molecules via Dual-Channel Variational Autoencoders
- URL: http://arxiv.org/abs/2506.02051v1
- Date: Sun, 01 Jun 2025 07:46:39 GMT
- Title: Phenotypic Profile-Informed Generation of Drug-Like Molecules via Dual-Channel Variational Autoencoders
- Authors: Hui Liu, Shiye Tian, Xuejun Liu,
- Abstract summary: SmilesGEN is a novel generative model based on variational autoencoder (VAE) architecture to generate molecules with potential therapeutic effects.<n>SmilesGEN integrates a pre-trained drug VAE with an expression profile VAE (ProfileNet) to generate drug-like molecules.<n>Our experiments demonstrate that SmilesGEN outperforms current state-of-the-art models in generating molecules with higher degree of validity, uniqueness, novelty, as well as higher Tanimoto similarity to known targeting the relevant proteins.
- Score: 4.474508237015231
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The de novo generation of drug-like molecules capable of inducing desirable phenotypic changes is receiving increasing attention. However, previous methods predominantly rely on expression profiles to guide molecule generation, but overlook the perturbative effect of the molecules on cellular contexts. To overcome this limitation, we propose SmilesGEN, a novel generative model based on variational autoencoder (VAE) architecture to generate molecules with potential therapeutic effects. SmilesGEN integrates a pre-trained drug VAE (SmilesNet) with an expression profile VAE (ProfileNet), jointly modeling the interplay between drug perturbations and transcriptional responses in a common latent space. Specifically, ProfileNet is imposed to reconstruct pre-treatment expression profiles when eliminating drug-induced perturbations in the latent space, while SmilesNet is informed by desired expression profiles to generate drug-like molecules. Our empirical experiments demonstrate that SmilesGEN outperforms current state-of-the-art models in generating molecules with higher degree of validity, uniqueness, novelty, as well as higher Tanimoto similarity to known ligands targeting the relevant proteins. Moreover, we evaluate SmilesGEN for scaffold-based molecule optimization and generation of therapeutic agents, and confirmed its superior performance in generating molecules with higher similarity to approved drugs. SmilesGEN establishes a robust framework that leverages gene signatures to generate drug-like molecules that hold promising potential to induce desirable cellular phenotypic changes.
Related papers
- Improved Molecular Generation through Attribute-Driven Integrative Embeddings and GAN Selectivity [0.0]
This paper introduces a transformer-based vector embedding generator combined with a modified Generative Adrialversa Network (GAN) to generate molecules with desired properties.<n>The embedding generator utilizes a novel molecular descriptor, integrating Morgan fingerprints with global molecular attributes.<n>The approach is validated by generating novel odorant molecules using a labeled dataset of odorant and non-odorant compounds.
arXiv Detail & Related papers (2025-04-26T22:15:25Z) - GenMol: A Drug Discovery Generalist with Discrete Diffusion [43.29814519270451]
Generalist Molecular generative model (GenMol) is a versatile framework that uses only a single discrete diffusion model to handle diverse drug discovery scenarios.<n>GenMol generates Sequential Attachment-based Fragment Embedding sequences through non-autoregressive bidirectional parallel decoding.
arXiv Detail & Related papers (2025-01-10T18:30:05Z) - De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning [3.9518122220368905]
We propose a hybrid neural network, HNN2Mol, to generate new molecules with potential bioactivities and drug-like properties.<n> Experimental results and case studies demonstrate that the proposed HNN2Mol model can produce new molecules with potential bioactivities and drug-like properties.
arXiv Detail & Related papers (2024-12-27T03:16:56Z) - Fragment-Masked Diffusion for Molecular Optimization [71.13210858056527]
We propose a fragment-masked molecular optimization method based on phenotypic drug discovery (PDD)<n>PDD-based molecular optimization can reduce potential safety risks while optimizing phenotypic activity, thereby increasing the likelihood of clinical success.<n>The overall experiments demonstrate that the in-silico optimization success rate reaches 95.4%, with an average efficacy increase of 7.5%.
arXiv Detail & Related papers (2024-08-17T06:00:58Z) - Cell Morphology-Guided Small Molecule Generation with GFlowNets [41.8027680592766]
We propose an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets.
The proposed model learns to generate new molecules that could produce phenotypic effects similar to those of the given image target.
We demonstrate that the proposed method generates molecules with high morphological and structural similarity to the target, increasing the likelihood of similar biological activity.
arXiv Detail & Related papers (2024-08-09T17:40:35Z) - Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
AliDiff is a novel framework to align pretrained target diffusion models with preferred functional properties.
It can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - SILVR: Guided Diffusion for Molecule Generation [0.0]
We introduce a machine-learning method for conditioning an existing generative model without retraining.
The model allows the generation of new molecules that fit into a binding site of a protein based on fragment hits.
We show that moderate SILVR rates make it possible to generate new molecules of similar shape to the original fragments.
arXiv Detail & Related papers (2023-04-21T11:47:38Z) - Exploring Chemical Space with Score-based Out-of-distribution Generation [57.15855198512551]
We propose a score-based diffusion scheme that incorporates out-of-distribution control in the generative differential equation (SDE)
Since some novel molecules may not meet the basic requirements of real-world drugs, MOOD performs conditional generation by utilizing the gradients from a property predictor.
We experimentally validate that MOOD is able to explore the chemical space beyond the training distribution, generating molecules that outscore ones found with existing methods, and even the top 0.01% of the original training pool.
arXiv Detail & Related papers (2022-06-06T06:17:11Z) - Improved Drug-target Interaction Prediction with Intermolecular Graph
Transformer [98.8319016075089]
We propose a novel approach to model intermolecular information with a three-way Transformer-based architecture.
Intermolecular Graph Transformer (IGT) outperforms state-of-the-art approaches by 9.1% and 20.5% over the second best for binding activity and binding pose prediction respectively.
IGT exhibits promising drug screening ability against SARS-CoV-2 by identifying 83.1% active drugs that have been validated by wet-lab experiments with near-native predicted binding poses.
arXiv Detail & Related papers (2021-10-14T13:28:02Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.