PhenoMoler: Phenotype-Guided Molecular Optimization via Chemistry Large Language Model
- URL: http://arxiv.org/abs/2509.21424v1
- Date: Thu, 25 Sep 2025 09:37:19 GMT
- Title: PhenoMoler: Phenotype-Guided Molecular Optimization via Chemistry Large Language Model
- Authors: Ran Song, Hui Liu,
- Abstract summary: PhenoMoler generates chemically valid, novel, and diverse molecules aligned with desired phenotypic profiles.<n>Compared to FDA-approved drugs, the generated compounds exhibit comparable or enhanced drug-likeness (QED), optimized physicochemical properties, and superior binding affinity to key cancer targets.
- Score: 9.301996807475282
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Current molecular generative models primarily focus on improving drug-target binding affinity and specificity, often neglecting the system-level phenotypic effects elicited by compounds. Transcriptional profiles, as molecule-level readouts of drug-induced phenotypic shifts, offer a powerful opportunity to guide molecular design in a phenotype-aware manner. We present PhenoMoler, a phenotype-guided molecular generation framework that integrates a chemistry large language model with expression profiles to enable biologically informed drug design. By conditioning the generation on drug-induced differential expression signatures, PhenoMoler explicitly links transcriptional responses to chemical structure. By selectively masking and reconstructing specific substructures-scaffolds, side chains, or linkers-PhenoMoler supports fine-grained, controllable molecular optimization. Extensive experiments demonstrate that PhenoMoler generates chemically valid, novel, and diverse molecules aligned with desired phenotypic profiles. Compared to FDA-approved drugs, the generated compounds exhibit comparable or enhanced drug-likeness (QED), optimized physicochemical properties, and superior binding affinity to key cancer targets. These findings highlight PhenoMoler's potential for phenotype-guided and structure-controllable molecular optimization.
Related papers
- ExMolRL: Phenotype-Target Joint Generation of De Novo Molecules via Multi-Objective Reinforcement Learning [4.998189068886174]
ExMoIRL is a novel generative framework that integrates phenotypic and target-specific cues for de novo molecular generation.<n>It fuses docking affinity and drug-likeness scores, augmented with ranking loss, prior-likelihood regularization, and entropy.<n>Extensive experiments demonstrate ExMoIRL's superior performance over state-of-the-art-based and target-based models.
arXiv Detail & Related papers (2025-09-25T11:13:24Z) - Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction [0.0]
This work proposes developing molecule representations using the concept of Functional Groups (FG) in chemistry.<n>We introduce the Functional Group Representation (FGR) framework, a novel approach to encoding molecules based on their fundamental chemical substructures.<n>We demonstrate that the FGR framework achieves state-of-the-art performance on a diverse range of 33 benchmark datasets.
arXiv Detail & Related papers (2025-09-11T17:01:31Z) - Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design [0.0]
We present PharmaDiff, a pharmacophore-conditioned diffusion model for 3D molecular generation.<n>PharmaDiff employs a transformer-based architecture to integrate an atom-based representation of the 3D pharmacophore into the generative process.
arXiv Detail & Related papers (2025-05-15T17:54:29Z) - Knowledge-aware contrastive heterogeneous molecular graph learning [77.94721384862699]
We propose a paradigm shift by encoding molecular graphs into Heterogeneous Molecular Graph Learning (KCHML)<n>KCHML conceptualizes molecules through three distinct graph views-molecular, elemental, and pharmacological-enhanced by heterogeneous molecular graphs and a dual message-passing mechanism.<n>This design offers a comprehensive representation for property prediction, as well as for downstream tasks such as drug-drug interaction (DDI) prediction.
arXiv Detail & Related papers (2025-02-17T11:53:58Z) - FARM: Functional Group-Aware Representations for Small Molecules [55.281754551202326]
We introduce Functional Group-Aware Representations for Small Molecules (FARM)<n>FARM is a novel model designed to bridge the gap between SMILES, natural language, and molecular graphs.<n>We evaluate FARM on the MoleculeNet dataset, where it achieves state-of-the-art performance on 11 out of 13 tasks.
arXiv Detail & Related papers (2024-10-02T23:04:58Z) - Cell Morphology-Guided Small Molecule Generation with GFlowNets [41.8027680592766]
We propose an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets.
The proposed model learns to generate new molecules that could produce phenotypic effects similar to those of the given image target.
We demonstrate that the proposed method generates molecules with high morphological and structural similarity to the target, increasing the likelihood of similar biological activity.
arXiv Detail & Related papers (2024-08-09T17:40:35Z) - Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
AliDiff is a novel framework to align pretrained target diffusion models with preferred functional properties.
It can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Domain-Agnostic Molecular Generation with Chemical Feedback [44.063584808910896]
MolGen is a pre-trained molecular language model tailored specifically for molecule generation.
It internalizes structural and grammatical insights through the reconstruction of over 100 million molecular SELFIES.
Our chemical feedback paradigm steers the model away from molecular hallucinations, ensuring alignment between the model's estimated probabilities and real-world chemical preferences.
arXiv Detail & Related papers (2023-01-26T17:52:56Z) - Reinforced Molecular Optimization with Neighborhood-Controlled Grammars [63.84003497770347]
We propose MNCE-RL, a graph convolutional policy network for molecular optimization.
We extend the original neighborhood-controlled embedding grammars to make them applicable to molecular graph generation.
We show that our approach achieves state-of-the-art performance in a diverse range of molecular optimization tasks.
arXiv Detail & Related papers (2020-11-14T05:42:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.