Synthesizable Molecular Generation via Soft-constrained GFlowNets with Rich Chemical Priors
- URL: http://arxiv.org/abs/2602.04119v1
- Date: Wed, 04 Feb 2026 01:27:42 GMT
- Title: Synthesizable Molecular Generation via Soft-constrained GFlowNets with Rich Chemical Priors
- Authors: Hyeonah Kim, Minsu Kim, Celine Roget, Dionessa Biton, Louis Vaillancourt, Yves V. Brun, Yoshua Bengio, Alex Hernandez-Garcia
- Abstract summary: We propose S3-GFN, which generates synthesizable SMILES molecules via simple soft regularization of a sequence-based GFlowNet. Our experiments show that S3-GFN learns to generate synthesizable molecules with higher rewards in diverse tasks.
- Score: 42.095574458478616
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The application of generative models for experimental drug discovery campaigns is severely limited by the difficulty of designing molecules de novo that can be synthesized in practice. Previous works have leveraged Generative Flow Networks (GFlowNets) to impose hard synthesizability constraints through the design of state and action spaces based on predefined reaction templates and building blocks. Despite the promising prospects of this approach, it currently lacks flexibility and scalability. As an alternative, we propose S3-GFN, which generates synthesizable SMILES molecules via simple soft regularization of a sequence-based GFlowNet. Our approach leverages rich molecular priors learned from large-scale SMILES corpora to steer molecular generation towards high-reward, synthesizable chemical spaces. The model induces constraints through off-policy replay training with a contrastive learning signal based on separate buffers of synthesizable and unsynthesizable samples. Our experiments show that S3-GFN learns to generate synthesizable molecules ($\geq 95\%$) with higher rewards in diverse tasks.
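The abstract names the ingredients (a sequence-based GFlowNet, off-policy replay, and separate buffers of synthesizable and unsynthesizable samples) but not the exact objective. The following is a minimal sketch of how those pieces could fit together, not the authors' implementation. It assumes the standard trajectory-balance loss for the GFlowNet part, and the hinge-style `contrastive_penalty`, the `TwoBufferReplay` class, and all parameter names are hypothetical illustrations.

```python
import random
from collections import deque


def trajectory_balance_loss(log_z, log_pf_steps, log_reward):
    """Squared trajectory-balance residual for one generated sequence.

    For left-to-right (append-only) generation every string has a single
    trajectory, so the backward-policy term is zero and is omitted here.
    """
    return (log_z + sum(log_pf_steps) - log_reward) ** 2


def contrastive_penalty(log_pf_synth, log_pf_unsynth, margin=1.0):
    """Hypothetical hinge penalty (an assumption, not taken from the paper).

    It is zero once the policy's log-likelihood of a synthesizable sample
    exceeds that of an unsynthesizable sample by at least `margin`.
    """
    return max(0.0, margin - (log_pf_synth - log_pf_unsynth))


class TwoBufferReplay:
    """Hypothetical replay store with one buffer per synthesizability label."""

    def __init__(self, capacity=1000):
        self.synth = deque(maxlen=capacity)
        self.unsynth = deque(maxlen=capacity)

    def add(self, smiles, log_reward, is_synthesizable):
        # The label is assumed to come from an external synthesizability check.
        target = self.synth if is_synthesizable else self.unsynth
        target.append((smiles, log_reward))

    def sample_pair(self, rng=random):
        # Draw one sample from each buffer for the contrastive term.
        return rng.choice(self.synth), rng.choice(self.unsynth)
```

In this sketch, a training step would add the trajectory-balance loss on replayed sequences to a weighted `contrastive_penalty` computed on a pair drawn with `sample_pair`. The constraint stays soft because unsynthesizable strings are penalized rather than removed from the action space.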
Related papers
- Rethinking Molecule Synthesizability with Chain-of-Reaction [47.744071119775676]
We introduce ReaSyn, a generative framework for synthesizable projection. We propose a novel perspective that views synthetic pathways as akin to reasoning paths in large language models (LLMs). With the CoR notation, ReaSyn can get dense supervision in every reaction step to explicitly learn chemical reaction rules.
arXiv Detail & Related papers (2025-09-19T15:29:57Z)
- SynCoGen: Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling [29.856853267388924]
We present SynCoGen, a framework that combines masked graph diffusion and flow matching for synthesizable 3D molecule generation. To train the model, we curated SynSpace, a dataset containing over 600K synthesis-aware building block graphs and 3.3M conformers.
arXiv Detail & Related papers (2025-07-16T00:36:35Z)
- Generative Molecular Design with Steerable and Granular Synthesizability Control [0.3065062372337749]
We propose a small molecule generative design framework that enables steerable and granular synthesizability control. We show the capability to mix-and-match these reaction constraints across the most common medicinal chemistry transformations. We demonstrate how our framework can be used to valorize industrial byproducts towards de novo optimized molecules.
arXiv Detail & Related papers (2025-05-13T17:53:54Z)
- Generative Flows on Synthetic Pathway for Drug Design [39.69010664056235]
We propose RxnFlow, which sequentially assembles molecules using predefined molecular building blocks and chemical reaction templates. RxnFlow achieves state-of-the-art performance on CrossDocked 2020 for pocket-conditional generation, with an average Vina score of -8.85 kcal/mol and 34.8% synthesizability.
arXiv Detail & Related papers (2024-10-06T16:34:01Z)
- RGFN: Synthesizable Molecular Generation Using GFlowNets [51.33672611338754]
We propose Reaction-GFlowNet, an extension of the GFlowNet framework that operates directly in the space of chemical reactions.
RGFN allows out-of-the-box synthesizability while maintaining comparable quality of generated candidates.
We demonstrate the effectiveness of the proposed approach across a range of oracle models, including pretrained proxy models and GPU-accelerated docking.
arXiv Detail & Related papers (2024-06-01T13:11:11Z)
- SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints [16.21161274235011]
We introduce SynFlowNet, a GFlowNet model whose action space uses chemical reactions and purchasable reactants to sequentially build new molecules. By incorporating forward synthesis as an explicit constraint of the generative mechanism, we aim at bridging the gap between in silico molecular generation and real-world synthesis capabilities.
arXiv Detail & Related papers (2024-05-02T10:15:59Z)
- Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z)
- MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
Generative models and reinforcement learning approaches have had initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework that uses the input molecule as an initial guess and samples molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
- Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning [75.95376096628135]
We propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design.
In this setup, the agent learns to navigate through the immense synthetically accessible chemical space.
We describe how the end-to-end training in this study represents an important paradigm in radically expanding the synthesizable chemical space.
arXiv Detail & Related papers (2020-04-26T21:40:03Z)
This list is automatically generated from the titles and abstracts of the papers on this site.