MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization
- URL: http://arxiv.org/abs/2512.20135v2
- Date: Wed, 24 Dec 2025 02:19:21 GMT
- Title: MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization
- Authors: Zhuo Yang, Yeyun Chen, Jiaqing Xie, Ben Gao, Shuaike Shen, Wanhao Liu, Liujia Yang, Beilun Wang, Tianfan Fu, Yuqiang Li,
- Abstract summary: We introduce MolAct, an agentic reinforcement learning framework for molecular design problems.<n>We instantiate the framework to train two model families: MolEditAgent for molecular editing tasks and MolOptAgent for molecular optimization tasks.<n>Results show that treating molecular design as a multi-step, tool-augmented process is key to reliable and interpretable improvements.
- Score: 16.402871404193576
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Molecular editing and optimization are multi-step problems that require iteratively improving properties while keeping molecules chemically valid and structurally similar. We frame both tasks as sequential, tool-guided decisions and introduce MolAct, an agentic reinforcement learning framework that employs a two-stage training paradigm: first building editing capability, then optimizing properties while reusing the learned editing behaviors. To the best of our knowledge, this is the first work to formalize molecular design as an Agentic Reinforcement Learning problem, where an LLM agent learns to interleave reasoning, tool-use, and molecular optimization. The framework enables agents to interact in multiple turns, invoking chemical tools for validity checking, property assessment, and similarity control, and leverages their feedback to refine subsequent edits. We instantiate the MolAct framework to train two model families: MolEditAgent for molecular editing tasks and MolOptAgent for molecular optimization tasks. In molecular editing, MolEditAgent-7B delivers 100, 95, and 98 valid add, delete, and substitute edits, outperforming strong closed "thinking" baselines such as DeepSeek-R1; MolEditAgent-3B approaches the performance of much larger open "thinking" models like Qwen3-32B-think. In molecular optimization, MolOptAgent-7B (trained on MolEditAgent-7B) surpasses the best closed "thinking" baseline (e.g., Claude 3.7) on LogP and remains competitive on solubility, while maintaining balanced performance across other objectives. These results highlight that treating molecular design as a multi-step, tool-augmented process is key to reliable and interpretable improvements.
Related papers
- Agentic reinforcement learning empowers next-generation chemical language models for molecular design and synthesis [51.83339196548892]
ChemCraft is a novel framework that decouples chemical reasoning from knowledge storage.<n>ChemCraft achieves superior performance with minimal inference costs.<n>This work establishes a cost-effective and privacy-preserving paradigm for AI-aided chemistry.
arXiv Detail & Related papers (2026-01-25T04:23:34Z) - M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints [35.83366265234892]
textbfM olGen is a fragment-level, retrieval-augmented, two-stage framework for molecule generation under multi-property constraints.<n>A dataset with reasoning chains of fragment edits and measured property deltas underpins both stages.<n>Experiments on generation under two sets of property constraints show consistent gains in validity and precise satisfaction of multi-property targets.
arXiv Detail & Related papers (2026-01-15T07:18:05Z) - MolEdit: Knowledge Editing for Multimodal Molecule Language Models [57.85765246726558]
MolEdit is a framework for molecule-to-caption generation and caption-to-molecule generation.<n>MolEdit combines a Multi-Expert Knowledge Adapter that routes edits to specialized experts for different molecular facets with an Expertise-Aware Editing Switcher.<n>MolEdit delivers up to 18.8% higher Reliability and 12.0% better Locality than baselines while maintaining efficiency.
arXiv Detail & Related papers (2025-11-16T20:48:37Z) - Coder as Editor: Code-driven Interpretable Molecular Optimization [36.84386817559159]
We introduce MECo, a framework that bridges reasoning and execution by translating editing actions into executable code.<n>Our approach achieves over 98% accuracy in reproducing held-out realistic edits from chemical reactions and target-specific compound pairs.
arXiv Detail & Related papers (2025-10-16T08:55:06Z) - An Auditable Agent Platform For Automated Molecular Optimisation [0.0]
Drug discovery frequently loses momentum when data, expertise, and tools are scattered.<n>To shorten this loop we built a hierarchical, tool using agent framework that automates molecular optimisation.
arXiv Detail & Related papers (2025-08-05T13:41:32Z) - MT-Mol:Multi Agent System with Tool-based Reasoning for Molecular Optimization [13.94416046565452]
We introduce MT-Mol, a multi-agent framework for molecular optimization using large language models (LLMs)<n>Our system incorporates comprehensive RDKit tools, categorized into five distinct domains: structural descriptors, electronic and topological features, fragment-based functional groups, molecular representations, and miscellaneous chemical properties.<n> MT-Mol produces molecules with tool-aligned and stepwise reasoning through the interaction between the analyst agents, a molecule-generating scientist, a reasoning-output verifier, and a reviewer agent.
arXiv Detail & Related papers (2025-05-27T07:27:30Z) - MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning [4.430115182041077]
MolEditRL is a molecular editing framework that integrates structural constraints with precise property optimization.<n>For comprehensive evaluation, we construct MolEdit-Instruct, the largest and most property-rich molecular editing dataset.
arXiv Detail & Related papers (2025-05-26T15:29:08Z) - Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [20.250683535089617]
We propose a text-guided multi-property molecular optimization method utilizing transformer-based diffusion language model (TransDLM)<n>By fusing physically and chemically detailed semantics with specialized molecular representations, TransDLM effectively integrates diverse information sources to guide precise optimization.
arXiv Detail & Related papers (2024-10-17T14:30:27Z) - Learning to Use Tools via Cooperative and Interactive Agents [58.77710337157665]
Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility.
We propose ConAgents, a Cooperative and interactive Agents framework, which coordinates three specialized agents for tool selection, tool execution, and action calibration separately.
Our experiments on three datasets show that the LLMs, when equipped with ConAgents, outperform baselines with substantial improvement.
arXiv Detail & Related papers (2024-03-05T15:08:16Z) - Empowering Molecule Discovery for Molecule-Caption Translation with Large Language Models: A ChatGPT Perspective [53.300288393173204]
Large Language Models (LLMs) have shown remarkable performance in various cross-modal tasks.
In this work, we propose an In-context Few-Shot Molecule Learning paradigm for molecule-caption translation.
We evaluate the effectiveness of MolReGPT on molecule-caption translation, including molecule understanding and text-based molecule generation.
arXiv Detail & Related papers (2023-06-11T08:16:25Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - Optimizing Molecules using Efficient Queries from Property Evaluations [66.66290256377376]
We propose QMO, a generic query-based molecule optimization framework.
QMO improves the desired properties of an input molecule based on efficient queries.
We show that QMO outperforms existing methods in the benchmark tasks of optimizing small organic molecules.
arXiv Detail & Related papers (2020-11-03T18:51:18Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.