Fugu-MT 論文翻訳(概要): Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

論文の概要: Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

arxiv url: http://arxiv.org/abs/2606.01220v1
Date: Sun, 31 May 2026 13:11:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.392472
Title: Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling
Title（参考訳）: 強化学習と高速サンプリングによる分子生成のための微調整拡散モデル
Authors: Guang Lin, Shikui Tu, Lei Xu,
Abstract要約: 既存の生成的アプローチは、サンプリング中にコストのかかるポストホック処理に依存したり、トレーニング中に慎重にキュレートされたデータセットを必要とする場合が多い。構造制約下での拡散型分子生成に適した強化学習ファインチューニングフレームワークであるFTDiffを提案する。 FTDiffは、コストのかかるポストホック最適化や複雑なデータエンジニアリングを必要とせず、従来手法よりも一貫して優れていることを示す。
参考スコア（独自算出の注目度）: 22.66917124518062
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Generating molecules that simultaneously satisfy drug-like properties and conform to the 3D structure of a target protein is a core challenge in structure-based drug design (SBDD). Existing generative approaches, however, often rely on costly post-hoc processing during Sampling or require carefully curated datasets during training, yet still achieve modest gains. These limitations are especially pronounced in multi-objective settings, where balancing conflicting criteria remains a core challenge. To address these challenges, We propose FTDiff, a reinforcement learning fine-tuning framework tailored for diffusion-based molecular generation under structural constraints. To ensure stable and sample-efficient optimization, FTDiff adopts a group relative policy optimization (GRPO) style strategy. Furthermore, FTDiff builds upon a time-free pretrained diffusion model and incorporates a fast sampling mechanism that reduces the number of denoising steps, significantly accelerating both training and inference while maintaining generation quality. By optimizing a fixed threshold-aware reward, FTDiff effectively guides the model to produce valid, diverse, and high- quality molecules that balance multiple drug design objectives. Extensive experiments on benchmark datasets demonstrate that FTDiff consistently outperforms prior methods, without requiring expensive post-hoc optimization or intricate data engineering.
Abstract（参考訳）: 標的タンパク質の3次元構造を同時に満足する分子の生成は、構造に基づく薬物設計(SBDD)における中核的な課題である。しかし、既存の生成的アプローチは、サンプリング中にコストのかかるポストホック処理に依存したり、トレーニング中に慎重にキュレートされたデータセットを必要とする場合が多いが、それでも適度な利益は得られない。これらの制限は、矛盾する基準のバランスが依然として重要な課題である、多目的設定において特に顕著である。これらの課題に対処するために,構造制約下での拡散型分子生成に適した強化学習微調整フレームワークFTDiffを提案する。安定かつサンプル効率の最適化を保証するため、FTDiffはグループ相対ポリシー最適化(GRPO)方式を採用する。さらに、FTDiffは、時間のない事前学習拡散モデルを構築し、高速サンプリング機構を導入し、デノナイジングステップの数を削減し、生成品質を維持しながら、トレーニングと推論の両方を著しく加速する。固定しきい値認識報酬を最適化することにより、FTDiffは、複数の医薬品設計目標のバランスをとる有効で多種多様で高品質な分子を生成するために、モデルを効果的にガイドする。ベンチマークデータセットに関する大規模な実験は、FTDiffが高価なポストホック最適化や複雑なデータエンジニアリングを必要とせず、従来手法よりも一貫してパフォーマンスが向上していることを示している。

論文の概要: Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

関連論文リスト