Fugu-MT 論文翻訳(概要): Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

論文の概要: Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

arxiv url: http://arxiv.org/abs/2602.09424v1
Date: Tue, 10 Feb 2026 05:39:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-11 20:17:43.386077
Title: Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design
Title（参考訳）: 分子・生物配列設計のためのクリーンサンプルマルコフ鎖による逆ガイド型離散拡散
Authors: Prin Phunyaphibarn, Minhyuk Sung,
Abstract要約: 離散拡散モデルに対する試験時間報酬誘導サンプリング法を提案する。 clean-Sample Markov Chain (CSMC) Smplerは、中間報酬に頼らずにローカル検索を可能にする。我々の手法は、中間報酬に依存する先行アプローチを一貫して上回る。
参考スコア（独自算出の注目度）: 25.729441906206997
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Discrete diffusion models have recently emerged as a powerful class of generative models for chemistry and biology data. In these fields, the goal is to generate various samples with high rewards (e.g., drug-likeness in molecules), making reward-based guidance crucial. Most existing methods are based on guiding the diffusion model using intermediate rewards but tend to underperform since intermediate rewards are noisy due to the non-smooth nature of reward functions used in scientific domains. To address this, we propose Clean-Sample Markov Chain (CSMC) Sampler, a method that performs effective test-time reward-guided sampling for discrete diffusion models, enabling local search without relying on intermediate rewards. CSMC constructs a Markov chain of clean samples using the Metropolis-Hastings algorithm such that its stationary distribution is the target distribution. We design a proposal distribution by sequentially applying the forward and backward diffusion processes, making the acceptance probability tractable. Experiments on molecule and biological sequence generation with various reward functions demonstrate that our method consistently outperforms prior approaches that rely on intermediate rewards.
Abstract（参考訳）: 離散拡散モデルは最近、化学および生物学データのための強力な生成モデルのクラスとして登場した。これらの分野では、高い報酬(例えば分子の薬物類似性)を持つ様々なサンプルを生成し、報酬に基づくガイダンスを不可欠にすることを目的としている。既存の手法の多くは、中間報酬を用いた拡散モデルを導くことに基づいているが、科学領域で使われる報酬関数の非滑らかな性質のため、中間報酬はノイズが多いため、性能が劣る傾向がある。そこで本研究では,離散拡散モデルに対して効率的な実時間報酬誘導サンプリングを行う手法であるCSMCサアンプを提案し,中間報酬に頼らずに局所探索を可能にする。 CSMCは、その定常分布がターゲット分布であるように、メトロポリス・ハスティングスアルゴリズムを用いてクリーンサンプルのマルコフ連鎖を構築する。本研究では,前方及び後方拡散過程を逐次適用して提案分布を設計し,受理確率を抽出可能とした。様々な報酬関数を持つ分子および生物学的配列生成の実験は、中間報酬に依存する先行アプローチを一貫して上回っていることを示す。

論文の概要: Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

関連論文リスト