Fugu-MT 論文翻訳(概要): Boosted GFlowNets: Improving Exploration via Sequential Learning

論文の概要: Boosted GFlowNets: Improving Exploration via Sequential Learning

arxiv url: http://arxiv.org/abs/2511.09677v1
Date: Fri, 14 Nov 2025 01:03:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-14 22:53:22.408759
Title: Boosted GFlowNets: Improving Exploration via Sequential Learning
Title（参考訳）: Boosted GFlowNets: シーケンシャルラーニングによる探索の改善
Authors: Pedro Dall'Antonia, Tiago da Silva, Daniel Augusto de Souza, César Lincoln C. Mattos, Diego Mesquita,
Abstract要約: Boosted GFlowNetsは、GFlowNetsのアンサンブルを逐次訓練する手法で、それぞれが、以前のモデルで既に捕獲された質量を補償する残留報酬を最適化する。本稿では, マルチモーダル合成ベンチマークとペプチド設計タスクにおいて, GFlowNetsの探索とサンプルの多様性を著しく向上することを示す。
参考スコア（独自算出の注目度）: 13.119757506183392
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative Flow Networks (GFlowNets) are powerful samplers for compositional objects that, by design, sample proportionally to a given non-negative reward. Nonetheless, in practice, they often struggle to explore the reward landscape evenly: trajectories toward easy-to-reach regions dominate training, while hard-to-reach modes receive vanishing or uninformative gradients, leading to poor coverage of high-reward areas. We address this imbalance with Boosted GFlowNets, a method that sequentially trains an ensemble of GFlowNets, each optimizing a residual reward that compensates for the mass already captured by previous models. This residual principle reactivates learning signals in underexplored regions and, under mild assumptions, ensures a monotone non-degradation property: adding boosters cannot worsen the learned distribution and typically improves it. Empirically, Boosted GFlowNets achieve substantially better exploration and sample diversity on multimodal synthetic benchmarks and peptide design tasks, while preserving the stability and simplicity of standard trajectory-balance training.
Abstract（参考訳）: 生成フローネットワーク(GFlowNets)は、設計上、与えられた非負の報酬に比例する、構成オブジェクトのための強力なサンプルである。それにもかかわらず、実際には、彼らは報酬の風景を均等に探究するのに苦労することが多く、すなわち、易解な地域への軌道が訓練を支配しているのに対して、難解なモードは消滅または非形式的な勾配を受けており、高水準の地域をカバーしていない。我々は、GFlowNetsのアンサンブルを逐次訓練するBoosted GFlowNetsを用いて、この不均衡に対処する。この残留原理は、探索されていない地域での学習信号を再活性化し、穏やかな仮定の下では、単調な非劣化特性を保証する。実証的に、Boosted GFlowNetsは、標準軌道バランストレーニングの安定性と単純さを維持しながら、マルチモーダルな合成ベンチマークとペプチド設計タスクにおいて、かなり優れた探索とサンプル多様性を実現している。

論文の概要: Boosted GFlowNets: Improving Exploration via Sequential Learning

関連論文リスト