Fugu-MT Paper Translation (Abstract): Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning

Paper overview: Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning

arxiv url: http://arxiv.org/abs/2605.20609v1
Date: Wed, 20 May 2026 01:54:18 GMT
Status: Translation complete
Last updated in system: 2026-05-21 19:19:56.430451
Title: Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning
Title (reference translation): Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning
Authors: Junseok Kim, Dohyeong Kim, Mineui Hong, Songhwai Oh
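Abstract summary: Compositional generalization is essential for reaching unseen goals in offline goal-conditioned reinforcement learning. We formalize analogy transduction as synthesizing new plans by composing task-endogenous analogies with given contexts. We empirically demonstrate the effectiveness of our approach on OGBench manipulation environments.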
Reference score (independently computed attention score): 17.14266617553098
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Compositional generalization is essential for reaching unseen goals under novel contextual variations in offline goal-conditioned reinforcement learning (GCRL), where a generalist goal-reaching agent must be learned from limited data. Most prior approaches pursue this via trajectory stitching over temporally contiguous segments, which limits composing behaviors across varying contexts. To overcome this limitation, we formalize analogy transduction as synthesizing new plans by composing task-endogenous analogies with given contexts and propose a novel analogy representation tailored for it. Grounded in our theory, this analogy representation captures what changes under optimal task execution, remains invariant to contextual variations, and is sufficient for optimal goal reaching. We further contend that generalization to unseen analogy-context pairs is a practical obstacle in analogy transduction, and introduce a new approach for offline GCRL that enables analogy transduction beyond seen pairs to unseen combinations. We empirically demonstrate the effectiveness of our approach on OGBench manipulation environments, substantially outperforming prior methods that do not perform analogy transduction. Project page: https://rllab-snu.github.io/projects/CTA/
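Abstract (reference translation): In offline goal-conditioned reinforcement learning (GCRL), a generalist goal-reaching agent must be learned from limited data, and compositional generalization is essential for reaching unseen goals under novel contextual variations. Most prior approaches pursue this by stitching trajectories over temporally contiguous segments, which limits composing behaviors across varying contexts. To overcome this limitation, we formalize analogy transduction as synthesizing new plans by composing task-endogenous analogies with given contexts, and propose a novel analogy representation tailored to it. This analogy representation captures what changes under optimal task execution, remains invariant to contextual variations, and is sufficient for optimal goal reaching. We further argue that generalization to unseen analogy-context pairs is a practical obstacle in analogy transduction, and propose a new approach for offline GCRL that extends analogy transduction beyond seen pairs to unseen combinations. We empirically demonstrate the effectiveness of our approach on OGBench manipulation environments, substantially outperforming prior methods that do not perform analogy transduction. Project page: https://rllab-snu.github.io/projects/CTA/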

Related papers