Fugu-MT 論文翻訳(概要): Learning Lifted Action Models From Traces of Incomplete Actions and States

論文の概要: Learning Lifted Action Models From Traces of Incomplete Actions and States

arxiv url: http://arxiv.org/abs/2508.21449v1
Date: Fri, 29 Aug 2025 09:27:53 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-01 19:45:10.988445
Title: Learning Lifted Action Models From Traces of Incomplete Actions and States
Title（参考訳）: 不完全な行動の痕跡と状態からリフテッドアクションモデルを学ぶ
Authors: Niklas Jansen, Jonas Gösgens, Hector Geffner,
Abstract要約: ランダムな状態-動作トレースからスライディングタイルパズルの昇降STRIPSモデルを学習する問題を考察する。我々はSTRIPS+と呼ばれるSTRIPSの変種を紹介し、そこでは特定のSTRIPSアクション引数を事前条件で暗黙的に残すことができる。提案した学習アルゴリズムはSynTHと呼ばれ、各アクションに対して事前条件式やクエリ'の階層化されたシーケンス(接続)を構築する。
参考スコア（独自算出の注目度）: 6.30853901017231
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Consider the problem of learning a lifted STRIPS model of the sliding-tile puzzle from random state-action traces where the states represent the location of the tiles only, and the actions are the labels up, down, left, and right, with no arguments. Two challenges are involved in this problem. First, the states are not full STRIPS states, as some predicates are missing, like the atoms representing the position of the ``blank''. Second, the actions are not full STRIPS either, as they do not reveal all the objects involved in the actions effects and preconditions. Previous approaches have addressed different versions of this model learning problem, but most assume that actions in the traces are full STRIPS actions or that the domain predicates are all observable. The new setting considered in this work is more ``realistic'', as the atoms observed convey the state of the world but not full STRIPS states, and the actions reveal the arguments needed for selecting the action but not the ones needed for modeling it in STRIPS. For formulating and addressing the learning problem, we introduce a variant of STRIPS, which we call STRIPS+, where certain STRIPS action arguments can be left implicit in preconditions which can also involve a limited form of existential quantification. The learning problem becomes the problem of learning STRIPS+ models from STRIPS+ state-action traces. For this, the proposed learning algorithm, called SYNTH, constructs a stratified sequence (conjunction) of precondition expressions or ``queries'' for each action, that denote unique objects in the state and ground the implicit action arguments in STRIPS+. The correctness and completeness of SYNTH is established, and its scalability is tested on state-action traces obtained from STRIPS+ models derived from existing STRIPS domains.
Abstract（参考訳）: 状態がタイルの位置のみを表すランダムな状態-動作トレースからスライディングタイルパズルの昇降STRIPSモデルを学習する問題を考える。この問題には2つの課題がある。第一に、状態は「ブランク」の位置を表す原子のように、一部の述語が欠落しているため、完全なSTRIPS状態ではない。第二に、アクションエフェクトとプレコンディションに関連するすべてのオブジェクトを明らかにすることはないため、アクションも完全なSTRIPSではない。従来のアプローチでは、このモデル学習問題の異なるバージョンに対処してきたが、ほとんどの場合、トレース中のアクションが完全なSTRIPSアクションであるか、ドメイン述語がすべて観測可能であると仮定している。この研究で考慮された新しい設定はより「現実的」であり、観測された原子は世界の状態を伝達するが、完全なSTRIPS状態ではないため、アクションはアクションを選択するのに必要な議論を明らかにするが、STRIPSでそれをモデル化するために必要なものではない。学習問題を定式化し,解決するために,STRIPS+と呼ばれるSTRIPSの変種を導入する。学習問題はSTRIPS+状態-動作トレースからSTRIPS+モデルを学習する問題となる。このために、SynTHと呼ばれる学習アルゴリズムは、各アクションに対して事前条件表現の階層化されたシーケンス(接続)を構築し、状態のユニークなオブジェクトを示し、STRIPS+の暗黙的なアクション引数を基底とする。 SynTHの正確性と完全性を確立し、既存のSTRIPSドメインから派生したSTRIPS+モデルから得られた状態-作用トレースに対してスケーラビリティを検証した。

論文の概要: Learning Lifted Action Models From Traces of Incomplete Actions and States

関連論文リスト