Fugu-MT 論文翻訳(概要): GRADE: Graph Representation of LLM Agent Dependency and Execution

論文の概要: GRADE: Graph Representation of LLM Agent Dependency and Execution

arxiv url: http://arxiv.org/abs/2606.22741v1
Date: Mon, 22 Jun 2026 01:03:21 GMT
ステータス: 情報取得中
システム内更新日: 2026-06-24 20:32:28.896289
Title: GRADE: Graph Representation of LLM Agent Dependency and Execution
Title（参考訳）: GRADE: LLMエージェント依存と実行のグラフ表現
Authors: Yue Zhao,
Abstract要約: GRADEは、その欠落したレイヤを回復する: ステップノード上の任意の実行を1つのグラフとしてモデル化する。依存関係層は、実行サイズが弱い障害を予測することができる。実行層は、失敗したマルチエージェント実行における障害ステップをローカライズする。
参考スコア（独自算出の注目度）: 6.230697997280125
License:
Abstract: Can one graph represent every kind of LLM agent's run? A trace records what each step did, never what it relied on, the state it read, and the results it reused. GRADE recovers that missing layer: it models any run as one graph over its step nodes with two edge layers, execution edges (what ran in what order) read from the trace for free, and dependency edges (what each step relied on) rarely logged, so each is graded by how it is known, observed, declared, or inferred. One representation, and each layer earns its place. Across six corpora of LLM agents spanning tool use, coding, and the web, the dependency layer can predict failure where run size is weak and, under leave-one-corpus-out transfer, stays above chance on every held-out class while run size fails. Meanwhile, the execution layer localizes the faulting step in a failed multi-agent run. This work also provides a more in-depth analysis of why generic graph neural networks may misread the dependency layer, unlike our feature-based alternative. The same graph representation opens further uses, carrying from failure diagnosis in a single run to efficiency and robustness optimization at scale.
Abstract（参考訳）: 1つのグラフは、全ての LLM エージェントの実行を表現できますか? トレースは、各ステップが何をしたか、何を頼ったか、読んだ状態、そして再利用した結果を記録します。 GRADEは、すべての実行をステップノード上の1つのグラフとしてモデル化し、2つのエッジ層で、実行エッジ(どの順序で実行されたか)がトレースから自由に読み取られ、依存関係エッジ(各ステップが依存したもの)がログに記録されることが滅多にないため、各実行は、その既知の、観察された、宣言された、推測された方法によって評価される。 1つの表現と各レイヤがその場所を得る。ツールの使用、コーディング、Webにまたがる6つのLLMエージェントのコーパスにまたがって、依存性レイヤは、実行サイズが弱く、残り1コーパスアウト転送の下では、実行サイズが失敗する間、すべてのホールトアウトクラスにチャンスを保ちます。一方、実行層は、失敗したマルチエージェント実行における障害ステップをローカライズする。この研究は、ジェネリックグラフニューラルネットワークが、機能ベースの代替手段とは異なり、依存層を誤って読み取る可能性がある理由について、より詳細な分析も提供する。同じグラフ表現は、単一実行時の障害診断から、効率性と大規模なロバストネスの最適化に至るまで、さらなる用途に開放される。

論文の概要: GRADE: Graph Representation of LLM Agent Dependency and Execution

関連論文リスト