Fugu-MT 論文翻訳(概要): VineLM: Trie-Based Fine-Grained Control for Agentic Workflows

論文の概要: VineLM: Trie-Based Fine-Grained Control for Agentic Workflows

arxiv url: http://arxiv.org/abs/2605.23914v1
Date: Thu, 09 Apr 2026 15:31:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 07:09:36.460135
Title: VineLM: Trie-Based Fine-Grained Control for Agentic Workflows
Title（参考訳）: VineLM: エージェントワークフローのためのトライベースファイングラインドコントロール
Authors: Nikos Pagonas, Matthew Lou, Tianyi Peng, Dan Rubenstein, Kostis Kaffes,
Abstract要約: VineLMは、モデル選択プレフィックスの注釈付きトリエとして実行可能な実行を表す。 VineLMは、モデル選択プレフィックスの注釈付きトリエとして実行可能な実行を表す。
参考スコア（独自算出の注目度）: 4.984467399543813
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Agentic workflows interleave configurable LLM stages with tool stages and often include retries or refinement loops. Existing workflow managers profile full workflow configurations offline and assign each request a static workflow-level plan that binds each configurable LLM stage to a single model, reuses that model across repeated loop iterations, and does not revisit those choices at runtime. We present VineLM, a workflow manager that enables fine-grained control by choosing the model for each stage invocation as execution unfolds under request-level objectives such as maximizing accuracy under cost or latency budgets. VineLM represents feasible executions as an annotated trie of model-choice prefixes and uses checkpointing and cascade profiling to estimate path accuracy, cost, and latency without exhaustively profiling every request on every path. At runtime, VineLM re-roots the trie after each stage invocation and replans over the remaining subtrie using the realized execution prefix and remaining latency budget. On NL2SQL and math reasoning workflows, VineLM improves the cost-latency-accuracy frontier over coarse workflow-level baselines, achieving up to 18% higher accuracy at the same per-request budget with its sparse profiling reducing offline profiling cost by 98-99.8% when compared to exhaustive profiling.
Abstract（参考訳）: エージェントワークフローは、設定可能なLCMステージとツールステージをインターリーブする。既存のワークフローマネージャは、オフラインでワークフロー設定をプロファイルし、各リクエストに静的なワークフローレベルプランを割り当て、各構成可能なLCMステージをひとつのモデルにバインドし、繰り返しループを繰り返してモデルを再利用し、実行時にそれらの選択を再考しない。我々は、コストやレイテンシの予算の下での精度の最大化といった要求レベルの目的の下で実行が展開されるにつれて、各ステージ呼び出しのモデルを選択することで、きめ細かい制御を可能にするワークフローマネージャであるVineLMを提案する。 VineLMは、モデル選択プレフィックスの注釈付きトリエとして実行可能な実行を表し、チェックポイントとカスケードプロファイリングを使用して、すべてのパスに対するすべての要求を徹底的にプロファイリングすることなく、パスの正確性、コスト、レイテンシを推定する。実行時にVineLMは、各ステージの実行後にトリエを再ルートし、実行プレフィックスと待ち時間予算を使用して残りのサブストリーをリプランする。 NL2SQLと数学推論のワークフローでは、VineLMは、粗いワークフローレベルのベースラインよりもコスト-レイテンシ-精度のフロンティアを改善し、全体のプロファイリングに比べて、オフラインプロファイリングコストを98-99.8%削減するスパースプロファイリングにより、要求毎の予算で最大18%高い精度を達成する。

論文の概要: VineLM: Trie-Based Fine-Grained Control for Agentic Workflows

関連論文リスト