Fugu-MT 論文翻訳(概要): Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

論文の概要: Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

arxiv url: http://arxiv.org/abs/2602.06025v1
Date: Thu, 05 Feb 2026 18:57:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-06 18:49:09.146179
Title: Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
Title（参考訳）: 実行時エージェントメモリのためのクエリ対応予算階層ルーティングの学習
Authors: Haozhen Zhang, Haodong Yue, Tao Feng, Quanyu Long, Jianzhu Bao, Bowen Jin, Weizhi Zhang, Xiao Li, Jiaxuan You, Chengwei Qin, Wenya Wang,
Abstract要約: BudgetMemは、明示的でクエリ対応のパフォーマンスコスト管理のためのランタイムエージェントメモリフレームワークである。軽量ルータは、タスク性能とメモリ構築コストのバランスをとるために、モジュール間の予算層ルーティングを実行する。 LoCoMo、LongMemEval、HotpotQAの他、BudgetMemはパフォーマンスが優先されるときに、強力なベースラインを超える。
参考スコア（独自算出の注目度）: 56.0946692457838
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Memory is increasingly central to Large Language Model (LLM) agents operating beyond a single context window, yet most existing systems rely on offline, query-agnostic memory construction that can be inefficient and may discard query-critical information. Although runtime memory utilization is a natural alternative, prior work often incurs substantial overhead and offers limited explicit control over the performance-cost trade-off. In this work, we present \textbf{BudgetMem}, a runtime agent memory framework for explicit, query-aware performance-cost control. BudgetMem structures memory processing as a set of memory modules, each offered in three budget tiers (i.e., \textsc{Low}/\textsc{Mid}/\textsc{High}). A lightweight router performs budget-tier routing across modules to balance task performance and memory construction cost, which is implemented as a compact neural policy trained with reinforcement learning. Using BudgetMem as a unified testbed, we study three complementary strategies for realizing budget tiers: implementation (method complexity), reasoning (inference behavior), and capacity (module model size). Across LoCoMo, LongMemEval, and HotpotQA, BudgetMem surpasses strong baselines when performance is prioritized (i.e., high-budget setting), and delivers better accuracy-cost frontiers under tighter budgets. Moreover, our analysis disentangles the strengths and weaknesses of different tiering strategies, clarifying when each axis delivers the most favorable trade-offs under varying budget regimes.
Abstract（参考訳）: メモリは、単一のコンテキストウインドウを越えて動作するLarge Language Model (LLM)エージェントの中心的存在になりつつあるが、既存のほとんどのシステムはオフラインでクエリに依存しないメモリ構造に依存しており、非効率であり、クエリクリティカルな情報を破棄する可能性がある。実行時のメモリ利用は自然な代替手段だが、以前の作業は大きなオーバーヘッドを伴い、パフォーマンスコストのトレードオフに対して限定的なコントロールを提供することが多い。本稿では,明示的なクエリ対応パフォーマンスコスト制御のための実行時エージェントメモリフレームワークである \textbf{BudgetMem} を紹介する。 BudgetMemはメモリ処理をメモリモジュールのセットとして構成し、それぞれが3つの予算層(すなわち \textsc{Low}/\textsc{Mid}/\textsc{High})で提供される。軽量ルータは、タスク性能とメモリ構築コストのバランスをとるためにモジュール間の予算層ルーティングを実行し、強化学習で訓練されたコンパクトなニューラルポリシーとして実装される。 BudgetMemを統合テストベッドとして使用し、実装(メソッドの複雑さ)、推論(推論の振る舞い)、キャパシティ(モジュールモデルサイズ)という、予算層を実現するための3つの補完的な戦略について検討する。 LoCoMo、LongMemEval、HotpotQAにまたがって、BudgetMemはパフォーマンスが優先される(高予算設定)ときに強力なベースラインを超え、より厳格な予算の下でより精度の高いフロンティアを提供する。さらに、分析は異なる階層戦略の強みと弱みを解き、各軸が様々な予算体制の下で最も有利なトレードオフをいつ達成するかを明確にする。

論文の概要: Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

関連論文リスト