Fugu-MT 論文翻訳(概要): Are We Ready For An Agent-Native Memory System?

論文の概要: Are We Ready For An Agent-Native Memory System?

arxiv url: http://arxiv.org/abs/2606.24775v1
Date: Tue, 23 Jun 2026 16:34:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:49.074451
Title: Are We Ready For An Agent-Native Memory System?
Title（参考訳）: エージェントネイティブメモリシステムの準備はできているか?
Authors: Wei Zhou, Xuanhe Zhou, Shaokun Han, Hongming Xu, Guoliang Li, Zhiyu Li, Feiyu Xiong, Fan Wu,
Abstract要約: 大規模言語モデル(LLM)エージェントのメモリは、単純な検索拡張機構からデータ管理システムへと進化してきた。既存の評価は、主にエンドツーエンドのタスク成功メトリクスを通じてエージェントメモリをベンチマークする。データ管理の観点からエージェントメモリの系統的研究を行った。
参考スコア（独自算出の注目度）: 37.760278978612874
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Memory for large language model (LLM) agents has rapidly evolved from simple retrieval-augmented mechanisms into a data management system that supports persistent information storage, retrieval, update, consolidation, and dynamic lifecycle governance throughout agent execution. Despite this evolution, existing evaluations still benchmark agent memory mainly through end-to-end task success metrics (e.g., F1, BLEU), while treating the underlying system as a monolithic black box. As a result, critical system-level concerns, including operational costs, architectural trade-offs across memory modules, and robustness under dynamic knowledge updates, remain insufficiently explored. In this paper, we present a systematic experimental study of agent memory from a data management perspective. We propose an analytical framework that decomposes agent memory into four core modules: memory representation and storage, extraction, retrieval and routing, and maintenance. Under this framework, we evaluate 12 representative memory systems and two reference baselines across five benchmark workloads spanning 11 datasets. Our extensive end-to-end evaluation shows that no single architecture dominates across all scenarios; instead, effectiveness depends heavily on how well the memory structure aligns with the workload bottleneck. Furthermore, through fine-grained ablation studies, we quantify their individual effects on representation fidelity, retrieval precision, update correctness, and long-horizon stability. Finally, we reveal cost-performance trade-offs under realistic workloads, showing localized maintenance is more cost-efficient than global reorganization. Based on these findings, we identify promising directions towards building truly agent-native memory systems. The code is publicly available at https://github.com/OpenDataBox/MemoryData.
Abstract（参考訳）: 大規模言語モデル(LLM)エージェントのメモリは、単純な検索拡張メカニズムから、永続的な情報ストレージ、検索、更新、統合、エージェントの実行を通して動的ライフサイクルガバナンスをサポートするデータ管理システムへと急速に進化してきた。このような進化にもかかわらず、既存の評価は、主にエンドツーエンドのタスク成功メトリクス(例えば、F1、BLEU)を通じてエージェントメモリをベンチマークし、基盤となるシステムをモノリシックなブラックボックスとして扱う。その結果、運用コスト、メモリモジュール間のアーキテクチャトレードオフ、動的知識更新の下での堅牢性など、システムレベルの重要な懸念は、まだ十分に調査されていない。本稿では,データ管理の観点からエージェントメモリの系統的研究を行う。本稿では,エージェントメモリをメモリ表現と記憶,抽出,検索とルーティング,メンテナンスの4つのコアモジュールに分解する分析フレームワークを提案する。このフレームワークでは、11のデータセットにまたがる5つのベンチマークワークロードに対して、12の代表的なメモリシステムと2つの基準ベースラインを評価します。大規模なエンドツーエンド評価では、すべてのシナリオで単一のアーキテクチャが支配的でないことが示されています。さらに, 微細なアブレーション研究を通じて, 表現の忠実度, 検索精度, 更新精度, 長期安定性に対する個々の効果を定量化する。最後に、現実的なワークロード下でのコストパフォーマンスのトレードオフを明らかにし、グローバルな再編成よりも局所的なメンテナンスの方がコスト効率が高いことを示す。これらの知見に基づき、真にエージェントネイティブなメモリシステムを構築するための有望な方向性を特定する。コードはhttps://github.com/OpenDataBox/MemoryDataで公開されている。

論文の概要: Are We Ready For An Agent-Native Memory System?

関連論文リスト