Fugu-MT 論文翻訳(概要): Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents

論文の概要: Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents

arxiv url: http://arxiv.org/abs/2606.11680v1
Date: Wed, 10 Jun 2026 05:49:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.312614
Title: Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents
Title（参考訳）: 組織化して検索する: 効率的なエージェントのための階層型メモリナビゲーション
Authors: Hao-Lun Hsu, Nikki Lijing Kuang, Boyi Liu, Zhewei Yao, Yuxiong He,
Abstract要約: HORMAは、大規模な言語モデルのための階層的なオーガナイズ・アンド・レトリーブメモリエージェントである。ワーキングメモリを構造化メモリ構築とナビゲーションベースの検索の2つのステージに分割する。優れた効率性とパフォーマンスのトレードオフを一貫して達成し、目に見えないタスクに効果的に一般化する。
参考スコア（独自算出の注目度）: 29.28371047415946
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language model (LLM) agents struggle with long-horizon tasks due to their inherent statelessness, requiring all task-relevant information to be encoded in growing input contexts. The resulting degraded reasoning quality, increased inference cost, and higher latency necessitate efficient working memory mechanisms. However, existing approaches either rely on lossy compression or similarity-based retrieval, which often fail to capture temporal structure and causal dependencies required for multi-step agentic tasks. In this work, we present HORMA, a Hierarchical Organize-and-Retrieve Memory Agent that organizes experience into a file-system-like hierarchical structure, where summarized entities are linked to the corresponding raw trajectories, enabling efficient access without losing detailed information. HORMA decomposes working memory into two stages: structured memory construction and navigation-based retrieval. The construction module iteratively refines how experiences are structured by distinguishing between failures caused by missing information and those caused by misleading or overloaded context. The navigation module retrieves task-relevant context by traversing the hierarchy using a lightweight agent trained with reinforcement learning to select minimal yet sufficient context, thereby reducing latency along the critical execution path. Across ALFWorld, LoCoMo, and LongMemEval, HORMA improves task performance under constrained context budgets while requiring at most 22.17% of the baseline token usage in long conversation tasks. Compared to existing methods, it consistently achieves better efficiency-performance trade-offs and generalizes effectively to unseen tasks.
Abstract（参考訳）: 大規模言語モデル(LLM)エージェントは、その固有のステートレス性のために、長い水平タスクに苦しむ。その結果、劣化した推論品質、推論コストの増大、高いレイテンシは効率的なワーキングメモリ機構を必要とする。しかし、既存の手法は圧縮の欠如や類似性に基づく検索に依存しており、多段階のエージェントタスクに必要な時間的構造や因果関係を捉えるのに失敗することが多い。本稿では,階層型組織記憶エージェントであるHORMAについて述べる。このHORMAは,経験をファイルシステムのような階層構造に整理し,要約されたエンティティを対応する生のトラジェクトリにリンクし,詳細な情報を失うことなく効率的なアクセスを可能にする。 HORMAは、ワーキングメモリを構造化メモリ構築とナビゲーションベースの検索の2つのステージに分解する。構築モジュールは、情報不足に起因する障害と、誤解を招いたり過負荷になったりするコンテキストによって引き起こされる障害を区別することで、経験がどのように構造化されるかを繰り返し洗練する。ナビゲーションモジュールは、強化学習で訓練された軽量エージェントを使用して階層をトラバースすることでタスク関連コンテキストを検索し、最小限の十分なコンテキストを選択することにより、クリティカル実行パスに沿ったレイテンシを低減する。 ALFWorld、LoCoMo、LongMemEval全体で、HORMAは制約付きコンテキスト予算下でのタスクパフォーマンスを改善し、長い会話タスクにおいて、ベースライントークンの使用の少なくとも22.17%を必要としている。既存の手法と比較して、優れた効率と性能のトレードオフを一貫して達成し、目に見えないタスクに効果的に一般化する。

論文の概要: Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents

関連論文リスト