Fugu-MT 論文翻訳(概要): ExpWeaver: LLM Agents Learn from Experience via Latent RAG

論文の概要: ExpWeaver: LLM Agents Learn from Experience via Latent RAG

arxiv url: http://arxiv.org/abs/2606.01041v1
Date: Sun, 31 May 2026 06:11:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.160592
Title: ExpWeaver: LLM Agents Learn from Experience via Latent RAG
Title（参考訳）: ExpWeaver: LLMエージェントは潜在RAGを通じて経験から学ぶ
Authors: Tao Feng, Tianyang Luo, Jingjun Xu, Zhigang Hua, Yan Xie, Shuang Yang, Ge Liu, Jiaxuan You,
Abstract要約: 我々は,LLMエージェントが潜時検索拡張生成を通じて経験から学習できるフレームワークであるExpWeaverを提案する。 ExpWeaverはLLM自身の隠された状態を使用してエクスペリエンスをエンコードし、各デコードステップで遅延空間で関連するエクスペリエンスを直接取得する。 ExpWeaverは、質問応答、推論、コーディング、科学的予測、レコメンデーションを含む13のタスクで評価する。
参考スコア（独自算出の注目度）: 31.285842755478896
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Experience learning has achieved promising results in enhancing LLM agent planning and reasoning by integrating past interactions as reusable knowledge. However, existing methods remain confined to explicit text space, retrieving experiences via semantic similarity and concatenating them into the context window, leading to substantial token overhead and a decoupled architecture that separates retrieval from generation. To address these limitations, we propose ExpWeaver, a framework that enables LLM agents to learn from experience via latent retrieval-augmented generation, without requiring a separate RAG module. ExpWeaver encodes experiences using the LLM's own hidden states, retrieves relevant experiences directly in latent space at each decoding step, and integrates them through cross-attention aggregation and gated residual mechanisms. The entire pipeline is optimized end-to-end with reinforcement learning, supporting both generative and ranking tasks. We evaluate ExpWeaver on 13 diverse tasks spanning question answering, reasoning, coding, scientific prediction, and recommendation. Results demonstrate that ExpWeaver achieves state-of-the-art performance on 12 out of 13 tasks, outperforming the strongest baseline by over 6.8%; maintains token efficiency comparable to non-retrieval baselines while text-based retrieval methods require 1.5 to 2 times more tokens; and exhibits superior cross-domain generalization, outperforming the strongest baseline by 16.32% under zero-shot transfer and 15.21% under few-shot transfer. Our code for ExpWeaver is released at https://github.com/ulab-uiuc/ExpWeaver.
Abstract（参考訳）: 経験学習は、過去のインタラクションを再利用可能な知識として統合することで、LLMエージェントの計画と推論を強化するという有望な成果を達成した。しかし、既存のメソッドは明示的なテキスト空間に限られており、セマンティックな類似性を通じて経験を検索し、それらをコンテキストウィンドウにまとめる。これらの制約に対処するため,我々は,LLMエージェントがRAGモジュールを別途必要とせずに,潜時検索拡張生成を通じて経験から学習できるフレームワークであるExpWeaverを提案する。 ExpWeaverはLLM自身の隠された状態を使用して経験をエンコードし、各デコードステップで遅延空間で関連する経験を直接取得し、相互アテンションアグリゲーションを通じてそれらを統合し、残留メカニズムをゲートする。パイプライン全体がエンドツーエンドに最適化され、強化学習され、生成タスクとランキングタスクの両方をサポートする。 ExpWeaverは、質問応答、推論、コーディング、科学的予測、レコメンデーションを含む13のタスクで評価する。その結果、ExpWeaverは13タスク中12タスクで最先端のパフォーマンスを達成し、最強のベースラインを6.8%以上上回り、非検索ベースラインに匹敵するトークン効率を維持し、テキストベースの検索手法では1.5倍から2倍のトークンを必要とし、最強のベースラインを16.32%上回り、少ショット転送では15.21%上回った。 ExpWeaverのコードはhttps://github.com/ulab-uiuc/ExpWeaver.comで公開されています。

論文の概要: ExpWeaver: LLM Agents Learn from Experience via Latent RAG

関連論文リスト