Fugu-MT 論文翻訳(概要): ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

論文の概要: ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

arxiv url: http://arxiv.org/abs/2509.25140v1
Date: Mon, 29 Sep 2025 17:51:03 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:20.188015
Title: ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Title（参考訳）: ReasoningBank: Reasoning Memoryによるスケーリングエージェントの自己進化
Authors: Siru Ouyang, Jun Yan, I-Hung Hsu, Yanfei Chen, Ke Jiang, Zifeng Wang, Rujun Han, Long T. Le, Samira Daruki, Xiangru Tang, Vishy Tirumalashetty, George Lee, Mahsan Rofouei, Hangfei Lin, Jiawei Han, Chen-Yu Lee, Tomas Pfister,
Abstract要約: ReasoningBankは、エージェントの自己判断の成功と失敗の経験から一般化可能な推論戦略を抽出するメモリフレームワークである。テスト時には、エージェントがReasoningBankから関連する記憶を取得してそのインタラクションを知らせ、新しい学習を統合することで、時間が経つにつれてより有能になる。本稿では,エージェントのインタラクションエクスペリエンスをスケールアップすることにより,学習プロセスの高速化と多様化を図るメモリ対応テストタイムスケーリング(MaTTS)を提案する。
参考スコア（独自算出の注目度）: 57.517214479414726
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the growing adoption of large language model agents in persistent real-world roles, they naturally encounter continuous streams of tasks. A key limitation, however, is their failure to learn from the accumulated interaction history, forcing them to discard valuable insights and repeat past errors. We propose ReasoningBank, a novel memory framework that distills generalizable reasoning strategies from an agent's self-judged successful and failed experiences. At test time, an agent retrieves relevant memories from ReasoningBank to inform its interaction and then integrates new learnings back, enabling it to become more capable over time. Building on this powerful experience learner, we further introduce memory-aware test-time scaling (MaTTS), which accelerates and diversifies this learning process by scaling up the agent's interaction experience. By allocating more compute to each task, the agent generates abundant, diverse experiences that provide rich contrastive signals for synthesizing higher-quality memory. The better memory in turn guides more effective scaling, establishing a powerful synergy between memory and test-time scaling. Across web browsing and software engineering benchmarks, ReasoningBank consistently outperforms existing memory mechanisms that store raw trajectories or only successful task routines, improving both effectiveness and efficiency; MaTTS further amplifies these gains. These findings establish memory-driven experience scaling as a new scaling dimension, enabling agents to self-evolve with emergent behaviors naturally arise.
Abstract（参考訳）: 大きな言語モデルエージェントが現実世界の永続的な役割に採用されることで、彼らは自然にタスクの連続ストリームに遭遇する。しかし、重要な制限は、蓄積されたインタラクション履歴から学ばなかったことであり、価値ある洞察を捨て、過去のエラーを繰り返すことを余儀なくされている。エージェントの自己判断から一般化可能な推論戦略を抽出する,新たなメモリフレームワークであるReasoningBankを提案する。テスト時には、エージェントがReasoningBankから関連する記憶を取得してそのインタラクションを知らせ、新しい学習を統合することで、時間が経つにつれてより有能になる。この強力な経験学習者に基づいて、エージェントのインタラクションエクスペリエンスをスケールアップすることで、学習プロセスを加速し、多様化するメモリ対応テストタイムスケーリング(MaTTS)を導入する。各タスクにより多くの計算を割り当てることで、エージェントは、高品質なメモリを合成するための豊富なコントラスト信号を提供する豊富な多様なエクスペリエンスを生成する。優れたメモリはより効率的なスケーリングを導き、メモリとテストタイムのスケーリングの強力な相乗効果を確立する。ウェブブラウジングとソフトウェアエンジニアリングのベンチマークを通じて、ReasoningBankは生の軌跡や成功したタスクルーチンのみを格納する既存のメモリメカニズムを一貫して上回り、効率と効率の両方を改善している。これらの知見は、新たなスケーリングディメンションとして、メモリ駆動のエクスペリエンススケーリングを確立し、エージェントが自然に創発的な振る舞いで自己進化することを可能にする。

論文の概要: ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

関連論文リスト