Fugu-MT Paper Translation (Abstract): Experiential Reflective Learning for Self-Improving LLM Agents

Paper overview: Experiential Reflective Learning for Self-Improving LLM Agents

arxiv url: http://arxiv.org/abs/2603.24639v1
Date: Wed, 25 Mar 2026 11:43:22 GMT
Status: Translation complete
Last updated in system: 2026-03-27 20:52:47.90695
Title: Experiential Reflective Learning for Self-Improving LLM Agents
Title (reference translation): Experiential Reflective Learning for Self-Improving LLM Agents
Authors: Marc-Antoine Allard, Arnaud Teinturier, Victor Xing, Gautier Viaud
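Abstract summary: Experiential Reflective Learning (ERL) is a simple self-improvement framework that enables rapid environment adaptation. ERL reflects on task trajectories and outcomes to generate actionable lessons that transfer across tasks. ERL improves success rate by 7.8% over a ReAct baseline, with large gains in task completion reliability.
Reference score (independently computed attention score): 1.1074589887824053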
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of complex reasoning and multi-step problem solving. However, these agents struggle to adapt to specialized environments and do not leverage past interactions, approaching each new task from scratch regardless of their accumulated experience. We introduce Experiential Reflective Learning (ERL), a simple self-improvement framework that enables rapid environment adaptation through experiential learning. ERL reflects on task trajectories and outcomes to generate heuristics, capturing actionable lessons that transfer across tasks. At test time, relevant heuristics are retrieved based on the current task and injected into the agent's context to guide execution. On the Gaia2 benchmark, ERL improves success rate by 7.8% over a ReAct baseline, with large gains in task completion reliability, and outperforms prior experiential learning methods. Through systematic ablations, we find that selective retrieval is essential and that heuristics provide more transferable abstractions than few-shot trajectory prompting. These results demonstrate that reflecting on single-attempt experiences to extract transferable heuristics enables effective agent self-improvement.
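Abstract (reference translation): Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of complex reasoning and multi-step problem solving. However, these agents struggle to adapt to specialized environments and do not leverage past interactions, approaching each new task from scratch regardless of accumulated experience. This paper introduces ERL, a simple self-improvement framework that enables rapid environment adaptation through experiential learning. ERL generates heuristics based on task trajectories and outcomes, capturing actionable lessons that transfer across tasks. At test time, relevant heuristics are retrieved based on the current task and injected into the agent's context to guide execution. On the Gaia2 benchmark, ERL improves success rate by 7.8% over a ReAct baseline, with large gains in task completion reliability, and outperforms prior experiential learning methods. Systematic ablations show that selective retrieval is essential and that heuristics provide more transferable abstractions than few-shot trajectory prompting. These results suggest that reflecting on single-attempt experiences to extract transferable heuristics enables effective agent self-improvement.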
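
The loop described in the abstract (reflect on a finished attempt, store the resulting heuristic, retrieve selectively for a new task, inject into the agent's context) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `Heuristic`, `HeuristicStore`, `reflect`, and `build_prompt` are hypothetical, the reflection step is a plain-string stand-in for what would be an LLM call, and the word-overlap (Jaccard) retrieval with a `min_score` threshold is a placeholder, since the abstract does not say how relevance is scored.

```python
from dataclasses import dataclass, field


@dataclass
class Heuristic:
    """A lesson distilled from one past task attempt (hypothetical schema)."""
    source_task: str
    lesson: str


def tokenize(text: str) -> set[str]:
    """Lowercase word set with surrounding punctuation stripped."""
    words = (w.strip(".,:;!?").lower() for w in text.split())
    return {w for w in words if w}


def reflect(task: str, trajectory: list[str], success: bool) -> Heuristic:
    """Stand-in for the reflection step. A real system would prompt an LLM
    with the trajectory and outcome; here we only format a string."""
    outcome = "succeeded" if success else "failed"
    last_step = trajectory[-1] if trajectory else "no steps recorded"
    lesson = f"A similar task {outcome}; its final step was: {last_step}"
    return Heuristic(source_task=task, lesson=lesson)


@dataclass
class HeuristicStore:
    items: list[Heuristic] = field(default_factory=list)

    def add(self, heuristic: Heuristic) -> None:
        self.items.append(heuristic)

    def retrieve(self, task: str, k: int = 2,
                 min_score: float = 0.2) -> list[Heuristic]:
        """Selective retrieval: keep at most k heuristics whose source task
        overlaps the query by at least min_score (Jaccard similarity,
        an assumption made for this sketch)."""
        query = tokenize(task)
        scored = []
        for h in self.items:
            candidate = tokenize(h.source_task)
            union = query | candidate
            score = len(query & candidate) / len(union) if union else 0.0
            if score >= min_score:
                scored.append((score, h))
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [h for _, h in scored[:k]]


def build_prompt(task: str, heuristics: list[Heuristic]) -> str:
    """Inject the retrieved lessons into the agent's context before the task."""
    lines = ["You are an agent that solves tasks step by step."]
    if heuristics:
        lines.append("Lessons from past experience:")
        lines.extend(f"- {h.lesson}" for h in heuristics)
    lines.append(f"Task: {task}")
    return "\n".join(lines)


if __name__ == "__main__":
    store = HeuristicStore()
    store.add(reflect("Schedule a meeting in the calendar app",
                      ["open calendar", "create event"], True))
    store.add(reflect("Send an email to a contact", ["open mail"], False))
    new_task = "Schedule a calendar meeting with Bob"
    print(build_prompt(new_task, store.retrieve(new_task)))
```

The threshold in `retrieve` is what makes retrieval selective in this sketch: heuristics from unrelated tasks are dropped rather than padded into the prompt, which mirrors the abstract's ablation finding that selective retrieval is essential.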

