Fugu-MT 論文翻訳(概要): Enhancing Software Engineering Through Closed-Loop Memory Optimization

論文の概要: Enhancing Software Engineering Through Closed-Loop Memory Optimization

arxiv url: http://arxiv.org/abs/2606.05646v1
Date: Thu, 04 Jun 2026 03:17:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-05 22:39:44.520338
Title: Enhancing Software Engineering Through Closed-Loop Memory Optimization
Title（参考訳）: クローズドループメモリ最適化によるソフトウェアエンジニアリングの強化
Authors: Xuehang Guo, Zora Zhiruo Wang, Qingyun Wang, Graham Neubig, Xingyao Wang,
Abstract要約: ソフトウェア工学 (SE) エージェントにおけるメモリ拡張のためのクローズドループフレームワークである ours を紹介する。タスクに依存しない textbf ベンチマークとアノテーションのない textbf 最適化信号としての有用性を確立する。その結果、SEエージェントは設定によって常に改善され、成功率で$uparrow5.25%、解決効率で$uparrow4.63%という絶対的なゲインが得られることが示された。
参考スコア（独自算出の注目度）: 50.418699819003486
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. However, these agents remain fundamentally episodic: they fail to retain, refine, and reuse experiences across tasks, repeatedly reconstructing context from scratch and reproducing similar mistakes. Even with memory support, they offer no remedy for the absence of a principled, task-agnostic \textit{memory utility}, making them difficult to evaluate rigorously or generalize across agents and settings. To tackle these limitations, we introduce \ours, a closed-loop framework for memory augmentation in SE agents. \ours grounds memory utility in \textit{validated downstream impact}, establishing utility as both a task-agnostic \textbf{evaluation benchmark} and an annotation-free \textbf{optimization signal}. Through complementary evaluation on \textit{single-episode} and \textit{cross-episode} memory augmentation, results demonstrate that \ours consistently improves SE agents across settings, achieving absolute gains of up to $\uparrow5.25\%$ in success rate and $\uparrow4.63\%$ in resolve efficiency, while substantially reducing computational cost by $\geq9.79\%$. Our project page: \href{https://xhguo7.github.io/MemOp/}{https://xhguo7.github.io/MemOp/}.
Abstract（参考訳）: 大規模言語モデル(LLM)は、複雑なコードベースをナビゲートし、現実世界の問題を解決する強力なソフトウェアエンジニアリング(SE)エージェントを可能にした。しかし、これらのエージェントは基本的にエピソード的であり、タスク全体にわたる経験を維持し、洗練し、再利用することができず、コンテキストをスクラッチから繰り返し再構築し、同様の間違いを再現する。メモリサポートがあっても、基本的なタスクに依存しない \textit{Memory utility} が存在しないため、エージェントや設定を厳格に評価したり一般化したりすることは困難である。これらの制限に対処するため、SEエージェントのメモリ拡張のためのクローズドループフレームワークであるShaoursを紹介した。タスクに依存しない \textbf{evaluation benchmark} とアノテーションのない \textbf{optimization signal} の両方としてユーティリティを確立する。結果として、 \ours は textit{single-episode} と \textit{cross-episode} のメモリ拡張による相補的評価により、セッティング全体の SE エージェントを一貫して改善し、成功率$\uparrow5.25\%$と$\uparrow4.63\%$の絶対的なゲインを達成し、計算コストを$\geq9.79\%$に大幅に削減することを示した。プロジェクトページ: \href{https://xhguo7.github.io/MemOp/}{https://xhguo7.github.io/MemOp/}。

論文の概要: Enhancing Software Engineering Through Closed-Loop Memory Optimization

関連論文リスト