Fugu-MT 論文翻訳(概要): Co-Evolving Agents: Learning from Failures as Hard Negatives

論文の概要: Co-Evolving Agents: Learning from Failures as Hard Negatives

arxiv url: http://arxiv.org/abs/2511.22254v1
Date: Thu, 27 Nov 2025 09:30:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-01 19:47:55.482135
Title: Co-Evolving Agents: Learning from Failures as Hard Negatives
Title（参考訳）: 共同進化型エージェント: 失敗からハードネガティクスを学ぶ
Authors: Yeonsung Jung, Trilok Padhi, Sina Shaham, Dipika Khullar, Joonhyun Jeong, Ninareh Mehrabi, Eunho Yang,
Abstract要約: 近年の研究では、自己改善剤を自力で生成し、精製し、自身の軌道で再訓練する研究が進められている。本稿では、目標エージェントが補助故障エージェントと共同で改善する共進化型エージェントフレームワークを提案する。
参考スコア（独自算出の注目度）: 38.61683607205988
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The rapid progress of large foundation models has accelerated the development of task-specialized agents across diverse domains. However, the effectiveness of agents remains tightly coupled with the quality of training data, while curating task-specific datasets remains costly and often infeasible in real-world scenarios. Recent work has explored self-improving agents that autonomously generate, refine, and re-train on their own trajectories. A prominent line of approaches further leverages preference optimization by pairing predicted trajectories with scarce ground-truth trajectories, enabling agents to learn directly from their own failures. While these methods outperform supervised fine-tuning, their heavy reliance on predicted trajectories under limited ground-truth supervision leaves them prone to overfitting. To address this, we propose a co-evolving agents framework in which a target agent improves jointly with an auxiliary failure agent. The failure agent learns through preference optimization over failure trajectories from both the target and itself, thereby generating hard negatives that are close to success yet remain failures. Incorporating these informative hard negatives into the target agent's optimization sharpens decision boundaries and enhances generalization. Our comprehensive analysis and experiments across benchmark datasets show that our method not only shows improved performance but also demonstrates that failures, instead of being used as-is, can be systematically transformed into structured and valuable learning signals in self-improving agents.
Abstract（参考訳）: 大規模基盤モデルの急速な進歩は、様々な領域にわたるタスク特化エージェントの開発を加速させてきた。しかし、エージェントの有効性はトレーニングデータの品質と密結合であり、一方タスク固有のデータセットのキュレーションは、現実のシナリオではコストがかかり、しばしば実現不可能である。近年の研究では、自己改善剤を自力で生成し、精製し、自身の軌道で再訓練する研究が進められている。顕著なアプローチのラインは、予測されたトラジェクトリと少ない接地トラジェクトリとをペアにすることで、好みの最適化をさらに活用することで、エージェントは自身の障害から直接学習することができる。これらの手法は監督された微調整よりも優れているが、限られた地道監督の下で予測された軌道に依存しているため、過度に適合する傾向にある。そこで本研究では,目標エージェントが補助的障害エージェントと協調的に改善する,共進化型エージェントフレームワークを提案する。障害エージェントは、目標とそれ自身の両方からの障害軌跡よりも優先的な最適化を通じて学習し、成功に近づきながら失敗を継続するハードネガティブを生成する。これらの情報的ハードネガティブを対象エージェントの最適化に組み込むことで、決定境界を鋭くし、一般化を高める。ベンチマークデータセットの総合的な分析と実験により,我々の手法は性能の向上を示すだけでなく,自己改善エージェントにおいて,失敗を体系的に構造化し,価値ある学習信号に変換できることが示されている。

論文の概要: Co-Evolving Agents: Learning from Failures as Hard Negatives

関連論文リスト