Fugu-MT 論文翻訳(概要): MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions

論文の概要: MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions

arxiv url: http://arxiv.org/abs/2502.16796v1
Date: Mon, 24 Feb 2025 03:12:45 GMT
ステータス: 翻訳完了
システム内更新日: 2025-02-25 22:36:56.383019
Title: MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions
Title（参考訳）: MobileSteward: 複数のアプリケーション指向エージェントを自己進化で統合して、クロスアプリケーションのインストラクションを自動化する
Authors: Yuxuan Liu, Hongda Sun, Wei Liu, Jian Luan, Bo Du, Rui Yan,
Abstract要約: 携帯電話のエージェントは、携帯電話で日々のタスクを自動化するのを助けることができる。既存のプロシージャ指向エージェントは、クロスアプリ命令で苦労する。我々はMobileStewardという自己進化型マルチエージェントフレームワークを提案する。
参考スコア（独自算出の注目度）: 45.7564684180131
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Mobile phone agents can assist people in automating daily tasks on their phones, which have emerged as a pivotal research spotlight. However, existing procedure-oriented agents struggle with cross-app instructions, due to the following challenges: (1) complex task relationships, (2) diverse app environment, and (3) error propagation and information loss in multi-step execution. Drawing inspiration from object-oriented programming principles, we recognize that object-oriented solutions is more suitable for cross-app instruction. To address these challenges, we propose a self-evolving multi-agent framework named MobileSteward, which integrates multiple app-oriented StaffAgents coordinated by a centralized StewardAgent. We design three specialized modules in MobileSteward: (1) Dynamic Recruitment generates a scheduling graph guided by information flow to explicitly associate tasks among apps. (2) Assigned Execution assigns the task to app-oriented StaffAgents, each equipped with app-specialized expertise to address the diversity between apps. (3) Adjusted Evaluation conducts evaluation to provide reflection tips or deliver key information, which alleviates error propagation and information loss during multi-step execution. To continuously improve the performance of MobileSteward, we develop a Memory-based Self-evolution mechanism, which summarizes the experience from successful execution, to improve the performance of MobileSteward. We establish the first English Cross-APP Benchmark (CAPBench) in the real-world environment to evaluate the agents' capabilities of solving complex cross-app instructions. Experimental results demonstrate that MobileSteward achieves the best performance compared to both single-agent and multi-agent frameworks, highlighting the superiority of MobileSteward in better handling user instructions with diverse complexity.
Abstract（参考訳）: 携帯電話のエージェントは、携帯電話で日々のタスクを自動化するのを助けることができる。しかし,既存のプロシージャ指向エージェントは,(1)複雑なタスク関係,(2)多様なアプリケーション環境,(3)多段階実行におけるエラーの伝搬と情報損失といった課題により,クロスアプリ命令に苦慮している。オブジェクト指向プログラミングの原則からインスピレーションを得て、オブジェクト指向のソリューションがアプリケーション間プログラミングにもっと適していると認識する。これらの課題に対処するために、集中型StewardAgentによって調整された複数のアプリケーション指向のStashAgentを統合する、MobileStewardという自己進化型マルチエージェントフレームワークを提案する。我々はMobileStewardに3つの特別なモジュールを設計する: 1)動的リクルートは情報フローによって導かれるスケジューリンググラフを生成し、アプリ間でタスクを明示的に関連付ける。 2) Assigned Execution はアプリ指向の StaffAgents にタスクを割り当て,アプリ間の多様性に対処するための専門知識をアプリとして備えている。 (3)適応評価は,多段階実行時の誤りの伝播や情報損失を軽減し,リフレクションチップの提供やキー情報の提供を行う。 MobileStewardの性能を継続的に改善するため,我々はメモリベースの自己進化機構を開発し,実行を成功させた経験を要約し,MobileStewardの性能を向上させる。我々は,複雑なクロスアプリケーション命令を解くエージェントの能力を評価するために,実環境における最初の英語クロスアプリケーションベンチマーク(CAPBench)を構築した。実験の結果、MobileStewardはシングルエージェントとマルチエージェントの両方のフレームワークと比較して最高のパフォーマンスを達成しており、多様な複雑さでユーザーインストラクションをうまく処理する上で、MobileStewardの優位性を強調している。

論文の概要: MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions

関連論文リスト