Fugu-MT 論文翻訳(概要): GUISpector: An MLLM Agent Framework for Automated Verification of Natural Language Requirements in GUI Prototypes

論文の概要: GUISpector: An MLLM Agent Framework for Automated Verification of Natural Language Requirements in GUI Prototypes

arxiv url: http://arxiv.org/abs/2510.04791v1
Date: Mon, 06 Oct 2025 13:15:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:59.874592
Title: GUISpector: An MLLM Agent Framework for Automated Verification of Natural Language Requirements in GUI Prototypes
Title（参考訳）: GUISpector:GUIプロトタイプにおける自然言語要求の自動検証のためのMLLMエージェントフレームワーク
Authors: Kristian Kolthoff, Felix Kretzer, Simone Paolo Ponzetto, Alexander Maedche, Christian Bartelt,
Abstract要約: 本稿では,GUIプロトタイプにおけるNL要求の自動検証にマルチモーダル(M)LLMエージェントを利用する新しいフレームワークを提案する。 GuiSpectorはエージェントの検証プロセスから詳細なNLフィードバックを抽出し、開発者に実行可能な洞察を提供する。本稿では,これらの機能を統合化し,検証実行の監視,エージェントの合理性検査,エンドツーエンドの要件検証プロセスの管理を行うインターフェースを提供する。
参考スコア（独自算出の注目度）: 58.197090145723735
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: GUIs are foundational to interactive systems and play a pivotal role in early requirements elicitation through prototyping. Ensuring that GUI implementations fulfill NL requirements is essential for robust software engineering, especially as LLM-driven programming agents become increasingly integrated into development workflows. Existing GUI testing approaches, whether traditional or LLM-driven, often fall short in handling the complexity of modern interfaces, and typically lack actionable feedback and effective integration with automated development agents. In this paper, we introduce GUISpector, a novel framework that leverages a multi-modal (M)LLM-based agent for the automated verification of NL requirements in GUI prototypes. First, GUISpector adapts a MLLM agent to interpret and operationalize NL requirements, enabling to autonomously plan and execute verification trajectories across GUI applications. Second, GUISpector systematically extracts detailed NL feedback from the agent's verification process, providing developers with actionable insights that can be used to iteratively refine the GUI artifact or directly inform LLM-based code generation in a closed feedback loop. Third, we present an integrated tool that unifies these capabilities, offering practitioners an accessible interface for supervising verification runs, inspecting agent rationales and managing the end-to-end requirements verification process. We evaluated GUISpector on a comprehensive set of 150 requirements based on 900 acceptance criteria annotations across diverse GUI applications, demonstrating effective detection of requirement satisfaction and violations and highlighting its potential for seamless integration of actionable feedback into automated LLM-driven development workflows. The video presentation of GUISpector is available at: https://youtu.be/JByYF6BNQeE, showcasing its main capabilities.
Abstract（参考訳）: GUIはインタラクティブシステムの基本であり、プロトタイピングによる早期要求の誘発において重要な役割を担っている。 GUI実装がNL要件を満たすことを保証することは、堅牢なソフトウェアエンジニアリングにとって不可欠である。従来のGUIテストアプローチやLLM駆動のアプローチは、現代的なインターフェースの複雑さを扱うのに不足することが多く、一般的には実用的なフィードバックや自動開発エージェントとの効果的な統合が欠如している。本稿では,GUIプロトタイプにおけるNL要求の自動検証にマルチモーダル(M)LLMエージェントを利用する新しいフレームワークであるGUISpectorを紹介する。まず、GUISpectorはMLLMエージェントを適用してNL要件を解釈し、運用し、GUIアプリケーション間で検証トラジェクトリを自律的に計画し実行することができる。第二に、GUISpectorはエージェントの検証プロセスから詳細なNLフィードバックを体系的に抽出し、GUIアーティファクトを反復的に洗練したり、LLMベースのコード生成をクローズドなフィードバックループで直接通知できる実用的な洞察を提供する。第3に,これらの機能の統合,検証実行の監視,エージェントの合理性検査,エンドツーエンドの要件検証プロセス管理のためのアクセス可能なインターフェースを提供する。我々はGUISpectorを、多様なGUIアプリケーションにまたがる900の受け入れ基準アノテーションに基づいて、150の要件の包括的なセットで評価し、要求満足度と違反を効果的に検出し、自動LLM駆動開発ワークフローへの実行可能なフィードバックのシームレスな統合の可能性を強調した。 GUISpectorのビデオプレゼンテーションは、https://youtu.be/JByYF6BNQeEで公開されており、主な機能を示している。

論文の概要: GUISpector: An MLLM Agent Framework for Automated Verification of Natural Language Requirements in GUI Prototypes

関連論文リスト