Fugu-MT 論文翻訳(概要): Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction

論文の概要: Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction

arxiv url: http://arxiv.org/abs/2603.18976v1
Date: Thu, 19 Mar 2026 14:41:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-20 17:19:06.204693
Title: Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction
Title（参考訳）: ヒトとAIの相互作用における直観的アライメントのための5W3H構造的プロンプトの評価
Authors: Peng Gang,
Abstract要約: 我々は人間-AIインタラクションにおける構造化意図表現の枠組みであるPSを評価する。 3つのドメイン(ビジネス、技術、旅行)で60のタスクを調査する。構造化された意図表現は、人間とAIの相互作用におけるアライメントとユーザビリティを向上させることができる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Natural language prompts often suffer from intent transmission loss: the gap between what users actually need and what they communicate to AI systems. We evaluate PPS (Prompt Protocol Specification), a 5W3H-based framework for structured intent representation in human-AI interaction. In a controlled three-condition study across 60 tasks in three domains (business, technical, and travel), three large language models (DeepSeek-V3, Qwen-Max, and Kimi), and three prompt conditions - (A) simple prompts, (B) raw PPS JSON, and (C) natural-language-rendered PPS - we collect 540 AI-generated outputs evaluated by an LLM judge. We introduce goal_alignment, a user-intent-centered evaluation dimension, and find that rendered PPS outperforms both simple prompts and raw JSON on this metric. PPS gains are task-dependent: gains are large in high-ambiguity business analysis tasks but reverse in low-ambiguity travel planning. We also identify a measurement asymmetry in standard LLM evaluation, where unconstrained prompts can inflate constraint adherence scores and mask the practical value of structured prompting. A preliminary retrospective survey (N = 20) further suggests a 66.1% reduction in follow-up prompts required, from 3.33 to 1.13 rounds. These findings suggest that structured intent representations can improve alignment and usability in human-AI interaction, especially in tasks where user intent is inherently ambiguous.
Abstract（参考訳）: 自然言語のプロンプトは、ユーザが実際に必要とするものと、AIシステムと通信するものとの間のギャップという、意図的な伝達損失に悩まされることが多い。人間のAIインタラクションにおける構造化意図表現のための5W3HベースのフレームワークであるPS(Prompt Protocol Specification)を評価する。 3つの領域における60のタスク(ビジネス、技術、旅行)、3つの大きな言語モデル(DeepSeek-V3、Qwen-Max、Kimi)、および3つの迅速な条件 - (A)単純なプロンプト、(B)生PS JSON、(C)自然言語レンダリングPS - の3条件調査において、LLM判事が評価した540のAI生成アウトプットを収集した。 goal_alignmentは、ユーザインテント中心の評価ディメンションであり、レンダリングされたPSSは、このメトリクス上で単純なプロンプトと生のJSONの両方より優れています。 PPSゲインはタスク依存であり、高あいまいなビジネス分析タスクでは大きなゲインであるが、低あいまいな旅行計画では逆になる。また、制限のないプロンプトが制約の順守スコアをインフレーションし、構造化プロンプトの実用的価値を隠蔽する、標準LCM評価における測定非対称性を同定する。予備のレトロスペクティブ調査(N = 20)では、さらに3.33ラウンドから1.13ラウンドまで、66.1%のフォローアッププロンプトの削減が示されている。これらの結果から,構造化意図表現は,特にユーザ意図が本質的に曖昧なタスクにおいて,人間とAIのインタラクションにおけるアライメントとユーザビリティを向上させることが示唆された。

論文の概要: Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction

関連論文リスト