Fugu-MT 論文翻訳(概要): Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries

論文の概要: Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries

arxiv url: http://arxiv.org/abs/2605.06223v2
Date: Fri, 08 May 2026 13:20:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 16:31:23.133098
Title: Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries
Title（参考訳）: 曖昧なユーザクエリに対する比較判断を用いたアクティブなインスタンスナビゲーション
Authors: Junhyuk Kwon, Seungjoon Lee, Hyejin Park, Kyle Min, Jungseul Ok,
Abstract要約: 比較判断(ProCompNav)を用いた能動型インスタンスナビゲーションを提案する。 ProCompNavはまず候補プールを構築し、比較判定によってターゲットを識別する。 CoIN-Benchでは、ProCompNavはインタラクティブなベースラインよりも、最小限の入力と非インタラクティブなベースラインを詳細な説明で改善する。
参考スコア（独自算出の注目度）: 24.694719593521594
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Natural-language instance navigation becomes challenging when the initial user request does not uniquely specify the target instance. A practical agent should reduce the user's burden by actively asking only the information needed to distinguish the target from similar distractors, rather than requiring a detailed description upfront. Existing approaches often fall short of this goal: they may stop at the first plausible candidate before sufficiently exploring alternatives, or, even after collecting multiple candidates, ask about the target's attributes derived from individual candidates rather than questions selected to distinguish candidates in the pool. As a result, despite the dialogue, the agent may still fail to distinguish the target from distractors, leading to premature decisions and lengthy user responses. We propose Proactive Instance Navigation with Comparative Judgment (ProCompNav), a two-stage framework that first constructs a candidate pool and then identifies the target through comparative judgment. At each round, ProCompNav extracts an attribute-value pair that splits the current pool, asks a binary yes/no question, and prunes all inconsistent candidates at once. This reframes disambiguation from open-ended target description to pool-level discriminative questioning, where each question is chosen to narrow the candidate set. On CoIN-Bench, ProCompNav improves Success Rate over interactive baselines with the same minimal input and non-interactive baselines with detailed descriptions, while substantially reducing Response Length. ProCompNav also achieves state-of-the-art Success Rate on TextNav, suggesting that comparative judgment is broadly useful for instance-level navigation among similar distractors.
Abstract（参考訳）: 自然言語のインスタンスナビゲーションは、初期ユーザ要求がターゲットインスタンスをユニークに指定していない場合に困難になる。具体的エージェントは、詳細な説明を事前に必要とせず、ターゲットを類似の邪魔者から区別するために必要な情報のみを積極的に求めることで、ユーザの負担を軽減するべきである。既存のアプローチは、しばしばこの目標に欠ける: 選択肢を十分に探求する前に、最初の妥当な候補に立ち寄る、あるいは、複数の候補を集めた後でも、プールの候補者を区別するために選択された質問よりも、個々の候補から派生したターゲットの属性について尋ねる。結果として、対話にも拘わらず、エージェントは依然として標的を邪魔者と区別することができず、早急な判断と長いユーザーの反応につながる可能性がある。提案するProactive Instance Navigation with Comparison Judgment(ProCompNav)は,まず候補プールを構築し,そのターゲットを比較判定により識別する2段階のフレームワークである。各ラウンドでProCompNavは属性値のペアを抽出し、現在のプールを分割し、バイナリーのyes/noを問う。これにより、オープンエンドのターゲット記述からプールレベルの差別的質問への曖昧さが再定義され、各質問が候補セットを狭めるために選択される。 CoIN-Benchでは、ProCompNavは、最小限の入力と非インタラクティブのベースラインでインタラクティブなベースラインよりも成功率を向上させると同時に、レスポンス長を大幅に削減する。 ProCompNavはTextNavの最先端の成功率も達成している。

論文の概要: Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries

関連論文リスト