Fugu-MT 論文翻訳(概要): SPHINX: First Explain, Then Explore

論文の概要: SPHINX: First Explain, Then Explore

arxiv url: http://arxiv.org/abs/2606.17482v1
Date: Tue, 16 Jun 2026 03:48:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.256049
Title: SPHINX: First Explain, Then Explore
Title（参考訳）: SPHINX:最初の説明と探索
Authors: Nguyen Do, Tue M. Cao, Tien Van Do, András Hajdu, Tamás Bérczes, My T. Thai,
Abstract要約: 我々は、運転方針の故障診断に基づいて、敵のシーンを生成するべきであると論じる。本稿では,逆シナリオ合成のためのクローズドループフレームワークについて,まず説明し,探索する。
参考スコア（独自算出の注目度）: 9.461724985210312
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generating adversarial driving scenarios is critical for evaluating and improving autonomous vehicle decision-making systems in simulation. Recent approaches, such as ChatScene and LLM-Attacker, rely primarily on the prior knowledge of Large Language Models and Vision-Language Models to generate driving scenarios procedurally. We argue that adversarial scenes should be generated based on the failure diagnosis (e.g., indecisiveness, multi-frame inconsistency) of the driving policy to specifically address the policy's weaknesses instead of relying on prior assumptions. In this paper, we propose SPHINX, a closed-loop framework for adversarial scenario synthesis guided by a simple principle: first explain, then explore. Beyond blindly exploring the scenario space, SPHINX leverages explainable artificial intelligence methods to analyze the policy, identifying key visual concepts and their influence on policy outputs, and the uncertainty of the decisions. Given the interpretable evidence extracted from the policy's own decision process, we use a vision language model to rationalize and criticize failure modes of the current policy. These critics are then used to generate targeted adversarial scenarios for policy retraining and improvement. We demonstrate that SPHINX can highlight an interpretable account of policy failures while other adversarial scene generation cannot. Across the evaluated benchmarks and test suites, SPHINX can be applied to diverse state-of-the-art autonomous vehicle architectures and yields consistent robustness improvements over existing scenario-generation methods.
Abstract（参考訳）: 対向運転シナリオの生成は、シミュレーションにおける自動運転車の意思決定システムの評価と改善に不可欠である。 ChatScene や LLM-Attacker のような最近のアプローチは、主に手続き的に駆動シナリオを生成するために、大規模言語モデルや視覚言語モデルの事前知識に依存している。我々は,事前の仮定に頼るのではなく,政策の弱点に特異的に対処するために,運転方針の障害診断(例えば,不決定性,複数フレームの不整合)に基づいて,敵のシーンを生成するべきだと論じる。本稿では,SPHINXを提案する。SPHINXは単純な原理で導かれる逆シナリオ合成のためのクローズドループフレームワークで,まず説明し,探索する。シナリオ空間を盲目的に探索するだけでなく、SPHINXは説明可能な人工知能手法を利用してポリシーを分析し、重要な視覚概念と政策出力への影響を特定し、決定の不確実性を判断する。政策自身の意思決定プロセスから抽出された解釈可能な証拠を考慮し,現在の政策の失敗モードを合理化し,批判するために視覚言語モデルを用いる。これらの批評家は、政策の再訓練と改善のための標的となる敵シナリオを生成するために使用される。我々は、SPHINXが、他の敵のシーン生成ができないのに対して、政策失敗の解釈可能な説明をハイライトできることを実証した。評価されたベンチマークとテストスイート全体にわたって、SPHINXはさまざまな最先端の自動運転車アーキテクチャに適用でき、既存のシナリオ生成手法よりも一貫した堅牢性の向上をもたらす。

論文の概要: SPHINX: First Explain, Then Explore

関連論文リスト