Fugu-MT 論文翻訳(概要): Beyond Fixed Thresholds and Domain-Specific Benchmarks for Explainable Multi-Task Classification in Autonomous Vehicles

論文の概要: Beyond Fixed Thresholds and Domain-Specific Benchmarks for Explainable Multi-Task Classification in Autonomous Vehicles

arxiv url: http://arxiv.org/abs/2605.04299v1
Date: Tue, 05 May 2026 21:07:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-07 18:41:07.546063
Title: Beyond Fixed Thresholds and Domain-Specific Benchmarks for Explainable Multi-Task Classification in Autonomous Vehicles
Title（参考訳）: 自律走行車における説明可能なマルチタスク分類のための固定閾値とドメイン特化ベンチマークを超えて
Authors: Maryam Sadat Hosseini Azad, Shahriar Baradaran Shokouhi,
Abstract要約: シーン理解は、ディープラーニングモデルの使用を必要とする自律運転システムにおいて重要な部分である。本稿では,各タスクに対する最適決定境界を特定するために,様々なしきい値を評価する信頼しきい値感度分析を導入する。 IUST-XAI-ADは、人間のアノテーションを付加した958枚の画像からなる新しいデータセットである。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Scene understanding is a vital part of autonomous driving systems, which requires the use of deep learning models. Deep learning methods are intrinsically black box models, which lack transparency and safety in autonomous driving. To make these systems transparent, multi-task visual understanding has become crucial for explainable autonomous driving perception systems, where simultaneous prediction of multiple driving behaviors and their underlying explanations is essential for safe navigation and human trust in autonomous vehicles. In order to design an accurate and cross-cultural explainable autonomous driving system, we introduce a comprehensive confidence threshold sensitivity analysis that evaluates various threshold values to identify optimal decision boundaries for different tasks. Our analysis demonstrates that traditional fixed threshold approaches are suboptimal for multi-task scenarios. Through extensive evaluation, we demonstrate that our adaptive threshold selection methodology improves F1-scores across different tasks. In addition, we introduce IUST-XAI-AD, a novel dataset consisting of 958 images with human annotations for driving decisions and corresponding reasoning. This dataset addresses the critical gap in domain-specific evaluation benchmarks for distinct driving contexts and provides a more challenging test environment compared to existing datasets. Experimental results demonstrate that confidence threshold sensitivity analysis can significantly improve model performance, while the introduction of the IUST-XAI-AD dataset reveals important insights about cross-cultural driving behavior patterns. The combined contributions of this work provide both methodological advances and practical evaluation tools that can accelerate the development of more reliable, explainable, and culturally-adaptive autonomous driving systems for global deployment.
Abstract（参考訳）: シーン理解は、ディープラーニングモデルの使用を必要とする自律運転システムにおいて重要な部分である。ディープラーニングは本質的にブラックボックスモデルであり、自律運転における透明性と安全性を欠いている。これらのシステムを透過的にするためには、複数の運転行動の同時予測と、その基礎となる説明が、安全なナビゲーションと自動運転車に対する人間の信頼に不可欠である、説明可能な自律運転認識システムにおいて、マルチタスクの視覚的理解が不可欠になっている。高精度で異文化横断的な自律運転システムを設計するために,様々なしきい値を評価する総合的信頼しきい値感度分析を導入し,タスクの最適決定境界を同定する。本分析は,従来の固定しきい値アプローチがマルチタスクシナリオに最適であることを示す。適応しきい値選択手法がF1スコアを様々なタスクで改善することを示す。また,IUST-XAI-ADは,人間のアノテーションを付加した958枚の画像からなる新しいデータセットである。このデータセットは、異なる駆動コンテキストに対するドメイン固有の評価ベンチマークの致命的なギャップに対処し、既存のデータセットと比較してより困難なテスト環境を提供する。 IUST-XAI-ADデータセットの導入は、異文化間運転行動パターンに関する重要な洞察を明らかにする一方で、信頼性閾値感度分析がモデル性能を著しく向上させることを示した。本研究のコントリビューションは, より信頼性が高く, 説明可能な, 文化的に適応可能な自動運転システムの開発を加速する, 方法論的進歩と実践的評価ツールの両方を提供する。

論文の概要: Beyond Fixed Thresholds and Domain-Specific Benchmarks for Explainable Multi-Task Classification in Autonomous Vehicles

関連論文リスト