Fugu-MT 論文翻訳(概要): AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems

論文の概要: AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems

arxiv url: http://arxiv.org/abs/2605.23459v1
Date: Fri, 22 May 2026 10:19:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-25 17:29:20.308061
Title: AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems
Title（参考訳）: AI保証: エンタープライズAIシステムのための総合的なテスト戦略
Authors: Chitra Badagi, Divye Singh, Animesh Sen, Adinath Shirsath,
Abstract要約: 本稿では,3つの原則に基づいて構築されたエンタープライズAIシステムに対する包括的保証戦略を提案する。まず、AIテストは厳格な正当性検証ではなく、継続的なリスク低減に焦点を当てるべきである。第3に、AI保証の失敗は、従来の決定論的ソフトウェアシステムに見られるものと根本的に異なる組織的影響をもたらす可能性がある。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Enterprise AI systems, built on large language models, retrieval pipelines and autonomous agents, introduce a class of risks that traditional software quality assurance was never designed to address. These systems are probabilistic, context-sensitive and emergent: they cannot be verified to be correct in the classical sense, but only evaluated with increasing confidence. This paper presents a comprehensive assurance strategy for enterprise AI systems built around three key principles: first, that AI testing should focus on continuous risk reduction rather than strict correctness verification; second, that evaluation must be treated as a core engineering discipline alongside development; and third, that failures in AI assurance can lead to organizational impacts that are fundamentally different from those seen in traditional deterministic software systems. We introduce a structured AI Failure Taxonomy, propose a revised five-layer AI Assurance Pyramid and provide operational guidance on evaluation-driven development, RAG system testing, model lifecycle management and governance. The goal is to equip engineering leaders and practitioners with a strategy that is both philosophically grounded and operationally deployable.
Abstract（参考訳）: 大規模言語モデル、検索パイプライン、自律エージェントに基づいて構築されたエンタープライズAIシステムは、従来のソフトウェア品質保証が対処するように設計されていなかった、一連のリスクを導入している。これらのシステムは確率的であり、文脈に敏感で、創発的であり、古典的な意味では正しいと証明することはできないが、信頼性を高めてのみ評価される。本稿では,AIテストが厳密な正当性検証ではなく,継続的なリスク低減に重点を置くこと,AIテストが開発に伴う中核的な技術規律として扱われること,AIアシュアランスの失敗が従来の決定論的ソフトウェアシステムと根本的に異なる組織的影響をもたらすこと,の3つの原則に基づいて構築された企業AIシステムに対する包括的保証戦略を提案する。我々は、構造化されたAI失敗分類を導入し、改訂された5層AI保証ピラミッドを提案し、評価駆動開発、RAGシステムテスト、モデルライフサイクル管理、ガバナンスに関する運用ガイダンスを提供する。目標は、哲学的基盤と運用的デプロイの両方が可能な戦略を、エンジニアリングリーダと実践者に提供することです。

論文の概要: AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems

関連論文リスト