Fugu-MT 論文翻訳(概要): Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases

論文の概要: Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases

arxiv url: http://arxiv.org/abs/2603.08760v1
Date: Sun, 08 Mar 2026 16:25:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-11 15:25:23.729329
Title: Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases
Title（参考訳）: AIのフロンティア・セーフティ・ケースの基礎を再考
Authors: Shaun Feakins, Ibrahim Habli, Phillip Morgan,
Abstract要約: 本稿では,フロンティアAIシステムの安全性に関する最近の議論に寄与する。安全ケースは構造化されており、特定のコンテキストにおいてシステムが確実に安全にデプロイできるという防御可能な主張である。その結果、フロンティアAIの安全性のケースが注目されている。
参考スコア（独自算出の注目度）: 1.0170129555792935
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper contributes to the nascent debate around safety cases for frontier AI systems. Safety cases are structured, defensible arguments that a system is acceptably safe to deploy in a given context. Historically, they have been used in safety-critical industries, such as aerospace, nuclear or automotive. As a result, safety cases for frontier AI have risen in prominence, both in the safety policies of leading frontier developers and in international research agendas proposed by leaders in generative AI, such as the Singapore Consensus on Global AI Safety Research Priorities and the International AI Safety Report. This paper appraises this work. We note that research conducted within the alignment community which draws explicitly on lessons from the assurance community has significant limitations. We therefore aim to rethink existing approaches to alignment safety cases. We offer lessons from existing methodologies within safety assurance and outline the limitations involved in the alignment community's current approach. Building on this foundation, we present a case study for a safety case focused on Deceptive Alignment and CBRN capabilities, drawing on existing, theoretical safety case "sketches" created by the alignment safety case community. Overall, we contribute holistic insights from the field of safety assurance via rigorous theory and methodologies that have been applied in safety-critical contexts. We do so in order to create a better foundational framework for robust, defensible and useful safety case methodologies which can help to assure the safety of frontier AI systems.
Abstract（参考訳）: 本稿では,フロンティアAIシステムの安全性に関する最近の議論に寄与する。安全ケースは構造化されており、特定のコンテキストにおいてシステムが確実に安全にデプロイできるという防御可能な主張である。歴史的には、航空宇宙、原子力、自動車などの安全上重要な産業で使用されている。その結果、フロンティアAIの安全ケースは、先進的なフロンティア開発者の安全政策と、シンガポール国際AI安全研究優先条約(英語版)や国際AI安全レポート(英語版)など、ジェネレーティブAIのリーダーが提案する国際研究課題の両方において、注目されている。この論文は、この作品を評価している。本研究は,アライメントコミュニティにおいて,アライメントコミュニティからの教訓を明示的に取り入れた研究には,重大な制限があることに留意する。したがって、我々は、既存の安全事例の整合化アプローチを再考することを目指している。我々は、安全保証の既存の方法論から教訓を提供し、アライメントコミュニティの現在のアプローチにかかわる限界を概説する。本財団を基盤として,アライメント・アライメント・アライメントとCBRN機能に着目した安全事例のケーススタディを,アライメント・アライメント・アライメント・アライメント・アライメント・ケース・コミュニティが生み出した,既存の理論上の安全事例「スケッチ」に基づいて提示する。本研究は,安全性に批判的な文脈で適用された厳密な理論と方法論を通じて,安全保証の分野からの総合的な洞察を貢献する。私たちは、フロンティアAIシステムの安全性を保証するのに役立つ、堅牢で、防御可能な、有用な安全ケース方法論のための、より良い基盤となるフレームワークを構築するために、そうしています。

論文の概要: Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases

関連論文リスト