Fugu-MT 論文翻訳(概要): Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering

論文の概要: Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering

arxiv url: http://arxiv.org/abs/2508.11824v1
Date: Fri, 15 Aug 2025 22:13:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:10.395204
Title: Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering
Title（参考訳）: 自律性を再考する - AI駆動ソフトウェアエンジニアリングの失敗を防ぐ
Authors: Satyam Kumar Navneet, Joydeep Chandra,
Abstract要約: SAFE-AI Frameworkは、安全性、監査可能性、フィードバック、説明可能性を強調した総合的なアプローチである。我々は、リスク評価と監視を導くために、提案的、生成的、自律的、破壊的なアクションを分類する、AI行動の新しい分類法を導入する。この記事では、EU AI ActやカナダのAIDAといった新たな規則に沿って、ソフトウェアエンジニアリングにおける責任あるAI統合のためのロードマップを提供する。
参考スコア（独自算出の注目度）: 1.6766200616088744
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The integration of Large Language Models (LLMs) into software engineering has revolutionized code generation, enabling unprecedented productivity through promptware and autonomous AI agents. However, this transformation introduces significant risks, including insecure code generation, hallucinated outputs, irreversible actions, and a lack of transparency and accountability. Incidents like the Replit database deletion underscore the urgent need for robust safety and governance mechanisms. This paper comprehensively analyzes the inherent challenges of LLM-assisted code generation, such as vulnerability inheritance, overtrust, misinterpretation, and the absence of standardized validation and rollback protocols. To address these, we propose the SAFE-AI Framework, a holistic approach emphasizing Safety, Auditability, Feedback, and Explainability. The framework integrates guardrails, sandboxing, runtime verification, risk-aware logging, human-in-the-loop systems, and explainable AI techniques to mitigate risks while fostering trust and compliance. We introduce a novel taxonomy of AI behaviors categorizing suggestive, generative, autonomous, and destructive actions to guide risk assessment and oversight. Additionally, we identify open problems, including the lack of standardized benchmarks for code specific hallucinations and autonomy levels, and propose future research directions for hybrid verification, semantic guardrails, and proactive governance tools. Through detailed comparisons of autonomy control, prompt engineering, explainability, and governance frameworks, this paper provides a roadmap for responsible AI integration in software engineering, aligning with emerging regulations like the EU AI Act and Canada's AIDA to ensure safe, transparent, and accountable AI-driven development.
Abstract（参考訳）: 大規模言語モデル(LLM)をソフトウェア工学に統合することは、コード生成に革命をもたらし、プロンプトウェアや自律型AIエージェントを通じて前例のない生産性を実現する。しかし、このトランスフォーメーションは、安全でないコード生成、幻覚的なアウトプット、不可逆的なアクション、透明性と説明責任の欠如など、重大なリスクをもたらす。 Replitデータベースの削除のようなインシデントは、堅牢な安全性とガバナンスメカニズムに対する緊急の必要性を浮き彫りにしている。本稿では、脆弱性継承、過信、誤解釈、標準化されたバリデーションとロールバックプロトコルの欠如など、LLM支援コード生成の固有の課題を包括的に分析する。これらの問題に対処するために,安全,聴取性,フィードバック,説明可能性を重視した総合的なアプローチであるSAFE-AIフレームワークを提案する。このフレームワークは、ガードレール、サンドボックス、実行時検証、リスク対応ロギング、ヒューマン・イン・ザ・ループ・システム、そして説明可能なAI技術を統合して、信頼性とコンプライアンスを促進しながらリスクを軽減する。我々は、リスク評価と監視を導くために、提案的、生成的、自律的、破壊的なアクションを分類する、AI行動の新しい分類法を導入する。さらに、コード固有の幻覚と自律レベルのための標準ベンチマークの欠如など、オープンな問題を特定し、ハイブリッド検証、セマンティックガードレール、プロアクティブガバナンスツールの将来的な研究方向性を提案する。自律性制御、迅速なエンジニアリング、説明可能性、およびガバナンスフレームワークの詳細な比較を通じて、本論文は、安全で透明で説明可能なAI駆動開発を保証するために、EU AI ActやカナダのAIDAといった新たな規制と整合した、ソフトウェアエンジニアリングにおける責任あるAI統合のためのロードマップを提供する。

論文の概要: Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering

関連論文リスト