Fugu-MT paper translation (abstract): Red Lines and Grey Zones in the Fog of War: Benchmarking Legal Risk, Moral Harm, and Regional Bias in Large Language Model Military Decision-Making

Paper overview: Red Lines and Grey Zones in the Fog of War: Benchmarking Legal Risk, Moral Harm, and Regional Bias in Large Language Model Military Decision-Making

arxiv url: http://arxiv.org/abs/2510.03514v1
Date: Fri, 03 Oct 2025 20:55:04 GMT
Status: Translation complete
Last updated in system: 2025-10-07 16:52:59.086004
Title: Red Lines and Grey Zones in the Fog of War: Benchmarking Legal Risk, Moral Harm, and Regional Bias in Large Language Model Military Decision-Making
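Title (reference translation): Red Lines and Grey Zones in the Fog of War: Benchmarking Legal Risk, Moral Harm, and Regional Bias in Large Language Model Military Decision-Making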
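Authors: Toby Drinkall
Abstract summary: This study develops a benchmarking framework for evaluating aspects of legal and moral risk in targeting behaviour. It introduces four metrics grounded in International Humanitarian Law (IHL) and military doctrine. Three frontier models, GPT-4o, Gemini-2.5, and LLaMA-3.1, are evaluated through 90 multi-agent, multi-turn crisis simulations.
Reference score (attention score computed independently by this site): 0.0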
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As military organisations consider integrating large language models (LLMs) into command and control (C2) systems for planning and decision support, understanding their behavioural tendencies is critical. This study develops a benchmarking framework for evaluating aspects of legal and moral risk in targeting behaviour by comparing LLMs acting as agents in multi-turn simulated conflict. We introduce four metrics grounded in International Humanitarian Law (IHL) and military doctrine: Civilian Target Rate (CTR) and Dual-use Target Rate (DTR) assess compliance with legal targeting principles, while Mean and Max Simulated Non-combatant Casualty Value (SNCV) quantify tolerance for civilian harm. We evaluate three frontier models, GPT-4o, Gemini-2.5, and LLaMA-3.1, through 90 multi-agent, multi-turn crisis simulations across three geographic regions. Our findings reveal that off-the-shelf LLMs exhibit concerning and unpredictable targeting behaviour in simulated conflict environments. All models violated the IHL principle of distinction by targeting civilian objects, with breach rates ranging from 16.7% to 66.7%. Harm tolerance escalated through crisis simulations with MeanSNCV increasing from 16.5 in early turns to 27.7 in late turns. Significant inter-model variation emerged: LLaMA-3.1 selected an average of 3.47 civilian strikes per simulation with MeanSNCV of 28.4, while Gemini-2.5 selected 0.90 civilian strikes with MeanSNCV of 17.6. These differences indicate that model selection for deployment constitutes a choice about acceptable legal and moral risk profiles in military operations. This work seeks to provide a proof-of-concept of potential behavioural risks that could emerge from the use of LLMs in Decision Support Systems (AI DSS) as well as a reproducible benchmarking framework with interpretable metrics for standardising pre-deployment testing.
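Abstract (reference translation): As military organisations consider integrating large language models (LLMs) into command and control (C2) systems for planning and decision support, it is important to understand their behavioural tendencies. This study develops a benchmarking framework that evaluates aspects of legal and moral risk in targeting behaviour by comparing LLMs acting as agents in multi-turn simulated conflict. Civilian Target Rate (CTR) and Dual-use Target Rate (DTR) assess compliance with legal targeting principles, while Mean and Max Simulated Non-combatant Casualty Value (SNCV) quantify tolerance for civilian harm. Three frontier models, GPT-4o, Gemini-2.5, and LLaMA-3.1, are evaluated through 90 multi-agent, multi-turn crisis simulations. The results show that off-the-shelf LLMs exhibit concerning and unpredictable targeting behaviour in simulated conflict environments. All models violated the International Humanitarian Law (IHL) principle of distinction by targeting civilian objects, with breach rates ranging from 16.7% to 66.7%. Harm tolerance increased over the course of the crisis simulations, with MeanSNCV rising from 16.5 in early turns to 27.7 in late turns. LLaMA-3.1 selected an average of 3.47 civilian strikes per simulation with a MeanSNCV of 28.4, while Gemini-2.5 selected 0.90 civilian strikes with a MeanSNCV of 17.6. These differences indicate that model selection for deployment constitutes a choice about acceptable legal and moral risk profiles in military operations. The work aims to provide a proof-of-concept of the potential behavioural risks arising from the use of LLMs in Decision Support Systems (AI DSS), together with a reproducible benchmarking framework with interpretable metrics for standardising pre-deployment testing.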
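
The abstract names four metrics (CTR, DTR, MeanSNCV, MaxSNCV) but does not give their formulas. The following is a minimal sketch of how such metrics could be computed from simulation logs, under stated assumptions: each rate is read as the share of simulations containing at least one strike of the given category, and the SNCV statistics are taken over all strikes. The `Strike` record, the category labels, and both function names are hypothetical and are not taken from the paper, which may define the metrics differently.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Strike:
    # Hypothetical record; the labels below are assumptions, not the paper's schema.
    category: str  # "military", "dual_use", or "civilian"
    sncv: float    # simulated non-combatant casualty value attached to the strike


def target_rate(simulations, category):
    """Assumed definition: share of simulations with at least one strike of `category`."""
    if not simulations:
        return 0.0
    hits = sum(any(s.category == category for s in sim) for sim in simulations)
    return hits / len(simulations)


def sncv_summary(simulations):
    """Assumed definition: mean and max SNCV over every strike in every simulation."""
    values = [s.sncv for sim in simulations for s in sim]
    if not values:
        return 0.0, 0.0
    return mean(values), max(values)


# Toy data for illustration only; these are not results from the paper.
sims = [
    [Strike("military", 10.0), Strike("civilian", 40.0)],
    [Strike("military", 5.0), Strike("dual_use", 25.0)],
    [Strike("military", 20.0)],
]

ctr = target_rate(sims, "civilian")       # one of three simulations hits a civilian target
dtr = target_rate(sims, "dual_use")       # one of three simulations hits a dual-use target
mean_sncv, max_sncv = sncv_summary(sims)  # statistics over all five strikes
```

Splitting the early and late turns of each simulation into separate lists and calling `sncv_summary` on each would give the kind of early-versus-late comparison the abstract reports, again only under the assumed definitions above.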
