Fugu-MT Paper Translation (Abstract): Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates

Paper abstract: Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates

arxiv url: http://arxiv.org/abs/2603.22525v1
Date: Mon, 23 Mar 2026 19:35:17 GMT
Status: Translation complete
System update date: 2026-03-25 19:53:37.170719
Title: Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates
Authors: Samrendra Roy, Kazuma Kobayashi, Souvik Chakraborty, Rizwan-uddin, Syed Bahauddin Alam
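Abstract summary: Operator learning models are emerging as the predictive core of digital twins for nuclear and energy systems. The paper shows that neural operators are acutely vulnerable to extremely sparse perturbations that exploit their sensitivity to boundary conditions. It exposes a previously overlooked attack surface in operator learning models and establishes that these models require guarantees beyond standard validation.
Reference score (independently calculated attention score): 4.190153363488523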
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Operator learning models are rapidly emerging as the predictive core of digital twins for nuclear and energy systems, promising real-time field reconstruction from sparse sensor measurements. Yet their robustness to adversarial perturbations remains uncharacterized, a critical gap for deployment in safety-critical systems. Here we show that neural operators are acutely vulnerable to extremely sparse (fewer than 1% of inputs), physically plausible perturbations that exploit their sensitivity to boundary conditions. Using gradient-free differential evolution across four operator architectures, we demonstrate that minimal modifications trigger catastrophic prediction failures, increasing relative $L_2$ error from $\sim$1.5% (validated accuracy) to 37-63% while remaining completely undetectable by standard validation metrics. Notably, 100% of successful single-point attacks pass z-score anomaly detection. We introduce the effective perturbation dimension $d_{\text{eff}}$, a Jacobian-based diagnostic that, together with sensitivity magnitude, yields a two-factor vulnerability model explaining why architectures with extreme sensitivity concentration (POD-DeepONet, $d_{\text{eff}} \approx 1$) are not necessarily the most exploitable, since low-rank output projections cap maximum error, while moderate concentration with sufficient amplification (S-DeepONet, $d_{\text{eff}} \approx 4$) produces the highest attack success. Gradient-free search outperforms gradient-based alternatives (PGD) on architectures with gradient pathologies, while random perturbations of equal magnitude achieve near-zero success rates, confirming that the discovered vulnerabilities are structural. Our findings expose a previously overlooked attack surface in operator learning models and establish that these models require robustness guarantees beyond standard validation before deployment.
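The attack setting described in the abstract (a gradient-free differential evolution search for a single-point input perturbation, scored by relative $L_2$ error and checked against a z-score detector) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy `surrogate` (a fixed random `tanh` layer standing in for a trained neural operator), the sizes `N_IN` and `N_OUT`, the budget `EPS`, the artificially sensitive input 0, and the form of the z-score check.

```python
import numpy as np
from scipy.optimize import differential_evolution

rng = np.random.default_rng(0)
N_IN, N_OUT, EPS = 100, 50, 0.5  # hypothetical sizes and perturbation budget

# Toy stand-in for a trained surrogate (NOT one of the paper's architectures).
W = rng.normal(size=(N_OUT, N_IN)) / np.sqrt(N_IN)
W[:, 0] *= 20.0  # assumption: input 0 behaves like a highly sensitive boundary value


def surrogate(u):
    return np.tanh(W @ u)


def rel_l2(y, y_ref):
    """Relative L2 error of y with respect to y_ref."""
    return np.linalg.norm(y - y_ref) / np.linalg.norm(y_ref)


def perturb(u, z):
    """Apply a single-point perturbation encoded as z = (index, delta)."""
    idx = min(int(z[0]), N_IN - 1)
    u_adv = u.copy()
    u_adv[idx] += z[1]
    return u_adv


def single_point_attack(u, seed=0):
    """Search over (index, delta) with differential evolution; no gradients used."""
    y_ref = surrogate(u)

    def objective(z):
        # Minimizing the negative error maximizes the prediction error.
        return -rel_l2(surrogate(perturb(u, z)), y_ref)

    res = differential_evolution(
        objective,
        bounds=[(0, N_IN), (-EPS, EPS)],
        seed=seed,
        maxiter=50,
        popsize=20,
        polish=False,  # skip the gradient-based polishing step
    )
    return perturb(u, res.x), -res.fun


def passes_zscore(u_adv, u_ref, threshold=3.0):
    """Assumed detector: flag any value more than `threshold` std from the clean mean."""
    z = np.abs(u_adv - u_ref.mean()) / u_ref.std()
    return bool(np.all(z < threshold))


u_clean = rng.normal(size=N_IN)
u_adv, attack_err = single_point_attack(u_clean)

# Random baseline: one randomly chosen input, perturbed within the same budget.
u_rand = perturb(u_clean, (rng.integers(N_IN), rng.uniform(-EPS, EPS)))
random_err = rel_l2(surrogate(u_rand), surrogate(u_clean))

print(f"attack error: {attack_err:.3f}, random error: {random_err:.3f}")
print("passes z-score check:", passes_zscore(u_adv, u_clean))
```

The search variable is only two-dimensional (which input to modify and by how much), which keeps the perturbation sparse by construction. In this toy setup the optimizer is free to pick any input, so whether it lands on the artificially sensitive one depends on the clean sample; the numbers it prints say nothing about the error levels reported in the abstract.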
