Fugu-MT 論文翻訳(概要): Adversarial Attacks on Reinforcement Learning-based Medical Questionnaire Systems: Input-level Perturbation Strategies and Medical Constraint Validation

論文の概要: Adversarial Attacks on Reinforcement Learning-based Medical Questionnaire Systems: Input-level Perturbation Strategies and Medical Constraint Validation

arxiv url: http://arxiv.org/abs/2508.05677v1
Date: Tue, 05 Aug 2025 11:10:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:05.923846
Title: Adversarial Attacks on Reinforcement Learning-based Medical Questionnaire Systems: Input-level Perturbation Strategies and Medical Constraint Validation
Title（参考訳）: 強化学習に基づく医療アンケートシステムに対する敵対的攻撃:入力レベルの摂動方略と医療制約検証
Authors: Peizhuo Liu,
Abstract要約: RLベースの医療アンケートシステムは、医療シナリオにおいて大きな可能性を示している。本研究は,潜在的な脆弱性を同定・解析するための敵攻撃手法に関する総合的な評価を行う。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: RL-based medical questionnaire systems have shown great potential in medical scenarios. However, their safety and robustness remain unresolved. This study performs a comprehensive evaluation on adversarial attack methods to identify and analyze their potential vulnerabilities. We formulate the diagnosis process as a Markov Decision Process (MDP), where the state is the patient responses and unasked questions, and the action is either to ask a question or to make a diagnosis. We implemented six prevailing major attack methods, including the Fast Gradient Signed Method (FGSM), Projected Gradient Descent (PGD), Carlini & Wagner Attack (C&W) attack, Basic Iterative Method (BIM), DeepFool, and AutoAttack, with seven epsilon values each. To ensure the generated adversarial examples remain clinically plausible, we developed a comprehensive medical validation framework consisting of 247 medical constraints, including physiological bounds, symptom correlations, and conditional medical constraints. We achieved a 97.6% success rate in generating clinically plausible adversarial samples. We performed our experiment on the National Health Interview Survey (NHIS) dataset (https://www.cdc.gov/nchs/nhis/), which consists of 182,630 samples, to predict the participant's 4-year mortality rate. We evaluated our attacks on the AdaptiveFS framework proposed in arXiv:2004.00994. Our results show that adversarial attacks could significantly impact the diagnostic accuracy, with attack success rates ranging from 33.08% (FGSM) to 64.70% (AutoAttack). Our work has demonstrated that even under strict medical constraints on the input, such RL-based medical questionnaire systems still show significant vulnerabilities.
Abstract（参考訳）: RLベースの医療アンケートシステムは、医療シナリオにおいて大きな可能性を示している。しかし、その安全性と堅牢性は未解決のままである。本研究は,潜在的な脆弱性を同定・解析するための敵攻撃手法に関する総合的な評価を行う。診断過程をMDP (Markov Decision Process) として定式化し, その状態が患者の反応であり, 不明な質問であり, 質問を行うか, 診断を行うかのどちらかである。我々はFGSM(Fast Gradient Signed Method)、PGD(Projected Gradient Descent)、Carini & Wagner attack(C&W)、Basic Iterative Method(BIM)、DeepFool(DeepFool)、AutoAttack(AutoAttack)の6つの主要な攻撃手法を実装した。本研究は, 臨床診断の妥当性を確保するために, 生理的制約, 症状相関, 条件付き医学的制約を含む247の医学的制約からなる総合的医学的検証フレームワークを開発した。臨床的に妥当な逆行性サンプルを作製し,97.6%の成功率を得た。全国健康調査(NHIS)データセット(https://www.cdc.gov/nchs/nhis/)を用いて,被験者の4年間の死亡率を予測する実験を行った。我々は、arXiv:2004.00994で提案されたAdaptiveFSフレームワークに対する攻撃を評価した。その結果, 対人攻撃は診断精度に大きな影響を与え, 攻撃成功率は33.08% (FGSM) から64.70% (AutoAttack) まで様々であった。我々の研究は、入力に厳しい医学的制約があっても、そのようなRLベースの医療アンケートシステムには重大な脆弱性があることを示した。

論文の概要: Adversarial Attacks on Reinforcement Learning-based Medical Questionnaire Systems: Input-level Perturbation Strategies and Medical Constraint Validation

関連論文リスト