Fugu-MT 論文翻訳(概要): Are AlphaZero-like Agents Robust to Adversarial Perturbations?

論文の概要: Are AlphaZero-like Agents Robust to Adversarial Perturbations?

arxiv url: http://arxiv.org/abs/2211.03769v1
Date: Mon, 7 Nov 2022 18:43:25 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-08 16:26:56.526881
Title: Are AlphaZero-like Agents Robust to Adversarial Perturbations?
Title（参考訳）: alphazeroライクなエージェントは敵対的摂動に対して堅牢か?
Authors: Li-Cheng Lan, Huan Zhang, Ti-Rong Wu, Meng-Yu Tsai, I-Chen Wu, Cho-Jui Hsieh
Abstract要約: AlphaZero(AZ)は、ニューラルネットワークベースのGo AIが人間のパフォーマンスを大きく上回ることを示した。私たちは、Go AIが驚くほど間違った行動を起こさせる可能性のある、敵対的な状態が存在するかどうか尋ねる。我々は、Go AIに対する最初の敵攻撃を開発し、探索空間を戦略的に減らし、効率よく敵の状態を探索する。
参考スコア（独自算出の注目度）: 73.13944217915089
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The success of AlphaZero (AZ) has demonstrated that neural-network-based Go AIs can surpass human performance by a large margin. Given that the state space of Go is extremely large and a human player can play the game from any legal state, we ask whether adversarial states exist for Go AIs that may lead them to play surprisingly wrong actions. In this paper, we first extend the concept of adversarial examples to the game of Go: we generate perturbed states that are ``semantically'' equivalent to the original state by adding meaningless moves to the game, and an adversarial state is a perturbed state leading to an undoubtedly inferior action that is obvious even for Go beginners. However, searching the adversarial state is challenging due to the large, discrete, and non-differentiable search space. To tackle this challenge, we develop the first adversarial attack on Go AIs that can efficiently search for adversarial states by strategically reducing the search space. This method can also be extended to other board games such as NoGo. Experimentally, we show that the actions taken by both Policy-Value neural network (PV-NN) and Monte Carlo tree search (MCTS) can be misled by adding one or two meaningless stones; for example, on 58\% of the AlphaGo Zero self-play games, our method can make the widely used KataGo agent with 50 simulations of MCTS plays a losing action by adding two meaningless stones. We additionally evaluated the adversarial examples found by our algorithm with amateur human Go players and 90\% of examples indeed lead the Go agent to play an obviously inferior action. Our code is available at \url{https://PaperCode.cc/GoAttack}.
Abstract（参考訳）: AlphaZero(AZ)の成功は、ニューラルネットワークベースのGo AIが人間のパフォーマンスを大きく上回ることを示した。 Goの国家空間が極めて大きく、人間のプレイヤーが法的状態からゲームをすることができることを考慮すれば、Go AIに対して敵国が存在するかどうかを問う。本稿では,まず,goゲームに敵の例の概念を最初に拡張する。我々は,ゲームに意味のない動きを加えることによって,本来の状態と同値である‘semantically’となる摂動状態を生成し,その逆の状態を,go初心者にとっても明らかな劣った動作につながる摂動状態とする。しかし、逆境状態の探索は、大きくて離散的で、非微分可能な探索空間のため困難である。この課題に取り組むため,我々は,検索空間を戦略的に縮小することにより,効率的に敵国を探索できる,go aisに対する最初の敵対的攻撃を開発した。この方法は、NoGoのような他のボードゲームにも拡張できる。例えば,AlphaGo Zero の 58 % の自己プレイゲームでは,MCTS の 50 個のシミュレーションで広く使われている KataGo エージェントが,2 個の無意味なストーンを追加することで,その動作を損なうことができる。さらに,このアルゴリズムで見いだされた敵の例をアマチュアの人間goプレーヤーで評価し,その90%はgoエージェントに明らかに劣るアクションをさせた。私たちのコードは \url{https://PaperCode.cc/GoAttack} で利用可能です。

論文の概要: Are AlphaZero-like Agents Robust to Adversarial Perturbations?

関連論文リスト