Fugu-MT 論文翻訳(概要): Easy-to-Use Shielding for Reinforcement Learning

論文の概要: Easy-to-Use Shielding for Reinforcement Learning

arxiv url: http://arxiv.org/abs/2606.03804v1
Date: Tue, 02 Jun 2026 15:50:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:05.126776
Title: Easy-to-Use Shielding for Reinforcement Learning
Title（参考訳）: 強化学習のための使い易いシールド
Authors: Stefan Pranger, Bettina Könighofer,
Abstract要約: シールドは、アクションセーフティを決定するための環境モデルという形でドメイン知識を仮定するテクニックである。シールドの適用には、通常、正式な手法と実質的なエンジニアリング作業の専門知識が必要である。我々はシールド合成ツールであるTempestを安全なRLのための実用的なバックエンドに拡張する。
参考スコア（独自算出の注目度）: 4.640835690336653
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Safe exploration is a key challenge in Reinforcement Learning (RL) that aims to prevent agents from making harmful decisions while exploring their environment. Safe exploration is a key challenge in Reinforcement Learning (RL) that aims to prevent agents from making harmful decisions while exploring their environment. Shielding is one such technique that assumes domain knowledge in the form of an environment model to decide upon action safety. Although well-established, shielding has seen limited adoption in RL due to the lack of accessible end-to-end infrastructure connecting formal shield synthesis with standard RL frameworks. Applying shielding typically requires expertise in formal methods and substantial engineering effort, keeping it outside the typical RL workflow. We address this by extending our shield synthesis tool Tempest into a practical backend for safe RL. Our core contribution is tempestpy, a Python library that integrates Tempest-based shield synthesis directly into the Gymnasium API, allowing shields to be synthesized and deployed within existing RL pipelines. This lowers the barrier to entry for shielding and turns formal safe-exploration methods into a usable component for RL practitioners. We also extend Tempest's algorithmic support to compute sound shields for stochastic multiplayer games, preserving formal safety guarantees. We demonstrate the resulting workflow end to end and evaluate shielded and unshielded RL across multiple environments. To facilitate modeling, we provide symbolic models for MiniGrid and introduce MiniGridSafe, a collection of playground environments designed to make shielding easily accessible and experimentally transparent. MiniGridSafe extends MiniGrid with safety-oriented scenarios featuring probabilistic transitions and additional agents, enabling the study of challenging safety aspects in a simple and intuitive setting.
Abstract（参考訳）: 安全探索は、エージェントが環境を探索しながら有害な決定を下すのを防ぐことを目的とした強化学習(RL)における重要な課題である。安全探索は、エージェントが環境を探索しながら有害な決定を下すのを防ぐことを目的とした強化学習(RL)における重要な課題である。シールドは、アクションセーフティを決定するための環境モデルという形でドメイン知識を仮定するテクニックのひとつです。十分に確立されたものの、標準のRLフレームワークと公式なシールド合成を接続するエンドツーエンドのインフラストラクチャが欠如しているため、シールドはRLでしか採用されていない。シールドを適用するには、通常、形式的な手法と実質的なエンジニアリング作業の専門知識が必要で、典型的なRLワークフローの外部に置いておく必要がある。我々は、シールド合成ツールであるTempestを安全なRLのための実用的なバックエンドに拡張することで、この問題に対処する。これはPythonライブラリで、Tempestベースのシールド合成を直接Gymnasium APIに統合します。これにより、シールドの参入障壁を低くし、正式な安全な探索メソッドをRL実践者にとって有用なコンポーネントにする。我々はまた、確率的マルチプレイヤーゲームのためのサウンドシールドを計算するために、テンペストのアルゴリズムサポートを拡張し、正式な安全保証を保持する。結果のワークフローをエンドツーエンドで実証し、シールド付きおよびシールドなしのRLを複数の環境にわたって評価する。モデリングを容易にするために,MiniGridのシンボリックモデルとMiniGridSafeを導入する。 MiniGridSafeは、確率的遷移と追加エージェントを備えた安全指向のシナリオでMiniGridを拡張し、シンプルで直感的な設定で、挑戦的な安全性面の研究を可能にする。

論文の概要: Easy-to-Use Shielding for Reinforcement Learning

関連論文リスト