Fugu-MT 論文翻訳(概要): Safe Control using Learned Safety Filters and Adaptive Conformal Inference

論文の概要: Safe Control using Learned Safety Filters and Adaptive Conformal Inference

arxiv url: http://arxiv.org/abs/2604.18482v1
Date: Mon, 20 Apr 2026 16:34:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.997478
Title: Safe Control using Learned Safety Filters and Adaptive Conformal Inference
Title（参考訳）: 学習安全フィルタと適応等角推論を用いた安全制御
Authors: Sacha Huriot, Ihab Tabbara, Hussein Sibai,
Abstract要約: 安全フィルタは、安全でない名目上のポリシーで制御システムの安全性を確保する効果的なツールであることが示されている。本稿では,ハミルトン-ヤコビ到達可能性に基づく安全フィルタと適応型共形推論を組み合わせた適応型コンフォーマルフィルタ(ACoFi)を提案する。また,ACoFiは,ユーザ定義パラメータによって,予測されたセキュリティ上の不確実性を不正確に定量化することを保証する。
参考スコア（独自算出の注目度）: 0.8921166277011348
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Safety filters have been shown to be effective tools to ensure the safety of control systems with unsafe nominal policies. To address scalability challenges in traditional synthesis methods, learning-based approaches have been proposed for designing safety filters for systems with high-dimensional state and control spaces. However, the inevitable errors in the decisions of these models raise concerns about their reliability and the safety guarantees they offer. This paper presents Adaptive Conformal Filtering (ACoFi), a method that combines learned Hamilton-Jacobi reachability-based safety filters with adaptive conformal inference. Under ACoFi, the filter dynamically adjusts its switching criteria based on the observed errors in its predictions of the safety of actions. The range of possible safety values of the nominal policy's output is used to quantify uncertainty in safety assessment. The filter switches from the nominal policy to the learned safe one when that range suggests it might be unsafe. We show that ACoFi guarantees that the rate of incorrectly quantifying uncertainty in the predicted safety of the nominal policy is asymptotically upper bounded by a user-defined parameter. This gives a soft safety guarantee rather than a hard safety guarantee. We evaluate ACoFi in a Dubins car simulation and a Safety Gymnasium environment, empirically demonstrating that it significantly outperforms the baseline method that uses a fixed switching threshold by achieving higher learned safety values and fewer safety violations, especially in out-of-distribution scenarios.
Abstract（参考訳）: 安全フィルタは、安全でない名目上のポリシーで制御システムの安全性を確保する効果的なツールであることが示されている。従来の合成手法におけるスケーラビリティの課題に対処するため,高次元状態と制御空間を持つシステムの安全フィルタ設計のための学習ベースアプローチが提案されている。しかしながら、これらのモデルの決定における必然的なエラーは、信頼性とそれらが提供する安全保証に関する懸念を引き起こす。本稿では,ハミルトン-ヤコビ到達可能性に基づく安全フィルタと適応型共形推論を組み合わせた適応型コンフォーマルフィルタ(ACoFi)を提案する。 ACoFiの下では、フィルタは動作の安全性の予測において観測された誤差に基づいて切換基準を動的に調整する。安全評価における不確実性を定量化するために、名目政策の出力の可能な安全性値の範囲が使用される。このフィルターは、名目上のポリシーから、その範囲が安全でないことを示唆する学習された安全ポリシーに切り替える。 ACoFiは, 利用者が定義したパラメータで漸近的に上界にあることを保証する。これにより、ハードセーフティの保証よりもソフトセーフティの保証が提供される。デュビンス車両シミュレーションおよび安全体育館環境におけるACoFiの評価を行い、特にアウト・オブ・ディストリビューションシナリオにおいて、高い安全性値と少ない安全性違反を達成し、固定切換閾値を使用するベースライン法を著しく上回っていることを実証的に証明した。

論文の概要: Safe Control using Learned Safety Filters and Adaptive Conformal Inference

関連論文リスト