Fugu-MT 論文翻訳(概要): ASACK : Adaptive Safe Active Continual Koopman Learning for Uncertain Systems with Contractive Guarantees

論文の概要: ASACK : Adaptive Safe Active Continual Koopman Learning for Uncertain Systems with Contractive Guarantees

arxiv url: http://arxiv.org/abs/2605.09659v1
Date: Sun, 10 May 2026 17:11:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.35499
Title: ASACK : Adaptive Safe Active Continual Koopman Learning for Uncertain Systems with Contractive Guarantees
Title（参考訳）: ASACK : 契約保証付き不確かさシステムのための適応型安全なアクティブ・アクティブ・コンティニュアル・クープマン学習
Authors: Chandan Kumar Sah, Rajpal Singh, Jishnu Keshavan,
Abstract要約: クープマン作用素論 (Koopman operator theory) は、持ち上げ可観測物に作用する線型作用素を通して非線形力学を表現するための枠組みを提供する。本稿では,学習モデルの安全かつ効率的なオンライン改善を可能にする,継続的適応型クープマン学習のための統合フレームワークを提案する。提案したアプローチは、リアルタイム効率を犠牲にすることなく、コントロールフレームワーク内の学習、デプロイメント、安全性を統一する。
参考スコア（独自算出の注目度）: 0.38887448816036313
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Koopman operator theory provides a powerful framework for representing nonlinear dynamics through a linear operator acting on lifted observables, enabling the use of linear control techniques for nonlinear systems. However, Koopman models are typically learned from data and often degrade in performance under model uncertainty and distributional shifts between training and deployment. Although several works have explored online adaptation to address this issue, many rely on neural network-based updates that introduce significant computational overhead and lack formal safety guarantees, limiting their suitability for real-time and safety-critical robotic applications. In this work, we propose a unified framework for continual adaptive Koopman learning that enables safe and efficient online refinement of learned models during task execution. An autoencoder-based Koopman model is first learned offline and subsequently refined online through a contractive adaptation law, which provides theoretical convergence guarantees under distributional shifts and model uncertainty. To improve data efficiency and accelerate model refinement, the adaptation mechanism is integrated with an active learning strategy that drives the system to collect informative data while accomplishing task objectives. The resulting control problem is formulated as a nonconvex optimization problem incorporating both active learning objectives and safety constraints. We further derive theoretical bounds on model approximation error and show how these bounds can be incorporated within a robust Model Predictive Control (MPC) framework to provide formal safety guarantees. The proposed approach unifies learning, excitation, and safety within a single control framework without sacrificing real-time feasibility. Extensive simulation and experimental studies demonstrate superior performance compared to state-of-the-art baselines.
Abstract（参考訳）: クープマン作用素理論は、持ち上げ可観測物に作用する線形作用素を通して非線形力学を表現するための強力なフレームワークを提供し、非線形系に線形制御技術を利用することができる。しかしながら、Koopmanモデルは一般的にデータから学習され、モデルの不確実性とトレーニングとデプロイメントの間の分散シフトの下で、しばしばパフォーマンスが低下する。この問題に対処するためにオンライン適応を探索する研究はいくつかあるが、多くはニューラルネットワークベースの更新に依存しており、計算オーバーヘッドが大きく、正式な安全性保証が欠如しており、リアルタイムおよび安全クリティカルなロボットアプリケーションに対する適合性が制限されている。本研究では,タスク実行中の学習モデルの安全かつ効率的なオンライン改善を可能にする,継続的適応型クープマン学習のための統合フレームワークを提案する。オートエンコーダをベースとしたクープマンモデルは、まずオフラインで学習され、その後、決定的適応法によりオンラインで洗練され、分散シフトの下で理論的収束保証とモデルの不確実性を提供する。データ効率の向上とモデル改良の促進を目的として,タスク目標達成時に情報収集を行う能動的学習戦略に適応機構を組み込む。得られた制御問題は、アクティブな学習目標と安全制約の両方を取り入れた非凸最適化問題として定式化される。さらに、モデル近似誤差に関する理論的境界を導出し、これらの境界をモデル予測制御(MPC)フレームワークに組み込んで、正式な安全性を保証する方法を示す。提案手法は、リアルタイムの実現性を犠牲にすることなく、単一の制御フレームワーク内での学習、励起、安全性を統一する。大規模なシミュレーションと実験的研究は、最先端のベースラインよりも優れた性能を示した。

論文の概要: ASACK : Adaptive Safe Active Continual Koopman Learning for Uncertain Systems with Contractive Guarantees

関連論文リスト