Fugu-MT 論文翻訳(概要): Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

論文の概要: Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

arxiv url: http://arxiv.org/abs/2604.02260v1
Date: Thu, 02 Apr 2026 16:52:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:10.92959
Title: Model-Based Reinforcement Learning for Control under Time-Varying Dynamics
Title（参考訳）: 時間変化ダイナミクスを考慮したモデルベース強化学習
Authors: Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija,
Abstract要約: 時間変動力学下での制御のための強化学習について検討する。適応型データバッファ機構を用いた実用的な楽観的モデルベース強化学習アルゴリズムを提案する。
参考スコア（独自算出の注目度）: 40.96573079787976
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under time-varying dynamics. We consider a continual model-based reinforcement learning setting in which an agent repeatedly learns and controls a dynamical system whose transition dynamics evolve across episodes. We analyze the problem using Gaussian process dynamics models under frequentist variation-budget assumptions. Our analysis shows that persistent non-stationarity requires explicitly limiting the influence of outdated data to maintain calibrated uncertainty and meaningful dynamic regret guarantees. Motivated by these insights, we propose a practical optimistic model-based reinforcement learning algorithm with adaptive data buffer mechanisms and demonstrate improved performance on continuous control benchmarks with non-stationary dynamics.
Abstract（参考訳）: 学習に基づく制御法は通常、静止系の力学を仮定するが、これは現実のシステムではドリフト、摩耗、動作条件の変化によってしばしば違反される仮定である。時間変動力学下での制御のための強化学習について検討する。本稿では,エージェントが繰り返し学習し,遷移ダイナミクスがエピソード間で進化する動的システムを制御する連続モデルに基づく強化学習環境について考察する。頻繁な変動予算仮定の下でガウス過程力学モデルを用いて問題を解析する。解析の結果, 持続的非定常性は, 校正された不確かさと意味のある動的後悔の保証を維持するために, 時代遅れのデータの影響を明示的に制限する必要があることがわかった。これらの知見により,適応型データバッファ機構を持つ実用的な楽観的モデルベース強化学習アルゴリズムを提案し,非定常力学を用いた連続制御ベンチマークの性能向上を示す。

論文の概要: Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

関連論文リスト