Fugu-MT 論文翻訳(概要): Fine-Tuning of Neural Network Approximate MPC without Retraining via Bayesian Optimization

論文の概要: Fine-Tuning of Neural Network Approximate MPC without Retraining via Bayesian Optimization

arxiv url: http://arxiv.org/abs/2512.14350v3
Date: Fri, 16 Jan 2026 14:52:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:40.377545
Title: Fine-Tuning of Neural Network Approximate MPC without Retraining via Bayesian Optimization
Title（参考訳）: ベイズ最適化によるリトレーニングのないニューラルネットワーク近似MPCの微調整
Authors: Henrik Hose, Paul Brunzema, Alexander von Rohr, Alexander Gräfe, Angela P. Schoellig, Sebastian Trimpe,
Abstract要約: 近似モデル予測制御(AMPC)は、MPCの動作をニューラルネットワークで模倣することを目的としている。デプロイメント中、基盤となるMPCのパラメータは通常、微調整されなければならない。最近の研究は、MPC最適化問題の近似感度を用いてAMPCを調整することなく適用することでこの問題に対処している。
参考スコア（独自算出の注目度）: 81.35990332700389
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Approximate model-predictive control (AMPC) aims to imitate an MPC's behavior with a neural network, removing the need to solve an expensive optimization problem at runtime. However, during deployment, the parameters of the underlying MPC must usually be fine-tuned. This often renders AMPC impractical as it requires repeatedly generating a new dataset and retraining the neural network. Recent work addresses this problem by adapting AMPC without retraining using approximated sensitivities of the MPC's optimization problem. Currently, this adaption must be done by hand, which is labor-intensive and can be unintuitive for high-dimensional systems. To solve this issue, we propose using Bayesian optimization to tune the parameters of AMPC policies based on experimental data. By combining model-based control with direct and local learning, our approach achieves superior performance to nominal AMPC on hardware, with minimal experimentation. This allows automatic and data-efficient adaptation of AMPC to new system instances and fine-tuning to cost functions that are difficult to directly implement in MPC. We demonstrate the proposed method in hardware experiments for the swing-up maneuver on an inverted cartpole and yaw control of an under-actuated balancing unicycle robot, a challenging control problem.
Abstract（参考訳）: 近似モデル予測制御(AMPC)は、MPCの動作をニューラルネットワークで模倣することを目的としており、実行時に高価な最適化問題を解決する必要がなくなる。しかしながら、配備中は、基礎となるMPCのパラメータは、通常、微調整される必要がある。これは、新しいデータセットを繰り返し生成し、ニューラルネットワークを再トレーニングする必要があるため、AMPCが実用的でないことが多い。最近の研究は、MPC最適化問題の近似感度を用いてAMPCを調整することなく適用することでこの問題に対処している。現在、この適応は手作業で行わなければならないが、これは労働集約的であり、高次元システムでは直感的である。そこで本研究では,実験データに基づくAMPCポリシーのパラメータの調整にベイズ最適化を用いることを提案する。モデルベース制御と直接学習,局所学習を組み合わせることで,ハードウェア上でのAMPCよりも優れた性能と最小限の実験を実現する。これにより、AMPCを新しいシステムインスタンスに自動かつデータ効率で適応させ、MPCで直接実装するのが難しいコスト関数を微調整できる。本手法は, 逆転型カートポールの揺動動作と, 不安定なバランスの取れた一輪車ロボットのヨー制御のハードウェア実験において, 困難な制御問題であることを示す。

論文の概要: Fine-Tuning of Neural Network Approximate MPC without Retraining via Bayesian Optimization

関連論文リスト