Fugu-MT 論文翻訳(概要): Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

論文の概要: Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

arxiv url: http://arxiv.org/abs/2605.22431v1
Date: Thu, 21 May 2026 12:55:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.260502
Title: Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation
Title（参考訳）: 構造展開デュアル制御による未知環境のリアルタイム自動最適化
Authors: Shiying Dong, Haoyang Yang, Qiwei Liu, Wen-Hua Chen,
Abstract要約: 本稿では,探索と利用のための高速な数値二重制御法を開発した。未知の環境における自動最適化問題に対処する。約1桁のスピードアップを実現し、典型的な車両組み込みCPU上では、マイクロ秒レベルの最大時間は83秒に過ぎない。
参考スコア（独自算出の注目度）: 5.005890380363851
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper develops a fast numerical dual control for exploration and exploitation (DCEE) method to address auto-optimization problems in unknown environments. In auto-optimization problems, the optimal operating condition is unknown a priori and may vary with the environment. As in classical dual control techniques, computational burden remains a major concern in DCEE for active learning. Existing DCEE methods provide a principled exploration-exploitation objective, but mainly realized through standard optimization packages or explicit gradient-type update laws, where the numerical structure of the DCEE has not been fully exploited. This paper shows that the reward function in DCEE has an inherent convex-over-nonlinear structure, where the exploitation and exploration terms form a unified nonlinear residual map equipped with a convex outer loss. Benefiting from this structure, a structure-exploiting numerical method is developed by linearizing only the nonlinear residual map while preserving the convex outer loss. Thus, each subproblem is transformed into a structured convex form that can be solved reliably. The resulting generalized Gauss-Newton Hessian approximation is positive semidefinite and depends only on first-order derivatives, thereby supporting fast online computation. The proposed method is evaluated on a vehicle cruising auto-optimization problem and compared with existing methods. Simulation and hardware-in-the-loop experimental results show that the proposed method improves control performance and achieves a speedup of approximately one order of magnitude, with a microsecond-level maximum computation time of only 83 μs on a typical vehicle embedded CPU.
Abstract（参考訳）: 本稿では,未知環境における自動最適化問題に対処するために,高速な数値二重制御法(DCEE)を開発した。自動最適化問題では、最適動作条件は事前に不明であり、環境によって異なる可能性がある。古典的二重制御技術と同様に、DCEEのアクティブラーニングにおける計算負担は依然として大きな関心事である。既存のDCEE法は探索探索の目的を提供するが、主に標準最適化パッケージや、DCEEの数値構造が十分に活用されていない明示的な勾配型更新法によって実現されている。本稿では,DCEE の報酬関数が本質的に凸-非直線構造を持つことを示す。この構造から, 凸外損失を保ちながら, 非線形残留写像のみを線形化することにより, 構造露光数値法を開発した。したがって、各サブプロブレムは、確実に解ける構造化凸形式に変換される。一般化されたガウス・ニュートン・ヘッセン近似は正の半定値であり、一階微分にのみ依存し、高速オンライン計算をサポートする。提案手法は,車両の走行自動最適化問題に対して評価し,既存手法と比較した。シミュレーションとハードウェア・イン・ザ・ループ実験の結果、提案手法は制御性能を向上し、典型的な車両組込みCPU上では83μsのマイクロ秒レベルの最大計算時間で約1桁の高速化を実現することが示された。

論文の概要: Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

関連論文リスト