Fugu-MT 論文翻訳(概要): Betting for Sim-to-Real Performance Evaluation

論文の概要: Betting for Sim-to-Real Performance Evaluation

arxiv url: http://arxiv.org/abs/2604.24018v1
Date: Mon, 27 Apr 2026 03:58:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.729952
Title: Betting for Sim-to-Real Performance Evaluation
Title（参考訳）: Sim-to-Realパフォーマンス評価のための賭け
Authors: Zaid Mahboob, Yujia Chen, Bowen Weng,
Abstract要約: 我々は、ベッティング機構が正確かつ効率的に推定できる理論条件を開発する。これらの近似ベッティング戦略が意図通りに機能している場合に診断する具体的な決定ルールを提供する。また,ロボットマニピュレータの実際のピック・アンド・プレイス精度を推定するために,合成分布群を用いた実例を示した。
参考スコア（独自算出の注目度）: 5.669264620577287
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper studies the problem of robot performance evaluation, focusing on how to obtain accurate and efficient estimates of real-world behavior under severe constraints on physical experimentation. Such estimates are essential for benchmarking algorithms, comparing design alternatives, validating controllers, and supporting certification or regulatory decision-making, yet real-world testing with physical robots is often expensive, time-consuming, and safety-limited. To mitigate the scarcity of real-world trials, sim-to-real methodologies are commonly employed, using low-cost simulators to inform, supplement, or prioritize physical experiments. Departing from (and complementary to) existing approaches in variance reduction (e.g., importance-sampling variants) or bias-correction (e.g., through prediction-powered inference or learned control variates), we examine this performance-evaluation problem through the lens of betting. We establish theoretical conditions under which a betting mechanism can yield accurate and efficient estimates (provably outperforming the Monte Carlo estimator) and we characterize how such bets should be constructed. We further develop theoretically grounded yet practically implementable approximations of the ideal bet, and we provide concrete decision rules that diagnose when these approximate betting strategies are working as intended. We demonstrate the effectiveness of the proposed methods using both synthetic examples and cross-fidelity computational simulators. Notably, we also showcase an illustrative case in which a group of synthetic distributions are used to infer the real-world pick-and-place accuracy of a robotic manipulator, a seemingly unconventional sim-to-real transfer that becomes natural and feasible under the proposed betting perspective. Programs for reproducing empirical results are available at https://github.com/ISUSAIL/Bet4Sim2Real.
Abstract（参考訳）: 本稿では,身体実験の厳しい制約下での実世界の行動の正確かつ効率的な推定方法に着目し,ロボットの性能評価の課題について考察する。このような推定は、アルゴリズムのベンチマーク、設計代替品の比較、コントローラの検証、認証や規制決定のサポートなどには不可欠だが、物理ロボットによる現実的なテストは高価で時間を要すること、安全性に制限があることが多い。現実世界の試行の欠如を軽減するため、シム・ツー・リアルの方法論が一般的に用いられ、低コストのシミュレータを使って物理的な実験を知らせ、補足し、優先順位付けする。分散の低減(例えば重要サンプリングの変種)やバイアス補正(例えば、予測駆動推論や学習された制御変数)の既存のアプローチから分離し、ベッティングのレンズを通してこの性能評価問題を考察する。我々は、賭け機構が正確かつ効率的な見積もり(モンテカルロ推定器より優れている)を得られる理論条件を確立し、そのような賭けをどのように構築すべきかを特徴づける。我々はさらに、理論上は基礎を成すが、実際は理想的賭けの近似を実装可能とし、これらの近似的賭け戦略が意図通りに機能している場合に診断する具体的な決定ルールを提供する。提案手法の有効性を,合成例とクロスフィデリティ計算シミュレータを用いて実証する。また,ロボットマニピュレータの実際のピック・アンド・プレイス精度を推定するために,合成分布群を用いた図示的事例も紹介する。実験結果を再現するプログラムはhttps://github.com/ISUSAIL/Bet4Sim2Real.comで公開されている。

論文の概要: Betting for Sim-to-Real Performance Evaluation

関連論文リスト