Fugu-MT 論文翻訳(概要): Unified Precision-Guaranteed Stopping Rules for Contextual Learning

論文の概要: Unified Precision-Guaranteed Stopping Rules for Contextual Learning

arxiv url: http://arxiv.org/abs/2604.07913v1
Date: Thu, 09 Apr 2026 07:30:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.767766
Title: Unified Precision-Guaranteed Stopping Rules for Contextual Learning
Title（参考訳）: 文脈学習のための統一的精度保証型停止規則
Authors: Mingrui Ding, Qiuhong Zhao, Siyang Gao, Jing Dong,
Abstract要約: 文脈学習は、個人の特徴をデータ収集を通じて行動にマッピングする決定ポリシーを学習しようとする。本研究は,文脈的基準と政策価値基準の総合的基準の2つの精度基準の下で検討する。我々は、未知のサンプリング分散を伴う文脈学習のための統一的な停止規則を、非構造化と構造化の両方の線形設定で開発する。
参考スコア（独自算出の注目度）: 8.604741134620559
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Contextual learning seeks to learn a decision policy that maps an individual's characteristics to an action through data collection. In operations management, such data may come from various sources, and a central question is when data collection can stop while still guaranteeing that the learned policy is sufficiently accurate. We study this question under two precision criteria: a context-wise criterion and an aggregate policy-value criterion. We develop unified stopping rules for contextual learning with unknown sampling variances in both unstructured and structured linear settings. Our approach is based on generalized likelihood ratio (GLR) statistics for pairwise action comparisons. To calibrate the corresponding sequential boundaries, we derive new time-uniform deviation inequalities that directly control the self-normalized GLR evidence and thus avoid the conservativeness caused by decoupling mean and variance uncertainty. Under the Gaussian sampling model, we establish finite-sample precision guarantees for both criteria. Numerical experiments on synthetic instances and two case studies demonstrate that the proposed stopping rules achieve the target precision with substantially fewer samples than benchmark methods. The proposed framework provides a practical way to determine when enough information has been collected in personalized decision problems. It applies across multiple data-collection environments, including historical datasets, simulation models, and real systems, enabling practitioners to reduce unnecessary sampling while maintaining a desired level of decision quality.
Abstract（参考訳）: 文脈学習は、個人の特徴をデータ収集を通じて行動にマッピングする決定ポリシーを学習しようとする。運用管理においては、このようなデータはさまざまなソースから取得され、学習されたポリシーが十分に正確であることを保証しながら、データ収集がいつ停止するかが中心的な疑問である。本研究は,文脈的基準と政策価値基準の総合的基準の2つの精度基準の下で研究する。我々は、未知のサンプリング分散を伴う文脈学習のための統一的な停止規則を、非構造化と構造化の両方の線形設定で開発する。提案手法は, 対作用比較のための一般化可能性比(GLR)統計に基づく。対応する逐次境界をキャリブレーションするために, 自己正規化GLRエビデンスを直接制御し, 平均と不確かさの疎結合に起因する保守性を回避する, 新たな時間均一偏差不等式を導出する。ガウスサンプリングモデルでは、両方の基準に対して有限サンプル精度を保証する。合成事例に関する数値実験と2つのケーススタディにより,提案した停止規則が,ベンチマーク法よりもかなり少ないサンプルで目標精度を達成することを示した。提案フレームワークは、パーソナライズされた決定問題において、十分な情報が収集されたかどうかを判断する実用的な方法を提供する。これは、過去のデータセット、シミュレーションモデル、実際のシステムを含む複数のデータ収集環境に適用され、実践者が望ましい意思決定品質を維持しながら不要なサンプリングを減らすことができる。

論文の概要: Unified Precision-Guaranteed Stopping Rules for Contextual Learning

関連論文リスト