Fugu-MT 論文翻訳(概要): Support sufficiency as action-sufficient compression: a single-cycle rate-regret formulation

論文の概要: Support sufficiency as action-sufficient compression: a single-cycle rate-regret formulation

arxiv url: http://arxiv.org/abs/2606.09858v1
Date: Thu, 28 May 2026 14:59:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 07:09:36.814966
Title: Support sufficiency as action-sufficient compression: a single-cycle rate-regret formulation
Title（参考訳）: アクション充足圧縮としての支持充足性:単一サイクル速度-回帰定式化
Authors: Mark Walsh,
Abstract要約: 本稿では,アクション・サフィシエント・圧縮としてサポート・サフィシエントを定式化する。固定$Z$の場合、厳密なアクション十分圧縮はポリシー等価性によるサポート空間の商である。行動の正確性は、再構築の忠実さ、情報・ブートネック予測、合理的な意図と区別される。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Robust decision-making requires compression. A system that forms a rich support state cannot usually preserve its full structure at the point of action. It must retain only those distinctions needed to act, verify, abstain, or defer under the current consequence geometry. This paper formalizes support sufficiency as action-sufficient compression. Let $H$ denote a full support state, $\mathcal{A}$ a finite action set, and $Z$ a consequence geometry specifying payoff structure. For fixed $Z$, the coarsest exactly action-sufficient compression is the quotient of support space by policy equivalence. Two support states may be merged exactly when they require the same optimal action. This clarifies why content-only and scalar-confidence-only arbitration fail whenever their induced partitions cross action boundaries. Approximate sufficiency is then defined by bounded expected policy regret. In the finite single-cycle setting, this yields a rate-regret problem with source $H$, reproduction alphabet $\mathcal{A}$, and distortion given by consequence-sensitive regret. The optimal stochastic action channel inherits the standard rate-distortion Gibbs form, applied here to support states with regret distortion. The contribution is interpretive: action adequacy is distinguished from reconstruction fidelity, information-bottleneck prediction, and rational inattention. Robust single-cycle arbitration does not require preserving all support, but it does require preserving the distinctions that consequence geometry makes action-relevant.
Abstract（参考訳）: ロバストな意思決定には圧縮が必要です。豊かな支持状態を形成するシステムは、通常行動の時点でその完全な構造を維持できない。それは、現在の結果幾何学の下で行動し、検証し、棄却し、延期するために必要な区別のみを保持する必要がある。本稿では,アクション・サフィシエント・圧縮としてサポート・サフィシエントを定式化する。 H$ はフルサポート状態、$\mathcal{A} は有限アクション集合、$Z$ はペイオフ構造を指定する結果幾何学を表す。固定$Z$の場合、厳密なアクション十分圧縮はポリシー等価性によるサポート空間の商である。 2つのサポート状態は、同じ最適なアクションを必要とするときに正確にマージされる。これは、引き起こされたパーティションがアクション境界を越えると、コンテントのみとスカラーのみの仲裁が失敗する理由を明確にする。近似の十分性は、制限された期待されたポリシーの後悔によって定義される。有限個の単サイクル設定では、これはソース$H$、再生アルファベット$\mathcal{A}$、結果に敏感な後悔によって与えられる歪みによるレート-回帰問題をもたらす。最適確率的作用チャネルは、標準速度歪曲ギブズ形式を継承し、ここでは、後悔した歪みのある状態をサポートするために適用される。その貢献は解釈的であり、行動の妥当性は再構築の忠実さ、情報・ブートネック予測、合理的な意図と区別される。ロバストな単一サイクル仲裁はすべてのサポートを保存する必要はないが、結果として幾何学がアクション関連性をもたらす区別を保存する必要がある。

論文の概要: Support sufficiency as action-sufficient compression: a single-cycle rate-regret formulation

関連論文リスト