Fugu-MT 論文翻訳(概要): Estimation of the sub-Gaussian parameter

論文の概要: Estimation of the sub-Gaussian parameter

arxiv url: http://arxiv.org/abs/2606.06384v1
Date: Thu, 04 Jun 2026 16:48:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-05 22:39:44.973936
Title: Estimation of the sub-Gaussian parameter
Title（参考訳）: ガウス下パラメータの推定
Authors: Jason Liu, Min Xu, Jinchuan Xing,
Abstract要約: 準ガウス確率変数 $X$ は 2_* = s_upin mathbbR L()$ ここで $L() = frac22 log mathbbE eX$ は重み付き累積生成関数である。ガウス以下の確率変数が多用されているにもかかわらず、$2_*$の推定はほとんど注目されず、まだよく理解されていない。
参考スコア（独自算出の注目度）: 7.4076878426925035
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The sub-Gaussian parameter (also called the variance proxy) of a mean-zero random variable $X$ is defined as $ξ^2_* = \sup_{λ\in \mathbb{R}} L(λ)$ where $L(λ) = \frac{2}{λ^2} \log \mathbb{E} e^{λX}$ is a weighted cumulant generating function. Despite the ubiquity of sub-Gaussian random variables, the estimation of $ξ^2_*$ has received little attention and is not yet well understood. In this work, we study a natural estimator of $ξ^2_*$ based on constrained maximization of the empirical analogue of $L$. We prove that the estimator is consistent bound the rates of convergence under assumptions on $L$: if $L$ has an maximizer, then our bound is $O_p(n^{-1/2 + \varepsilon})$ for any $\varepsilon > 0$; if the argmax of $L$ is also bounded, then the bound improves to $O_p(n^{-1/2})$. We show that our assumptions on $L$ are necessary by proving that the minimax risk over all sub-Gaussian distributions is $Ω(1)$; imposing increasingly strong assumptions on the tail growth of $L$ yields a continuum of classes whose minimax lower bound interpolates between $Ω(1/\log n)$ and $Ω(1)$. Root-n rate is possible if we restrict to a subclass of distributions where $L$ attains its supremum in a bounded region, in which case our estimator is minimax optimal. If the underlying distribution is not sub-Gaussian, we show that our estimator goes to infinity with a divergence rate controlled by the tail of the distribution. Finally, we apply our estimator in a Gene Ontology (GO) enrichment study to construct p-values for a large-scale permutation test, showing that it can serve as a reliable alternative to the peaks-over-threshold approach, particularly in regimes where the peaks-over-threshold method is of uncertain validity.
Abstract（参考訳）: 平均ゼロ確率変数 $X$ の亜ガウスパラメータ(分散プロキシとも呼ばれる)は、$L(λ) = \frac{2}{λ^2} \log \mathbb{E} e^{λX}$ が重み付き累積生成関数であるとき、$ ^2_* = \sup_{λ\in \mathbb{R}} L(λ)$ と定義される。ガウス以下の確率変数のユビキティにもかかわらず、$ ^2_*$ の推定はほとんど注目されず、まだよく理解されていない。本研究では、$L$の経験的アナログの制約付き最大化に基づいて、$ ^2_*$の自然推定器について検討する。 L$ が最大値を持つなら、我々の有界は $O_p(n^{-1/2 + \varepsilon})$ であり、任意の $\varepsilon > 0$ に対して、$L$ の argmax が有界であれば、その有界は $O_p(n^{-1/2})$ に改善される。 L$ に対する我々の仮定は、すべてのガウス分布に対するミニマックスリスクが $Ω(1)$ であることを証明することによって必要であることが示される;$L$ の尾成長に対する強い仮定は、小マックスが $Ω(1/\log n)$ と $Ω(1)$ の間を補間するクラス全体の連続体を生じる。ルート-n レートは、L$ が有界領域で上限に達するような分布のサブクラスに制限される場合、その場合、推定器は極小極小である。基底分布が亜ガウス分布でない場合、分布の尾によって制御される発散速度で、我々の推定器が無限大に進むことを示す。最後に,遺伝子オントロジー(GO)エンリッチメント(enrichment study)を用いて,大規模置換試験におけるp値の構築を行い,特にピーク・オーバー・スレッショルド法が不確実な状況において,ピーク・オーバー・スレッショルド法に代わる信頼性の高い代替手段として機能することを示す。

論文の概要: Estimation of the sub-Gaussian parameter

関連論文リスト