Fugu-MT 論文翻訳(概要): Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables

論文の概要: Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables

arxiv url: http://arxiv.org/abs/2604.11305v2
Date: Fri, 17 Apr 2026 11:33:47 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-20 13:38:49.282687
Title: Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables
Title（参考訳）: 固定された偽発見率を超える:E-変数によるホック後のコンフォーマル選択
Authors: Meiyi Zhu, Osvaldo Simeone,
Abstract要約: コンフォーマルセレクション(CS)は、キャリブレーションデータを使用して、未観測結果が所定の最小品質要件を満たす可能性のあるテスト入力を識別し、偽発見率(FDR)を制御している。既存の方法は、データを見る前にターゲットFDRレベルを固定し、ユーザが選択したテスト入力数とFDR間のバランスを、利用可能なデータに基づいてダウンストリームのニーズと制約に適合させないようにする。本稿では,データ駆動型偽発見率(FDP)の推定値と組み合わせて,候補選択集合の経路を生成するポストホックCS(PH-CS)を提案する。
参考スコア（独自算出の注目度）: 31.204606769083068
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Conformal selection (CS) uses calibration data to identify test inputs whose unobserved outcomes are likely to satisfy a pre-specified minimal quality requirement, while controlling the false discovery rate (FDR). Existing methods fix the target FDR level before observing data, which prevents the user from adapting the balance between number of selected test inputs and FDR to downstream needs and constraints based on the available data. For example, in genomics or neuroimaging, researchers often inspect the distribution of test statistics, and decide how aggressively to pursue candidates based on observed evidence strength and available follow-up resources. To address this limitation, we introduce {post-hoc CS} (PH-CS), which generates a path of candidate selection sets, each paired with a data-driven false discovery proportion (FDP) estimate. PH-CS lets the user select any operating point on this path by maximizing a user-specified utility, arbitrarily balancing selection size and FDR. Building on conformal e-variables and the e-Benjamini-Hochberg (e-BH) procedure, PH-CS is proved to provide a finite-sample post-hoc reliability guarantee whereby the ratio between estimated FDP level and true FDP is, on average, upper bounded by $1$, so that the average estimated FDP is, to first order, a valid upper bound on the true FDR. PH-CS is extended to control quality defined in terms of a general risk. Experiments on synthetic and real-world datasets demonstrate that, unlike CS, PH-CS can consistently satisfy user-imposed utility constraints while producing reliable FDP estimates and maintaining competitive FDR control.
Abstract（参考訳）: コンフォーマルセレクション(CS)は、キャリブレーションデータを使用して、未観測の結果が所定の最小品質要件を満たす可能性があるテスト入力を特定し、偽発見率(FDR)を制御している。既存の方法は、データを見る前にターゲットFDRレベルを固定し、ユーザが選択したテスト入力数とFDR間のバランスを、利用可能なデータに基づいてダウンストリームのニーズと制約に適合させないようにする。例えば、ゲノミクスやニューロイメージングでは、研究者はしばしばテスト統計の分布を調べ、観察された証拠の強さと利用可能なフォローアップリソースに基づいて、いかに積極的に候補者を追求するかを決定する。この制限に対処するために、候補選択セットの経路を生成する {post-hoc CS} (PH-CS) を導入し、それぞれがデータ駆動型偽発見率(FDP)の推定値と組み合わせる。 PH-CSは、ユーザが指定したユーティリティを最大化し、選択サイズとFDRを任意にバランスさせることで、この経路上の任意の操作ポイントを選択することができる。共形e-変数とe-Benjamini-Hochberg(e-BH)法に基づいて、PH-CSは、推定されたFDPレベルと真のFDPとの比率が平均1ドル上界となるような有限サンプルポストホック保証を提供することを証明した。 PH-CSは、一般的なリスクの観点から定義された品質を制御するように拡張される。合成および実世界のデータセットに関する実験は、CSとは異なり、PH-CSは、信頼性の高いFDP推定を生成し、競争力のあるFDR制御を維持しながら、ユーザが提供するユーティリティ制約を一貫して満たすことができることを示した。

論文の概要: Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables

関連論文リスト