Cost-aware Generalized $\alpha$-investing for Multiple Hypothesis
Testing
- URL: http://arxiv.org/abs/2210.17514v3
- Date: Fri, 3 Nov 2023 15:55:56 GMT
- Title: Cost-aware Generalized $\alpha$-investing for Multiple Hypothesis
Testing
- Authors: Thomas Cook and Harsh Vardhan Dubey and Ji Ah Lee and Guangyu Zhu and
Tingting Zhao and Patrick Flaherty
- Abstract summary: We consider the problem of sequential multiple hypothesis testing with nontrivial data collection costs.
This problem appears when conducting biological experiments to identify differentially expressed genes of a disease process.
We make a theoretical analysis of the long term behavior of $alpha$-wealth which motivates a consideration of sample size in the $alpha$-investing decision rule.
- Score: 5.521213530218833
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We consider the problem of sequential multiple hypothesis testing with
nontrivial data collection costs. This problem appears, for example, when
conducting biological experiments to identify differentially expressed genes of
a disease process. This work builds on the generalized $\alpha$-investing
framework which enables control of the false discovery rate in a sequential
testing setting. We make a theoretical analysis of the long term asymptotic
behavior of $\alpha$-wealth which motivates a consideration of sample size in
the $\alpha$-investing decision rule. Posing the testing process as a game with
nature, we construct a decision rule that optimizes the expected
$\alpha$-wealth reward (ERO) and provides an optimal sample size for each test.
Empirical results show that a cost-aware ERO decision rule correctly rejects
more false null hypotheses than other methods for $n=1$ where $n$ is the sample
size. When the sample size is not fixed cost-aware ERO uses a prior on the null
hypothesis to adaptively allocate of the sample budget to each test. We extend
cost-aware ERO investing to finite-horizon testing which enables the decision
rule to allocate samples in a non-myopic manner. Finally, empirical tests on
real data sets from biological experiments show that cost-aware ERO balances
the allocation of samples to an individual test against the allocation of
samples across multiple tests.
Related papers
- Robust Non-adaptive Group Testing under Errors in Group Membership Specifications [3.554868356768806]
Group testing (GT) aims to determine defect status by performing tests on $n p$ groups', where a group is formed by mixing a subset of the $p$ samples.
Most existing methods, however, assume that the group memberships are accurately specified.
We develop a new GT method, the Debiased Robust Lasso Test Method (DRLT), that handles such group membership specification errors.
arXiv Detail & Related papers (2024-09-09T06:03:23Z) - Doubly Robust Conditional Independence Testing with Generative Neural Networks [8.323172773256449]
This article addresses the problem of testing the conditional independence of two generic random vectors $X$ and $Y$ given a third random vector $Z$.
We propose a new non-parametric testing procedure that avoids explicitly estimating any conditional distributions.
arXiv Detail & Related papers (2024-07-25T01:28:59Z) - Collaborative non-parametric two-sample testing [55.98760097296213]
The goal is to identify nodes where the null hypothesis $p_v = q_v$ should be rejected.
We propose the non-parametric collaborative two-sample testing (CTST) framework that efficiently leverages the graph structure.
Our methodology integrates elements from f-divergence estimation, Kernel Methods, and Multitask Learning.
arXiv Detail & Related papers (2024-02-08T14:43:56Z) - Active Sequential Two-Sample Testing [18.99517340397671]
We consider the two-sample testing problem in a new scenario where sample measurements are inexpensive to access.
We devise the first emphactiveNIST-sample testing framework that not only sequentially but also emphactively queries.
In practice, we introduce an instantiation of our framework and evaluate it using several experiments.
arXiv Detail & Related papers (2023-01-30T02:23:49Z) - Sequential Kernelized Independence Testing [101.22966794822084]
We design sequential kernelized independence tests inspired by kernelized dependence measures.
We demonstrate the power of our approaches on both simulated and real data.
arXiv Detail & Related papers (2022-12-14T18:08:42Z) - Boost Test-Time Performance with Closed-Loop Inference [85.43516360332646]
We propose to predict hard-classified test samples in a looped manner to boost the model performance.
We first devise a filtering criterion to identify those hard-classified test samples that need additional inference loops.
For each hard sample, we construct an additional auxiliary learning task based on its original top-$K$ predictions to calibrate the model.
arXiv Detail & Related papers (2022-03-21T10:20:21Z) - The Sample Complexity of Robust Covariance Testing [56.98280399449707]
We are given i.i.d. samples from a distribution of the form $Z = (1-epsilon) X + epsilon B$, where $X$ is a zero-mean and unknown covariance Gaussian $mathcalN(0, Sigma)$.
In the absence of contamination, prior work gave a simple tester for this hypothesis testing task that uses $O(d)$ samples.
We prove a sample complexity lower bound of $Omega(d2)$ for $epsilon$ an arbitrarily small constant and $gamma
arXiv Detail & Related papers (2020-12-31T18:24:41Z) - Optimal Testing of Discrete Distributions with High Probability [49.19942805582874]
We study the problem of testing discrete distributions with a focus on the high probability regime.
We provide the first algorithms for closeness and independence testing that are sample-optimal, within constant factors.
arXiv Detail & Related papers (2020-09-14T16:09:17Z) - Noisy Adaptive Group Testing using Bayesian Sequential Experimental
Design [63.48989885374238]
When the infection prevalence of a disease is low, Dorfman showed 80 years ago that testing groups of people can prove more efficient than testing people individually.
Our goal in this paper is to propose new group testing algorithms that can operate in a noisy setting.
arXiv Detail & Related papers (2020-04-26T23:41:33Z) - Nonparametric Inference under B-bits Quantization [5.958064620718292]
We propose a nonparametric testing procedure based on samples quantized to $B$ bits.
In particular, we show that if $B$ exceeds a certain threshold, the proposed nonparametric testing procedure achieves the classical minimax rate of testing.
arXiv Detail & Related papers (2019-01-24T18:43:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.