QuTE: decentralized multiple testing on sensor networks with false
discovery rate control
- URL: http://arxiv.org/abs/2210.04334v1
- Date: Sun, 9 Oct 2022 19:48:39 GMT
- Title: QuTE: decentralized multiple testing on sensor networks with false
discovery rate control
- Authors: Aaditya Ramdas and Jianbo Chen and Martin J. Wainwright and Michael I.
Jordan
- Abstract summary: This paper designs methods for decentralized multiple hypothesis testing on graphs equipped with provable guarantees on the false discovery rate (FDR)
We consider the setting where distinct agents reside on the nodes of an undirected graph, and each agent possesses p-values corresponding to one or more hypotheses local to its node.
Each agent must individually decide whether to reject one or more of its local hypotheses by only communicating with its neighbors, with the joint aim that the global FDR over the entire graph must be controlled at a predefined level.
- Score: 130.7122910646076
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper designs methods for decentralized multiple hypothesis testing on
graphs that are equipped with provable guarantees on the false discovery rate
(FDR). We consider the setting where distinct agents reside on the nodes of an
undirected graph, and each agent possesses p-values corresponding to one or
more hypotheses local to its node. Each agent must individually decide whether
to reject one or more of its local hypotheses by only communicating with its
neighbors, with the joint aim that the global FDR over the entire graph must be
controlled at a predefined level. We propose a simple decentralized family of
Query-Test-Exchange (QuTE) algorithms and prove that they can control FDR under
independence or positive dependence of the p-values. Our algorithm reduces to
the Benjamini-Hochberg (BH) algorithm when after graph-diameter rounds of
communication, and to the Bonferroni procedure when no communication has
occurred or the graph is empty. To avoid communicating real-valued p-values, we
develop a quantized BH procedure, and extend it to a quantized QuTE procedure.
QuTE works seamlessly in streaming data settings, where anytime-valid p-values
may be continually updated at each node. Last, QuTE is robust to arbitrary
dropping of packets, or a graph that changes at every step, making it
particularly suitable to mobile sensor networks involving drones or other
multi-agent systems. We study the power of our procedure using a simulation
suite of different levels of connectivity and communication on a variety of
graph structures, and also provide an illustrative real-world example.
Related papers
- The Heterophilic Snowflake Hypothesis: Training and Empowering GNNs for Heterophilic Graphs [59.03660013787925]
We introduce the Heterophily Snowflake Hypothesis and provide an effective solution to guide and facilitate research on heterophilic graphs.
Our observations show that our framework acts as a versatile operator for diverse tasks.
It can be integrated into various GNN frameworks, boosting performance in-depth and offering an explainable approach to choosing the optimal network depth.
arXiv Detail & Related papers (2024-06-18T12:16:00Z) - Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs [49.71319907864573]
In this paper, we propose multi-agent skill discovery which enables the ease of decomposition.
Our key idea is to approximate the joint state space as a Kronecker graph, based on which we can directly estimate its Fiedler vector.
Considering that directly computing the Laplacian spectrum is intractable for tasks with infinite-scale state spaces, we further propose a deep learning extension of our method.
arXiv Detail & Related papers (2023-07-21T14:53:12Z) - Online Heavy-tailed Change-point detection [6.7643284029102295]
We present an algorithm based on clipped Gradient Descent (SGD), that works even if we only assume that the second moment of the data generating process is bounded.
We derive guarantees on worst-case, finite-sample false-positive rate (FPR) over the family of all distributions with bounded second moment.
Our method is the first OCPD algorithm that guarantees finite-sample FPR, even if the data is high dimensional and the underlying distributions are heavy-tailed.
arXiv Detail & Related papers (2023-06-15T23:39:05Z) - Online Centralized Non-parametric Change-point Detection via Graph-based
Likelihood-ratio Estimation [77.81487285123147]
Consider each node of a graph to be generating a data stream that is synchronized and observed at near real-time.
At a change-point $tau$, a change occurs at a subset of nodes $C$, which affects the probability distribution of their associated node streams.
We propose a novel kernel-based method to both detect $tau$ and localize $C$, based on the direct estimation of the likelihood-ratio between the post-change and the pre-change distributions of the node streams.
arXiv Detail & Related papers (2023-01-08T10:15:24Z) - Sample-and-Forward: Communication-Efficient Control of the False
Discovery Rate in Networks [19.786769414376323]
This work concerns controlling the false discovery rate (FDR) in networks under communication constraints.
We present sample-and-forward, a flexible and communication-efficient version of the Benjamini-Hochberg (BH) procedure for multihop networks with general topologies.
arXiv Detail & Related papers (2022-10-05T20:54:32Z) - Walk for Learning: A Random Walk Approach for Federated Learning from
Heterogeneous Data [17.978941229970886]
We focus on Federated Learning (FL) as a canonical application.
One of the main challenges of FL is the communication bottleneck between the nodes and the parameter server.
We present an adaptive random walk learning algorithm.
arXiv Detail & Related papers (2022-06-01T19:53:24Z) - Optimal Clustering with Bandit Feedback [57.672609011609886]
This paper considers the problem of online clustering with bandit feedback.
It includes a novel stopping rule for sequential testing that circumvents the need to solve any NP-hard weighted clustering problem as its subroutines.
We show through extensive simulations on synthetic and real-world datasets that BOC's performance matches the lower boundally, and significantly outperforms a non-adaptive baseline algorithm.
arXiv Detail & Related papers (2022-02-09T06:05:05Z) - Coded Stochastic ADMM for Decentralized Consensus Optimization with Edge
Computing [113.52575069030192]
Big data, including applications with high security requirements, are often collected and stored on multiple heterogeneous devices, such as mobile devices, drones and vehicles.
Due to the limitations of communication costs and security requirements, it is of paramount importance to extract information in a decentralized manner instead of aggregating data to a fusion center.
We consider the problem of learning model parameters in a multi-agent system with data locally processed via distributed edge nodes.
A class of mini-batch alternating direction method of multipliers (ADMM) algorithms is explored to develop the distributed learning model.
arXiv Detail & Related papers (2020-10-02T10:41:59Z) - PushNet: Efficient and Adaptive Neural Message Passing [1.9121961872220468]
Message passing neural networks have recently evolved into a state-of-the-art approach to representation learning on graphs.
Existing methods perform synchronous message passing along all edges in multiple subsequent rounds.
We consider a novel asynchronous message passing approach where information is pushed only along the most relevant edges until convergence.
arXiv Detail & Related papers (2020-03-04T18:15:30Z) - Adaptive Propagation Graph Convolutional Network [17.41698818541144]
Graph convolutional networks (GCNs) are a family of neural network models that perform inference on graph data.
We show that state-of-the-art results can be achieved by adapting the number of communication steps independently at every node.
We show that the proposed adaptive propagation GCN (AP-GCN) achieves superior or similar results to the best proposed models.
arXiv Detail & Related papers (2020-02-24T15:31:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.