Meek Separators and Their Applications in Targeted Causal Discovery
- URL: http://arxiv.org/abs/2310.20075v1
- Date: Mon, 30 Oct 2023 23:15:27 GMT
- Title: Meek Separators and Their Applications in Targeted Causal Discovery
- Authors: Kirankumar Shiragur and Jiaqi Zhang and Caroline Uhler
- Abstract summary: We focus on two well-motivated problems: subset search and causal matching.
We present an efficient algorithm to find Meek separators that are of small sizes.
Our results provide the first known average-case provable guarantees for both problems.
- Score: 17.84347390320128
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning causal structures from interventional data is a fundamental problem
with broad applications across various fields. While many previous works have
focused on recovering the entire causal graph, in practice, there are scenarios
where learning only part of the causal graph suffices. This is called
$targeted$ causal discovery. In our work, we focus on two such well-motivated
problems: subset search and causal matching. We aim to minimize the number of
interventions in both cases.
Towards this, we introduce the $Meek~separator$, which is a subset of
vertices that, when intervened, decomposes the remaining unoriented edges into
smaller connected components. We then present an efficient algorithm to find
Meek separators that are of small sizes. Such a procedure is helpful in
designing various divide-and-conquer-based approaches. In particular, we
propose two randomized algorithms that achieve logarithmic approximation for
subset search and causal matching, respectively. Our results provide the first
known average-case provable guarantees for both problems. We believe that this
opens up possibilities to design near-optimal methods for many other targeted
causal structure learning problems arising from various applications.
Related papers
- Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search [9.01394829787271]
We study the problem of routing in clustering-based maximum inner product search (MIPS)
We present a new framework that incorporates the moments of the distribution of inner products within each shard to optimistically estimate the maximum inner product.
arXiv Detail & Related papers (2024-05-20T17:47:18Z) - Agnostically Learning Multi-index Models with Queries [54.290489524576756]
We study the power of query access for the task of agnostic learning under the Gaussian distribution.
We show that query access gives significant runtime improvements over random examples for agnostically learning MIMs.
arXiv Detail & Related papers (2023-12-27T15:50:47Z) - Subset verification and search algorithms for causal DAGs [13.108039226223793]
We study the problem of identifying the smallest set of interventions required to learn the causal relationships between a subset of edges (target edges)
We prove several interesting structural properties of interventional causal graphs that we believe have applications beyond the subset verification/search problems studied here.
arXiv Detail & Related papers (2023-01-09T06:25:44Z) - Verification and search algorithms for causal DAGs [11.038866111306728]
We give a characterization of a minimal sized set of atomic interventions to check the correctness of a claimed causal graph.
We generalize our results to the settings of bounded size interventions and node-dependent interventional costs.
Our result is the first known algorithm that gives a non-trivial approximation guarantee to the verifying size on general unweighted graphs.
arXiv Detail & Related papers (2022-06-30T15:52:30Z) - An Application of a Multivariate Estimation of Distribution Algorithm to
Cancer Chemotherapy [59.40521061783166]
Chemotherapy treatment for cancer is a complex optimisation problem with a large number of interacting variables and constraints.
We show that the more sophisticated algorithm would yield better performance on a complex problem like this.
We hypothesise that this is caused by the more sophisticated algorithm being impeded by the large number of interactions in the problem.
arXiv Detail & Related papers (2022-05-17T15:28:46Z) - Simple Stochastic and Online Gradient DescentAlgorithms for Pairwise
Learning [65.54757265434465]
Pairwise learning refers to learning tasks where the loss function depends on a pair instances.
Online descent (OGD) is a popular approach to handle streaming data in pairwise learning.
In this paper, we propose simple and online descent to methods for pairwise learning.
arXiv Detail & Related papers (2021-11-23T18:10:48Z) - Intervention Efficient Algorithm for Two-Stage Causal MDPs [15.838256272508357]
We study Markov Decision Processes (MDP) wherein states correspond to causal graphs that generate rewards.
In this setup, the learner's goal is to identify atomic interventions that lead to high rewards by intervening on variables at each state.
Generalizing the recent causal-bandit framework, the current work develops (simple) regret minimization guarantees for two-stage causal MDPs.
arXiv Detail & Related papers (2021-11-01T12:22:37Z) - Near-Optimal Multi-Perturbation Experimental Design for Causal Structure
Learning [0.0]
Causal structures can be learnt by performing experiments on the system of interest.
We address the largely unexplored problem of designing experiments that simultaneously intervene on multiple variables.
arXiv Detail & Related papers (2021-05-28T18:00:00Z) - Adversarial Examples for $k$-Nearest Neighbor Classifiers Based on
Higher-Order Voronoi Diagrams [69.4411417775822]
Adversarial examples are a widely studied phenomenon in machine learning models.
We propose an algorithm for evaluating the adversarial robustness of $k$-nearest neighbor classification.
arXiv Detail & Related papers (2020-11-19T08:49:10Z) - Resource Allocation in Multi-armed Bandit Exploration: Overcoming
Sublinear Scaling with Adaptive Parallelism [107.48538091418412]
We study exploration in multi-armed bandits when we have access to a divisible resource that can be allocated in varying amounts to arm pulls.
We focus in particular on the allocation of distributed computing resources, where we may obtain results faster by allocating more resources per pull.
arXiv Detail & Related papers (2020-10-31T18:19:29Z) - Multi-task Supervised Learning via Cross-learning [102.64082402388192]
We consider a problem known as multi-task learning, consisting of fitting a set of regression functions intended for solving different tasks.
In our novel formulation, we couple the parameters of these functions, so that they learn in their task specific domains while staying close to each other.
This facilitates cross-fertilization in which data collected across different domains help improving the learning performance at each other task.
arXiv Detail & Related papers (2020-10-24T21:35:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.