A kernel test for quasi-independence
- URL: http://arxiv.org/abs/2011.08991v1
- Date: Tue, 17 Nov 2020 22:42:45 GMT
- Title: A kernel test for quasi-independence
- Authors: Tamara Fern\'andez, Wenkai Xu, Marc Ditzhaus and Arthur Gretton
- Abstract summary: We consider settings in which the data of interest correspond to pairs of ordered times.
It is still of interest to determine whether there exists significant dependence beyond their ordering in time.
We propose a nonparametric statistical test of quasi-independence.
- Score: 24.127106529428335
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider settings in which the data of interest correspond to pairs of
ordered times, e.g, the birth times of the first and second child, the times at
which a new user creates an account and makes the first purchase on a website,
and the entry and survival times of patients in a clinical trial. In these
settings, the two times are not independent (the second occurs after the
first), yet it is still of interest to determine whether there exists
significant dependence {\em beyond} their ordering in time. We refer to this
notion as "quasi-(in)dependence". For instance, in a clinical trial, to avoid
biased selection, we might wish to verify that recruitment times are
quasi-independent of survival times, where dependencies might arise due to
seasonal effects. In this paper, we propose a nonparametric statistical test of
quasi-independence. Our test considers a potentially infinite space of
alternatives, making it suitable for complex data where the nature of the
possible quasi-dependence is not known in advance. Standard parametric
approaches are recovered as special cases, such as the classical conditional
Kendall's tau, and log-rank tests. The tests apply in the right-censored
setting: an essential feature in clinical trials, where patients can withdraw
from the study. We provide an asymptotic analysis of our test-statistic, and
demonstrate in experiments that our test obtains better power than existing
approaches, while being more computationally efficient.
Related papers
- Mitigating LLM Hallucinations via Conformal Abstention [70.83870602967625]
We develop a principled procedure for determining when a large language model should abstain from responding in a general domain.
We leverage conformal prediction techniques to develop an abstention procedure that benefits from rigorous theoretical guarantees on the hallucination rate (error rate)
Experimentally, our resulting conformal abstention method reliably bounds the hallucination rate on various closed-book, open-domain generative question answering datasets.
arXiv Detail & Related papers (2024-04-04T11:32:03Z) - Causal Discovery via Conditional Independence Testing with Proxy Variables [35.3493980628004]
The presence of unobserved variables, such as the latent confounder, can introduce bias in conditional independence testing.
We propose a novel hypothesis-testing procedure that can effectively examine the existence of the causal relationship over continuous variables.
arXiv Detail & Related papers (2023-05-09T09:08:39Z) - Sequential Predictive Two-Sample and Independence Testing [114.4130718687858]
We study the problems of sequential nonparametric two-sample and independence testing.
We build upon the principle of (nonparametric) testing by betting.
arXiv Detail & Related papers (2023-04-29T01:30:33Z) - Sequential Kernelized Independence Testing [101.22966794822084]
We design sequential kernelized independence tests inspired by kernelized dependence measures.
We demonstrate the power of our approaches on both simulated and real data.
arXiv Detail & Related papers (2022-12-14T18:08:42Z) - Adaptive Sequential Surveillance with Network and Temporal Dependence [1.7205106391379026]
Strategic test allocation plays a major role in the control of both emerging and existing pandemics.
Infectious disease surveillance presents unique statistical challenges.
We propose an Online Super Learner for adaptive sequential surveillance.
arXiv Detail & Related papers (2022-12-05T17:04:17Z) - Continuous-Time Modeling of Counterfactual Outcomes Using Neural
Controlled Differential Equations [84.42837346400151]
Estimating counterfactual outcomes over time has the potential to unlock personalized healthcare.
Existing causal inference approaches consider regular, discrete-time intervals between observations and treatment decisions.
We propose a controllable simulation environment based on a model of tumor growth for a range of scenarios.
arXiv Detail & Related papers (2022-06-16T17:15:15Z) - Private Sequential Hypothesis Testing for Statisticians: Privacy, Error
Rates, and Sample Size [24.149533870085175]
We study the sequential hypothesis testing problem under a slight variant of differential privacy, known as Renyi differential privacy.
We present a new private algorithm based on Wald's Sequential Probability Ratio Test (SPRT) that also gives strong theoretical privacy guarantees.
arXiv Detail & Related papers (2022-04-10T04:15:50Z) - Nonparametric Conditional Local Independence Testing [69.31200003384122]
Conditional local independence is an independence relation among continuous time processes.
No nonparametric test of conditional local independence has been available.
We propose such a nonparametric test based on double machine learning.
arXiv Detail & Related papers (2022-03-25T10:31:02Z) - Tracking disease outbreaks from sparse data with Bayesian inference [55.82986443159948]
The COVID-19 pandemic provides new motivation for estimating the empirical rate of transmission during an outbreak.
Standard methods struggle to accommodate the partial observability and sparse data common at finer scales.
We propose a Bayesian framework which accommodates partial observability in a principled manner.
arXiv Detail & Related papers (2020-09-12T20:37:33Z) - Independence Testing for Temporal Data [14.25244839642841]
A fundamental question is whether two time-series are related or not.
Existing approaches often have limitations, such as relying on parametric assumptions.
This paper introduces the temporal dependence statistic with block permutation to test independence between temporal data.
arXiv Detail & Related papers (2019-08-18T17:19:16Z) - Estimating Treatment Effects using Multiple Surrogates: The Role of the Surrogate Score and the Surrogate Index [4.154846138501937]
Estimating the long-term effects of treatments is of interest in many fields.
One approach is to analyze treatments effects on an intermediate outcome, often called a statistical surrogate.
arXiv Detail & Related papers (2016-03-30T19:45:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.