Three-quarter Sibling Regression for Denoising Observational Data
- URL: http://arxiv.org/abs/2101.00074v1
- Date: Thu, 31 Dec 2020 21:18:01 GMT
- Title: Three-quarter Sibling Regression for Denoising Observational Data
- Authors: Shiv Shankar, Daniel Sheldon, Tao Sun, John Pickering, and Thomas G.
Dietterich
- Abstract summary: 'Half-sibling regression' can detect and correct for systematic errors in measurements of multiple independent random variables.
It does not apply to many situations, including modeling of species counts that are controlled by common causes.
We present a technique called 'three-quarter sibling regression' to partially overcome this limitation.
- Score: 24.91564825369108
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many ecological studies and conservation policies are based on field
observations of species, which can be affected by systematic variability
introduced by the observation process. A recently introduced causal modeling
technique called 'half-sibling regression' can detect and correct for
systematic errors in measurements of multiple independent random variables.
However, it will remove intrinsic variability if the variables are dependent,
and therefore does not apply to many situations, including modeling of species
counts that are controlled by common causes. We present a technique called
'three-quarter sibling regression' to partially overcome this limitation. It
can filter the effect of systematic noise when the latent variables have
observed common causes. We provide theoretical justification of this approach,
demonstrate its effectiveness on synthetic data, and show that it reduces
systematic detection variability due to moon brightness in moth surveys.
Related papers
- A Bayesian Classification Trees Approach to Treatment Effect Variation with Noncompliance [0.5356944479760104]
Estimating varying treatment effects in randomized trials with noncompliance is inherently challenging.
Existing flexible machine learning methods are highly sensitive to the weak instruments problem.
We present a Bayesian Causal Forest model for binary response variables in scenarios with noncompliance.
arXiv Detail & Related papers (2024-08-14T18:33:55Z) - Simultaneous inference for generalized linear models with unmeasured confounders [0.0]
We propose a unified statistical estimation and inference framework that harnesses structures and integrates linear projections into three key stages.
We show effective Type-I error control of $z$-tests as sample and response sizes approach infinity.
arXiv Detail & Related papers (2023-09-13T18:53:11Z) - A Causal Framework for Decomposing Spurious Variations [68.12191782657437]
We develop tools for decomposing spurious variations in Markovian and Semi-Markovian models.
We prove the first results that allow a non-parametric decomposition of spurious effects.
The described approach has several applications, ranging from explainable and fair AI to questions in epidemiology and medicine.
arXiv Detail & Related papers (2023-06-08T09:40:28Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Causality-Based Multivariate Time Series Anomaly Detection [63.799474860969156]
We formulate the anomaly detection problem from a causal perspective and view anomalies as instances that do not follow the regular causal mechanism to generate the multivariate data.
We then propose a causality-based anomaly detection approach, which first learns the causal structure from data and then infers whether an instance is an anomaly relative to the local causal mechanism.
We evaluate our approach with both simulated and public datasets as well as a case study on real-world AIOps applications.
arXiv Detail & Related papers (2022-06-30T06:00:13Z) - Causal Identification with Additive Noise Models: Quantifying the Effect
of Noise [5.037636944933989]
This work investigates the impact of different noise levels on the ability of Additive Noise Models to identify the direction of the causal relationship.
We use an exhaustive range of models where the level of additive noise gradually changes from 1% to 10000% of the causes' noise level.
The results of the experiments show that ANMs methods can fail to capture the true causal direction for some levels of noise.
arXiv Detail & Related papers (2021-10-15T13:28:33Z) - Sibling Regression for Generalized Linear Models [22.16690904610619]
Field observations form the basis of many scientific studies, especially in ecological and social sciences.
Despite efforts to conduct such surveys in a standardized way, observations can be prone to systematic measurement errors.
Existing non-parametric techniques for correcting such errors assume linear additive noise models.
We present an approach based on residual functions to address this limitation.
arXiv Detail & Related papers (2021-07-03T04:07:11Z) - Deconfounded Score Method: Scoring DAGs with Dense Unobserved
Confounding [101.35070661471124]
We show that unobserved confounding leaves a characteristic footprint in the observed data distribution that allows for disentangling spurious and causal effects.
We propose an adjusted score-based causal discovery algorithm that may be implemented with general-purpose solvers and scales to high-dimensional problems.
arXiv Detail & Related papers (2021-03-28T11:07:59Z) - Efficient Causal Inference from Combined Observational and
Interventional Data through Causal Reductions [68.6505592770171]
Unobserved confounding is one of the main challenges when estimating causal effects.
We propose a novel causal reduction method that replaces an arbitrary number of possibly high-dimensional latent confounders.
We propose a learning algorithm to estimate the parameterized reduced model jointly from observational and interventional data.
arXiv Detail & Related papers (2021-03-08T14:29:07Z) - Learning from Aggregate Observations [82.44304647051243]
We study the problem of learning from aggregate observations where supervision signals are given to sets of instances.
We present a general probabilistic framework that accommodates a variety of aggregate observations.
Simple maximum likelihood solutions can be applied to various differentiable models.
arXiv Detail & Related papers (2020-04-14T06:18:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.