Full Law Identification In Graphical Models Of Missing Data:
Completeness Results
- URL: http://arxiv.org/abs/2004.04872v3
- Date: Mon, 31 Aug 2020 14:28:45 GMT
- Title: Full Law Identification In Graphical Models Of Missing Data:
Completeness Results
- Authors: Razieh Nabi, Rohit Bhattacharya, Ilya Shpitser
- Abstract summary: We provide the first completeness result in this field of study.
We then address issues that may arise due to the presence of both missing data and unmeasured confounding.
- Score: 13.299431908881425
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Missing data has the potential to affect analyses conducted in all fields of
scientific study, including healthcare, economics, and the social sciences.
Several approaches to unbiased inference in the presence of non-ignorable
missingness rely on the specification of the target distribution and its
missingness process as a probability distribution that factorizes with respect
to a directed acyclic graph. In this paper, we address the longstanding
question of the characterization of models that are identifiable within this
class of missing data distributions. We provide the first completeness result
in this field of study -- necessary and sufficient graphical conditions under
which, the full data distribution can be recovered from the observed data
distribution. We then simultaneously address issues that may arise due to the
presence of both missing data and unmeasured confounding, by extending these
graphical conditions and proofs of completeness, to settings where some
variables are not just missing, but completely unobserved.
Related papers
- On Different Notions of Redundancy in Conditional-Independence-Based Discovery of Graphical Models [11.384020227038961]
We show that due to the conciseness of the graphical representation, there are often many tests that are not used in the construction of the graph.
We show that not all tests contain this additional information and that such redundant tests have to be applied with care.
arXiv Detail & Related papers (2025-02-12T16:08:48Z) - Out-of-Distribution Detection on Graphs: A Survey [58.47395497985277]
Graph out-of-distribution (GOOD) detection focuses on identifying graph data that deviates from the distribution seen during training.
We categorize existing methods into four types: enhancement-based, reconstruction-based, information propagation-based, and classification-based approaches.
We discuss practical applications and theoretical foundations, highlighting the unique challenges posed by graph data.
arXiv Detail & Related papers (2025-02-12T04:07:12Z) - Identifiable Latent Neural Causal Models [82.14087963690561]
Causal representation learning seeks to uncover latent, high-level causal representations from low-level observed data.
We determine the types of distribution shifts that do contribute to the identifiability of causal representations.
We translate our findings into a practical algorithm, allowing for the acquisition of reliable latent causal representations.
arXiv Detail & Related papers (2024-03-23T04:13:55Z) - Non-Asymptotic Performance of Social Machine Learning Under Limited Data [45.48644055449902]
This paper studies the probability of error associated with the social machine learning framework.
It addresses the problem of classifying a stream of unlabeled data in a distributed manner.
arXiv Detail & Related papers (2023-06-15T17:42:14Z) - Sufficient Identification Conditions and Semiparametric Estimation under
Missing Not at Random Mechanisms [4.211128681972148]
Conducting valid statistical analyses is challenging in the presence of missing-not-at-random (MNAR) data.
We consider a MNAR model that generalizes several prior popular MNAR models in two ways.
We propose methods for testing the independence restrictions encoded in such models using odds ratio as our parameter of interest.
arXiv Detail & Related papers (2023-06-10T13:46:16Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Causal and Counterfactual Views of Missing Data Models [11.670103549157387]
We show how identification assumptions in missing data can be encoded in terms of graphical models.
We note interesting similarities and differences between missing data and causal identification theories.
arXiv Detail & Related papers (2022-10-11T15:53:15Z) - Assessing Fairness in the Presence of Missing Data [2.3605348648054463]
We study the problem of estimating fairness in the complete data domain for an arbitrary model evaluated merely using complete cases.
Our work provides the first known theoretical results on fairness guarantee in analysis of incomplete data.
arXiv Detail & Related papers (2021-12-07T17:51:26Z) - OR-Net: Pointwise Relational Inference for Data Completion under Partial
Observation [51.083573770706636]
This work uses relational inference to fill in the incomplete data.
We propose Omni-Relational Network (OR-Net) to model the pointwise relativity in two aspects.
arXiv Detail & Related papers (2021-05-02T06:05:54Z) - The Hidden Uncertainty in a Neural Networks Activations [105.4223982696279]
The distribution of a neural network's latent representations has been successfully used to detect out-of-distribution (OOD) data.
This work investigates whether this distribution correlates with a model's epistemic uncertainty, thus indicating its ability to generalise to novel inputs.
arXiv Detail & Related papers (2020-12-05T17:30:35Z) - Graph Embedding with Data Uncertainty [113.39838145450007]
spectral-based subspace learning is a common data preprocessing step in many machine learning pipelines.
Most subspace learning methods do not take into consideration possible measurement inaccuracies or artifacts that can lead to data with high uncertainty.
arXiv Detail & Related papers (2020-09-01T15:08:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.