Hoeffding decomposition of black-box models with dependent inputs
- URL: http://arxiv.org/abs/2310.06567v2
- Date: Thu, 7 Mar 2024 09:14:24 GMT
- Title: Hoeffding decomposition of black-box models with dependent inputs
- Authors: Marouane Il Idrissi (EDF R\&D PRISME, IMT, SINCLAIR AI Lab), Nicolas
Bousquet (EDF R\&D PRISME, SINCLAIR AI Lab, LPSM (UMR\_8001)), Fabrice Gamboa
(IMT), Bertrand Iooss (EDF R\&D PRISME, IMT, SINCLAIR AI Lab, RT-UQ),
Jean-Michel Loubes (IMT)
- Abstract summary: One of the main challenges for interpreting black-box models is the ability to decompose square-integrable functions of non-independent random inputs into a sum of functions of every possible subset of variables.
We show that under two reasonable assumptions, it is always possible to decompose such a function uniquely.
The elements of this decomposition can be expressed using projections and allow for novel interpretability indices.
- Score: 22.519758624657644
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the main challenges for interpreting black-box models is the ability
to uniquely decompose square-integrable functions of non-independent random
inputs into a sum of functions of every possible subset of variables. However,
dealing with dependencies among inputs can be complicated. We propose a novel
framework to study this problem, linking three domains of mathematics:
probability theory, functional analysis, and combinatorics. We show that, under
two reasonable assumptions on the inputs (non-perfect functional dependence and
non-degenerate stochastic dependence), it is always possible to decompose such
a function uniquely. This generalizes the well-known Hoeffding decomposition.
The elements of this decomposition can be expressed using oblique projections
and allow for novel interpretability indices for evaluation and variance
decomposition purposes. The properties of these novel indices are studied and
discussed. This generalization offers a path towards a more precise uncertainty
quantification, which can benefit sensitivity analysis and interpretability
studies whenever the inputs are dependent. This decomposition is illustrated
analytically, and the challenges for adopting these results in practice are
discussed.
Related papers
- Loss-Complexity Landscape and Model Structure Functions [56.01537787608726]
We develop a framework for dualizing the Kolmogorov structure function $h_x(alpha)$.<n>We establish a mathematical analogy between information-theoretic constructs and statistical mechanics.<n>We explicitly prove the Legendre-Fenchel duality between the structure function and free energy.
arXiv Detail & Related papers (2025-07-17T21:31:45Z) - Linear causal disentanglement via higher-order cumulants [0.0]
We study the identifiability of linear causal disentanglement, assuming access to data under multiple contexts.
We show that one perfect intervention on each latent variable is sufficient and in the worst case necessary to recover parameters under perfect interventions.
arXiv Detail & Related papers (2024-07-05T15:53:16Z) - Understanding Diffusion Models by Feynman's Path Integral [2.4373900721120285]
We introduce a novel formulation of diffusion models using Feynman's integral path.
We find this formulation providing comprehensive descriptions of score-based generative models.
We also demonstrate the derivation of backward differential equations and loss functions.
arXiv Detail & Related papers (2024-03-17T16:24:29Z) - Logistic-beta processes for dependent random probabilities with beta marginals [58.91121576998588]
We propose a novel process called the logistic-beta process, whose logistic transformation yields a process with common beta marginals.
It can model dependence on both discrete and continuous domains, such as space or time, and has a flexible dependence structure through correlation kernels.
We illustrate the benefits through nonparametric binary regression and conditional density estimation examples, both in simulation studies and in a pregnancy outcome application.
arXiv Detail & Related papers (2024-02-10T21:41:32Z) - Nonparametric Partial Disentanglement via Mechanism Sparsity: Sparse
Actions, Interventions and Sparse Temporal Dependencies [58.179981892921056]
This work introduces a novel principle for disentanglement we call mechanism sparsity regularization.
We propose a representation learning method that induces disentanglement by simultaneously learning the latent factors.
We show that the latent factors can be recovered by regularizing the learned causal graph to be sparse.
arXiv Detail & Related papers (2024-01-10T02:38:21Z) - The Geometry of Causality [0.0]
We provide a unified framework for the study of causality, non-locality and contextuality.
We define causaltopes, for arbitrary spaces of input histories and arbitrary choices of input contexts.
We introduce a notion of causal separability relative to arbitrary causal constraints.
arXiv Detail & Related papers (2023-03-16T01:11:47Z) - Axiomatic characterization of pointwise Shapley decompositions [0.0]
A common problem in various applications is the additive decomposition of the output of a function with respect to its input variables.
In this paper, axioms are developed which fully preserve functional structures and lead to unique decompositions for all Borel measurable functions.
arXiv Detail & Related papers (2023-03-14T10:24:48Z) - Identifying Weight-Variant Latent Causal Models [82.14087963690561]
We find that transitivity acts as a key role in impeding the identifiability of latent causal representations.
Under some mild assumptions, we can show that the latent causal representations can be identified up to trivial permutation and scaling.
We propose a novel method, termed Structural caUsAl Variational autoEncoder, which directly learns latent causal representations and causal relationships among them.
arXiv Detail & Related papers (2022-08-30T11:12:59Z) - Data-Driven Influence Functions for Optimization-Based Causal Inference [105.5385525290466]
We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing.
We study the case where probability distributions are not known a priori but need to be estimated from data.
arXiv Detail & Related papers (2022-08-29T16:16:22Z) - Inference on Strongly Identified Functionals of Weakly Identified
Functions [71.42652863687117]
We study a novel condition for the functional to be strongly identified even when the nuisance function is not.
We propose penalized minimax estimators for both the primary and debiasing nuisance functions.
arXiv Detail & Related papers (2022-08-17T13:38:31Z) - Exploiting Independent Instruments: Identification and Distribution
Generalization [3.701112941066256]
We exploit the independence for distribution generalization by taking into account higher moments.
We prove that the proposed estimator is invariant to distributional shifts on the instruments.
These results hold even in the under-identified case where the instruments are not sufficiently rich to identify the causal function.
arXiv Detail & Related papers (2022-02-03T21:49:04Z) - Exponentially Weighted l_2 Regularization Strategy in Constructing
Reinforced Second-order Fuzzy Rule-based Model [72.57056258027336]
In the conventional Takagi-Sugeno-Kang (TSK)-type fuzzy models, constant or linear functions are usually utilized as the consequent parts of the fuzzy rules.
We introduce an exponential weight approach inspired by the weight function theory encountered in harmonic analysis.
arXiv Detail & Related papers (2020-07-02T15:42:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.