Energy-Based Learning for Cooperative Games, with Applications to
Feature/Data/Model Valuations
- URL: http://arxiv.org/abs/2106.02938v1
- Date: Sat, 5 Jun 2021 17:39:04 GMT
- Title: Energy-Based Learning for Cooperative Games, with Applications to
Feature/Data/Model Valuations
- Authors: Yatao Bian, Yu Rong, Tingyang Xu, Jiaxiang Wu, Andreas Krause, Junzhou
Huang
- Abstract summary: We present a novel energy-based treatment for cooperative games, with a theoretical justification by the maximum entropy framework.
Surprisingly, by conducting variational inference of the energy-based model, we recover various game-theoretic valuation criteria, such as Shapley value and Banzhaf index.
We experimentally demonstrate that the proposed Variational Index enjoys intriguing properties on certain synthetic and real-world valuation problems.
- Score: 91.36803653600667
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Valuation problems, such as attribution-based feature interpretation, data
valuation and model valuation for ensembles, become increasingly more important
in many machine learning applications. Such problems are commonly solved by
well-known game-theoretic criteria, such as Shapley value or Banzhaf index. In
this work, we present a novel energy-based treatment for cooperative games,
with a theoretical justification by the maximum entropy framework.
Surprisingly, by conducting variational inference of the energy-based model, we
recover various game-theoretic valuation criteria, such as Shapley value and
Banzhaf index, through conducting one-step gradient ascent for maximizing the
mean-field ELBO objective. This observation also verifies the rationality of
existing criteria, as they are all trying to decouple the correlations among
the players through the mean-field approach. By running gradient ascent for
multiple steps, we achieve a trajectory of the valuations, among which we
define the valuation with the best conceivable decoupling error as the
Variational Index. We experimentally demonstrate that the proposed Variational
Index enjoys intriguing properties on certain synthetic and real-world
valuation problems.
Related papers
- Statistical ranking with dynamic covariates [6.729750785106628]
We introduce an efficient alternating algorithm to compute the likelihood estimator (MLE)
A comprehensive numerical study is conducted to corroborate our theoretical findings and demonstrate the application of the proposed model to real-world datasets, including horse racing and tennis competitions.
arXiv Detail & Related papers (2024-06-24T10:26:05Z) - XPose: eXplainable Human Pose Estimation [21.738680136615127]
XPose is a framework that incorporates Explainable AI (XAI) principles into pose estimation.
Group Shapley Value (GSV) organizes keypoints into clusters based on their interdependencies.
GSV meticulously calculates Shapley value for keypoints, while for inter-cluster keypoints, it opts for a more holistic group-level valuation.
arXiv Detail & Related papers (2024-03-19T02:29:34Z) - Towards stable real-world equation discovery with assessing
differentiating quality influence [52.2980614912553]
We propose alternatives to the commonly used finite differences-based method.
We evaluate these methods in terms of applicability to problems, similar to the real ones, and their ability to ensure the convergence of equation discovery algorithms.
arXiv Detail & Related papers (2023-11-09T23:32:06Z) - Sample Complexity Bounds for Score-Matching: Causal Discovery and
Generative Modeling [82.36856860383291]
We demonstrate that accurate estimation of the score function is achievable by training a standard deep ReLU neural network.
We establish bounds on the error rate of recovering causal relationships using the score-matching-based causal discovery method.
arXiv Detail & Related papers (2023-10-27T13:09:56Z) - Between Rate-Distortion Theory & Value Equivalence in Model-Based
Reinforcement Learning [21.931580762349096]
We introduce an algorithm for synthesizing simple and useful approximations of the environment from which an agent might still recover near-optimal behavior.
We recognize the information-theoretic nature of this lossy environment compression problem and use the appropriate tools of rate-distortion theory to make mathematically precise how value equivalence can lend tractability to otherwise intractable sequential decision-making problems.
arXiv Detail & Related papers (2022-06-04T17:09:46Z) - Rectified Max-Value Entropy Search for Bayesian Optimization [54.26984662139516]
We develop a rectified MES acquisition function based on the notion of mutual information.
As a result, RMES shows a consistent improvement over MES in several synthetic function benchmarks and real-world optimization problems.
arXiv Detail & Related papers (2022-02-28T08:11:02Z) - Holistic Deep Learning [3.718942345103135]
This paper presents a novel holistic deep learning framework that addresses the challenges of vulnerability to input perturbations, overparametrization, and performance instability.
The proposed framework holistically improves accuracy, robustness, sparsity, and stability over standard deep learning models.
arXiv Detail & Related papers (2021-10-29T14:46:32Z) - Leveraging Global Parameters for Flow-based Neural Posterior Estimation [90.21090932619695]
Inferring the parameters of a model based on experimental observations is central to the scientific method.
A particularly challenging setting is when the model is strongly indeterminate, i.e., when distinct sets of parameters yield identical observations.
We present a method for cracking such indeterminacy by exploiting additional information conveyed by an auxiliary set of observations sharing global parameters.
arXiv Detail & Related papers (2021-02-12T12:23:13Z) - The Value Equivalence Principle for Model-Based Reinforcement Learning [29.368870568214007]
We argue that the limited representational resources of model-based RL agents are better used to build models that are directly useful for value-based planning.
We show that, as we augment the set of policies and functions considered, the class of value equivalent models shrinks.
We argue that the principle of value equivalence underlies a number of recent empirical successes in RL.
arXiv Detail & Related papers (2020-11-06T18:25:54Z) - GenDICE: Generalized Offline Estimation of Stationary Values [108.17309783125398]
We show that effective estimation can still be achieved in important applications.
Our approach is based on estimating a ratio that corrects for the discrepancy between the stationary and empirical distributions.
The resulting algorithm, GenDICE, is straightforward and effective.
arXiv Detail & Related papers (2020-02-21T00:27:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.