A taxonomy of surprise definitions
- URL: http://arxiv.org/abs/2209.01034v1
- Date: Fri, 2 Sep 2022 13:07:15 GMT
- Title: A taxonomy of surprise definitions
- Authors: Alireza Modirshanechi, Johanni Brea, Wulfram Gerstner
- Abstract summary: We identify 18 mathematical definitions of surprise in a unifying framework.
We classify them into four conceptual categories based on the quantity they measure.
The taxonomy poses the foundation for principled studies of the functional roles and physiological signatures of surprise in the brain.
- Score: 4.849550522970841
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Surprising events trigger measurable brain activity and influence human
behavior by affecting learning, memory, and decision-making. Currently there
is, however, no consensus on the definition of surprise. Here we identify 18
mathematical definitions of surprise in a unifying framework. We first propose
a technical classification of these definitions into three groups based on
their dependence on an agent's belief, show how they relate to each other, and
prove under what conditions they are indistinguishable. Going beyond this
technical analysis, we propose a taxonomy of surprise definitions and classify
them into four conceptual categories based on the quantity they measure: (i)
'prediction surprise' measures a mismatch between a prediction and an
observation; (ii) 'change-point detection surprise' measures the probability of
a change in the environment; (iii) 'confidence-corrected surprise' explicitly
accounts for the effect of confidence; and (iv) 'information gain surprise'
measures the belief-update upon a new observation. The taxonomy poses the
foundation for principled studies of the functional roles and physiological
signatures of surprise in the brain.
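As a rough illustration (not the paper's exact formalism), two of the four categories can be computed for a simple discrete belief. The function names and the coin example below are hypothetical; Shannon surprise and KL divergence are only common instances of their respective categories:

```python
import math

def prediction_surprise(p_obs):
    """Shannon surprise -log p(x): a common instance of 'prediction surprise',
    measuring the mismatch between prediction and observation."""
    return -math.log(p_obs)

def information_gain_surprise(prior, posterior):
    """KL(posterior || prior): a common instance of 'information gain surprise',
    measuring the belief-update caused by a new observation."""
    return sum(q * math.log(q / p) for p, q in zip(prior, posterior) if q > 0)

# Hypothetical example: an agent believes a coin is fair, observes heads
# (probability 0.5 under its belief), and updates its belief toward heads.
prior = [0.5, 0.5]      # P(heads), P(tails) before observing
posterior = [0.8, 0.2]  # belief after observing heads
print(prediction_surprise(prior[0]))                # ≈ 0.693 (= ln 2)
print(information_gain_surprise(prior, posterior))  # ≈ 0.193
```

Note that the prediction surprise depends only on the probability assigned to the observation, while the information-gain surprise depends on how much the belief actually moved, which is exactly the distinction the taxonomy draws between categories (i) and (iv).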
Related papers
- Reasoning about unpredicted change and explicit time [10.220888127527152]
Reasoning about unpredicted change consists of explaining observations by events.
Here we propose an approach for explaining time-stamped observations by surprises, which are simple events consisting of a change in the truth value of a fluent.
arXiv Detail & Related papers (2024-07-09T07:49:57Z) - VOICE: Variance of Induced Contrastive Explanations to quantify Uncertainty in Neural Network Interpretability [15.864519662894034]
We visualize and quantify the predictive uncertainty of gradient-based visual explanations for neural networks.
Visual post hoc explainability techniques highlight features within an image to justify a network's prediction.
We show that every image, network, prediction, and explanatory technique has a unique uncertainty.
arXiv Detail & Related papers (2024-06-01T23:32:29Z) - A Dual-Perspective Approach to Evaluating Feature Attribution Methods [43.16453263420591]
We propose two new perspectives within the faithfulness paradigm that reveal intuitive properties: soundness and completeness.
Soundness assesses the degree to which attributed features are truly predictive features, while completeness examines how well the resulting attribution reveals all the predictive features.
We apply these metrics to mainstream attribution methods, offering a novel lens through which to analyze and compare feature attribution methods.
arXiv Detail & Related papers (2023-08-17T12:41:04Z) - A Semantic Approach to Decidability in Epistemic Planning (Extended Version) [72.77805489645604]
We use a novel semantic approach to achieve decidability.
Specifically, we augment the logic of knowledge S5$_n$ with an interaction axiom called (knowledge) commutativity.
We prove that our framework admits a finitary non-fixpoint characterization of common knowledge, which is of independent interest.
arXiv Detail & Related papers (2023-07-28T11:26:26Z) - Quantifying Aleatoric and Epistemic Uncertainty in Machine Learning: Are Conditional Entropy and Mutual Information Appropriate Measures? [2.1655448059430222]
A common approach quantifies aleatoric and epistemic uncertainty in terms of conditional entropy and mutual information.
We identify various incoherencies that call their appropriateness into question.
Experiments across different computer vision tasks support our theoretical findings.
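The decomposition under scrutiny in this entry can be sketched in a few lines (a minimal illustration, assuming an ensemble of categorical predictive distributions; the numbers are hypothetical): total predictive entropy splits into the expected member entropy (aleatoric) plus the mutual information between model choice and prediction (epistemic).

```python
import math

def entropy(p):
    """Shannon entropy of a discrete distribution, in nats."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def decompose(ensemble):
    """Split H(mean prediction) into aleatoric (expected per-member entropy)
    and epistemic (mutual information, i.e. the Jensen gap) components."""
    mean = [sum(col) / len(ensemble) for col in zip(*ensemble)]
    total = entropy(mean)
    aleatoric = sum(entropy(p) for p in ensemble) / len(ensemble)
    epistemic = total - aleatoric  # >= 0 by concavity of entropy
    return total, aleatoric, epistemic

# Hypothetical ensemble: two members that disagree strongly, so the
# disagreement (epistemic) term dominates the decomposition.
members = [[0.9, 0.1], [0.1, 0.9]]
total, alea, epi = decompose(members)
print(total, alea, epi)  # ≈ 0.693, ≈ 0.325, ≈ 0.368
```

The paper's point is that, despite this clean additive form, the two terms can behave incoherently as measures of the underlying uncertainties.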
arXiv Detail & Related papers (2022-09-07T17:05:20Z) - What Should I Know? Using Meta-gradient Descent for Predictive Feature Discovery in a Single Stream of Experience [63.75363908696257]
Computational reinforcement learning seeks to construct an agent's perception of the world through predictions of future sensations.
An open challenge in this line of work is determining from the infinitely many predictions that the agent could possibly make which predictions might best support decision-making.
We introduce a meta-gradient descent process by which an agent learns 1) what predictions to make, 2) the estimates for its chosen predictions, and 3) how to use those estimates to generate policies that maximize future reward.
arXiv Detail & Related papers (2022-06-13T21:31:06Z) - Measuring Fairness of Text Classifiers via Prediction Sensitivity [63.56554964580627]
ACCUMULATED PREDICTION SENSITIVITY measures fairness in machine learning models based on the model's prediction sensitivity to perturbations in input features.
We show that the metric can be theoretically linked with a specific notion of group fairness (statistical parity) and individual fairness.
arXiv Detail & Related papers (2022-03-16T15:00:33Z) - Empirical Estimates on Hand Manipulation are Recoverable: A Step Towards Individualized and Explainable Robotic Support in Everyday Activities [80.37857025201036]
A key challenge for robotic systems is to figure out the behavior of another agent.
Drawing correct inferences is especially challenging when (confounding) factors are not controlled experimentally.
We propose equipping robots with the necessary tools to conduct observational studies on people.
arXiv Detail & Related papers (2022-01-27T22:15:56Z) - Ensemble-based Uncertainty Quantification: Bayesian versus Credal Inference [0.0]
We consider ensemble-based approaches to uncertainty quantification.
We specifically focus on Bayesian methods and approaches based on so-called credal sets.
The effectiveness of corresponding measures is evaluated and compared in an empirical study on classification with a reject option.
arXiv Detail & Related papers (2021-07-21T22:47:24Z) - Nested Counterfactual Identification from Arbitrary Surrogate Experiments [95.48089725859298]
We study the identification of nested counterfactuals from an arbitrary combination of observations and experiments.
Specifically, we prove the counterfactual unnesting theorem (CUT), which allows one to map arbitrary nested counterfactuals to unnested ones.
arXiv Detail & Related papers (2021-07-07T12:51:04Z) - DEUP: Direct Epistemic Uncertainty Prediction [56.087230230128185]
Epistemic uncertainty is the part of out-of-sample prediction error that is due to the learner's lack of knowledge.
We propose a principled approach for directly estimating epistemic uncertainty by learning to predict generalization error and subtracting an estimate of aleatoric uncertainty.
arXiv Detail & Related papers (2021-02-16T23:50:35Z)
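The subtraction at the heart of the DEUP entry above amounts to a one-line estimator. This is a sketch under the assumption that estimates of the total generalization error and of the aleatoric noise are already available; the function and argument names are hypothetical:

```python
def deup_epistemic(predicted_generalization_error, aleatoric_estimate):
    """DEUP-style estimate: epistemic uncertainty is what remains of the
    predicted out-of-sample error after removing irreducible (aleatoric)
    noise. Clamped at zero since uncertainty cannot be negative."""
    return max(0.0, predicted_generalization_error - aleatoric_estimate)

# Hypothetical numbers: a learned error predictor expects 0.35 loss, and a
# noise model attributes 0.20 of that to irreducible label noise.
print(round(deup_epistemic(0.35, 0.20), 2))  # 0.15
```

In the actual method both inputs are learned models rather than scalars; the sketch only shows the arithmetic that defines the epistemic estimate.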
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the listed information and is not responsible for any consequences of its use.