Related papers: Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality

URL: http://arxiv.org/abs/2505.19376v1
Date: Mon, 26 May 2025 00:21:38 GMT
Title: Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality
Authors: Lance Ying, Almog Hillel, Ryan Truong, Vikash K. Mansinghka, Joshua B. Tenenbaum, Tan Zhi-Xuan
Abstract summary: We investigate the hypothesis that people prefer to attribute beliefs that are good explanations for the behavior they observe. We develop a computational model that quantifies the explanatory strength of a (natural language) statement about an agent's beliefs. Using this model, we study the role of each factor in how people selectively attribute beliefs to other agents.
Score: 42.943294683967046
License: http://creativecommons.org/licenses/by/4.0/
Abstract: A key feature of human theory-of-mind is the ability to attribute beliefs to other agents as mentalistic explanations for their behavior. But given the wide variety of beliefs that agents may hold about the world and the rich language we can use to express them, which specific beliefs are people inclined to attribute to others? In this paper, we investigate the hypothesis that people prefer to attribute beliefs that are good explanations for the behavior they observe. We develop a computational model that quantifies the explanatory strength of a (natural language) statement about an agent's beliefs via three factors: accuracy, informativity, and causal relevance to actions, each of which can be computed from a probabilistic generative model of belief-driven behavior. Using this model, we study the role of each factor in how people selectively attribute beliefs to other agents. We investigate this via an experiment where participants watch an agent collect keys hidden in boxes in order to reach a goal, then rank a set of statements describing the agent's beliefs about the boxes' contents. We find that accuracy and informativity perform reasonably well at predicting these rankings when combined, but that causal relevance is the single factor that best explains participants' responses.

Related papers

A Descriptive and Normative Theory of Human Beliefs in RLHF [12.627454162208846]
We propose that human beliefs about the capabilities of the agent being trained also play a key role in preference generation. We show through synthetic experiments that it is often suboptimal for human preference labelers to assume agent optimality.
arXiv Detail & Related papers (2025-06-02T13:52:55Z)
Plasticity as the Mirror of Empowerment [94.91580596320331]
We show that plasticity is identical to the empowerment of the environment. We suggest that plasticity, empowerment, and their relationship are essential to understanding agency.
arXiv Detail & Related papers (2025-05-15T14:52:16Z)
Generating Causal Explanations of Vehicular Agent Behavioural Interactions with Learnt Reward Profiles [13.450023647228843]
We learn a weighting of reward metrics for agents such that explanations for agent interactions can be causally inferred. We validate our approach quantitatively and qualitatively across three real-world driving datasets.
arXiv Detail & Related papers (2025-03-18T01:53:59Z)
Tell Me Why: Incentivizing Explanations [3.2754470919268543]
There is no known mechanism that provides incentives to elicit explanations for beliefs from agents. Standard Bayesian models make assumptions that preempt the need for explanations. This work argues that rationales (explanations of an agent's private information) lead to more efficient aggregation.
arXiv Detail & Related papers (2025-02-19T03:47:34Z)
Grounding Language about Belief in a Bayesian Theory-of-Mind [5.058204320571824]
We take a step towards an answer by grounding the semantics of belief statements in a Bayesian theory-of-mind. By modeling how humans jointly infer coherent sets of goals, beliefs, and plans, our framework provides a conceptual role semantics for belief. We evaluate this framework by studying how humans attribute goals and beliefs while watching an agent solve a doors-and-keys gridworld puzzle.
arXiv Detail & Related papers (2024-02-16T02:47:09Z)
Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach [61.04606493712002]
Susceptibility to misinformation describes the degree of belief in unverifiable claims that is not observable. Existing susceptibility studies heavily rely on self-reported beliefs. We propose a computational approach to model users' latent susceptibility levels.
arXiv Detail & Related papers (2023-11-16T07:22:56Z)
Properties from Mechanisms: An Equivariance Perspective on Identifiable Representation Learning [79.4957965474334]
A key goal of unsupervised representation learning is "inverting" a data generating process to recover its latent properties. This paper asks, "Can we instead identify latent properties by leveraging knowledge of the mechanisms that govern their evolution?" We provide a complete characterization of the sources of non-identifiability as we vary knowledge about a set of possible mechanisms.
arXiv Detail & Related papers (2021-10-29T14:04:08Z)
AGENT: A Benchmark for Core Psychological Reasoning [60.35621718321559]
Intuitive psychology is the ability to reason about hidden mental variables that drive observable actions. Despite recent interest in machine agents that reason about other agents, it is not clear if such agents learn or hold the core psychology principles that drive human reasoning. We present a benchmark consisting of procedurally generated 3D animations, AGENT, structured around four scenarios.
arXiv Detail & Related papers (2021-02-24T14:58:23Z)
Maximizing Information Gain in Partially Observable Environments via Prediction Reward [64.24528565312463]
This paper tackles the challenge of using belief-based rewards for a deep RL agent. We derive the exact error between negative entropy and the expected prediction reward. This insight provides theoretical motivation for several fields using prediction rewards.
arXiv Detail & Related papers (2020-05-11T08:13:49Z)
Towards the Role of Theory of Mind in Explanation [23.818659473644505]
Theory of Mind is the ability to attribute mental states (e.g., beliefs, goals) to oneself and to others. Previous work has observed that Theory of Mind capabilities are central to providing an explanation to another agent.
arXiv Detail & Related papers (2020-05-06T17:13:46Z)
