SNAPE-PM: Building and Utilizing Dynamic Partner Models for Adaptive Explanation Generation
- URL: http://arxiv.org/abs/2505.13053v1
- Date: Mon, 19 May 2025 12:42:23 GMT
- Title: SNAPE-PM: Building and Utilizing Dynamic Partner Models for Adaptive Explanation Generation
- Authors: Amelie S. Robrecht, Christoph R. Kowalski, Stefan Kopp,
- Abstract summary: Adapting to the addressee is crucial for successful explanations, yet poses significant challenges for dialogsystems.<n>We adopt the approach of treating explanation generation as a non-stationary decision process, where the optimal strategy varies according to changing beliefs about the explainee and the interaction context.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Adapting to the addressee is crucial for successful explanations, yet poses significant challenges for dialogsystems. We adopt the approach of treating explanation generation as a non-stationary decision process, where the optimal strategy varies according to changing beliefs about the explainee and the interaction context. In this paper we address the questions of (1) how to track the interaction context and the relevant listener features in a formally defined computational partner model, and (2) how to utilize this model in the dynamically adjusted, rational decision process that determines the currently best explanation strategy. We propose a Bayesian inference-based approach to continuously update the partner model based on user feedback, and a non-stationary Markov Decision Process to adjust decision-making based on the partner model values. We evaluate an implementation of this framework with five simulated interlocutors, demonstrating its effectiveness in adapting to different partners with constant and even changing feedback behavior. The results show high adaptivity with distinct explanation strategies emerging for different partners, highlighting the potential of our approach to improve explainable AI systems and dialogsystems in general.
Related papers
- Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making [22.992898531210326]
In group decision-making (GDM) scenarios, uncertainty, dynamic social structures, and vague information present major challenges.<n>This study proposes a novel social network group decision-making framework that integrates three-way decision (3WD) theory, dynamic network reconstruction, and linguistic opinion representation.
arXiv Detail & Related papers (2025-05-22T15:26:48Z) - Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization [75.1240295759264]
We propose an effective framework for Bridging and Modeling Correlations in pairwise data, named BMC.<n>We increase the consistency and informativeness of the pairwise preference signals through targeted modifications.<n>We identify that DPO alone is insufficient to model these correlations and capture nuanced variations.
arXiv Detail & Related papers (2024-08-14T11:29:47Z) - Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning [51.52387511006586]
We propose Hierarchical Opponent modeling and Planning (HOP), a novel multi-agent decision-making algorithm.
HOP is hierarchically composed of two modules: an opponent modeling module that infers others' goals and learns corresponding goal-conditioned policies.
HOP exhibits superior few-shot adaptation capabilities when interacting with various unseen agents, and excels in self-play scenarios.
arXiv Detail & Related papers (2024-06-12T08:48:06Z) - On Predictive planning and counterfactual learning in active inference [0.20482269513546453]
In this paper, we examine two decision-making schemes in active inference based on 'planning' and 'learning from experience'
We introduce a mixed model that navigates the data-complexity trade-off between these strategies.
We evaluate our proposed model in a challenging grid-world scenario that requires adaptability from the agent.
arXiv Detail & Related papers (2024-03-19T04:02:31Z) - Adaptive Bayesian Learning with Action and State-Dependent Signal
Variance [0.0]
This manuscript presents an advanced framework for Bayesian learning by incorporating action and state-dependent signal variances into decision-making models.
This framework is pivotal in understanding complex data-feedback loops and decision-making processes in various economic systems.
arXiv Detail & Related papers (2023-11-20T17:59:30Z) - JoTR: A Joint Transformer and Reinforcement Learning Framework for
Dialog Policy Learning [53.83063435640911]
Dialogue policy learning (DPL) is a crucial component of dialogue modelling.
We introduce a novel framework, JoTR, to generate flexible dialogue actions.
Unlike traditional methods, JoTR formulates a word-level policy that allows for a more dynamic and adaptable dialogue action generation.
arXiv Detail & Related papers (2023-09-01T03:19:53Z) - Inverse Online Learning: Understanding Non-Stationary and Reactionary
Policies [79.60322329952453]
We show how to develop interpretable representations of how agents make decisions.
By understanding the decision-making processes underlying a set of observed trajectories, we cast the policy inference problem as the inverse to this online learning problem.
We introduce a practical algorithm for retrospectively estimating such perceived effects, alongside the process through which agents update them.
Through application to the analysis of UNOS organ donation acceptance decisions, we demonstrate that our approach can bring valuable insights into the factors that govern decision processes and how they change over time.
arXiv Detail & Related papers (2022-03-14T17:40:42Z) - On Variational Inference for User Modeling in Attribute-Driven
Collaborative Filtering [10.64460581091531]
We present an approach to use causal inference to learn user-attribute affinities through temporal contexts.
We formulate this objective as a Probabilistic Machine Learning problem and apply a variational inference based method to estimate the model parameters.
arXiv Detail & Related papers (2020-12-02T22:39:58Z) - Hybrid Supervised Reinforced Model for Dialogue Systems [2.1485350418225244]
The model copes with both tasks required for Dialogue Management: State Tracking and Decision Making.
The model achieves greater performance, learning speed and robustness than a non-recurrent baseline.
arXiv Detail & Related papers (2020-11-04T12:03:12Z) - Learning an Effective Context-Response Matching Model with
Self-Supervised Tasks for Retrieval-based Dialogues [88.73739515457116]
We introduce four self-supervised tasks including next session prediction, utterance restoration, incoherence detection and consistency discrimination.
We jointly train the PLM-based response selection model with these auxiliary tasks in a multi-task manner.
Experiment results indicate that the proposed auxiliary self-supervised tasks bring significant improvement for multi-turn response selection.
arXiv Detail & Related papers (2020-09-14T08:44:46Z) - Inverse Active Sensing: Modeling and Understanding Timely
Decision-Making [111.07204912245841]
We develop a framework for the general setting of evidence-based decision-making under endogenous, context-dependent time pressure.
We demonstrate how it enables modeling intuitive notions of surprise, suspense, and optimality in decision strategies.
arXiv Detail & Related papers (2020-06-25T02:30:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.