Eliciting Risk Aversion with Inverse Reinforcement Learning via
Interactive Questioning
- URL: http://arxiv.org/abs/2308.08427v1
- Date: Wed, 16 Aug 2023 15:17:57 GMT
- Title: Eliciting Risk Aversion with Inverse Reinforcement Learning via
Interactive Questioning
- Authors: Ziteng Cheng and Anthony Coache and Sebastian Jaimungal
- Abstract summary: This paper proposes a novel framework for identifying an agent's risk aversion using interactive questioning.
We prove that the agent's risk aversion can be identified as the number of questions tends to infinity, and the questions are randomly designed.
Our framework has important applications in robo-advising and provides a new approach for identifying an agent's risk preferences.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes a novel framework for identifying an agent's risk
aversion using interactive questioning. Our study is conducted in two
scenarios: a one-period case and an infinite horizon case. In the one-period
case, we assume that the agent's risk aversion is characterized by a cost
function of the state and a distortion risk measure. In the infinite horizon
case, we model risk aversion with an additional component, a discount factor.
Assuming the access to a finite set of candidates containing the agent's true
risk aversion, we show that asking the agent to demonstrate her optimal
policies in various environment, which may depend on their previous answers, is
an effective means of identifying the agent's risk aversion. Specifically, we
prove that the agent's risk aversion can be identified as the number of
questions tends to infinity, and the questions are randomly designed. We also
develop an algorithm for designing optimal questions and provide empirical
evidence that our method learns risk aversion significantly faster than
randomly designed questions in simulations. Our framework has important
applications in robo-advising and provides a new approach for identifying an
agent's risk preferences.
Related papers
- Data-driven decision-making under uncertainty with entropic risk measure [5.407319151576265]
The entropic risk measure is widely used in high-stakes decision making to account for tail risks associated with an uncertain loss.
To debias the empirical entropic risk estimator, we propose a strongly consistent bootstrapping procedure.
We show that cross validation methods can result in significantly higher out-of-sample risk for the insurer if the bias in validation performance is not corrected for.
arXiv Detail & Related papers (2024-09-30T04:02:52Z) - Risks and NLP Design: A Case Study on Procedural Document QA [52.557503571760215]
We argue that clearer assessments of risks and harms to users will be possible when we specialize the analysis to more concrete applications and their plausible users.
We conduct a risk-oriented error analysis that could then inform the design of a future system to be deployed with lower risk of harm and better performance.
arXiv Detail & Related papers (2024-08-16T17:23:43Z) - RiskBench: A Scenario-based Benchmark for Risk Identification [4.263035319815899]
This work focuses on risk identification, the process of identifying and analyzing risks stemming from dynamic traffic participants and unexpected events.
We introduce textbfRiskBench, a large-scale scenario-based benchmark for risk identification.
We assess the ability of ten algorithms to (1) detect and locate risks, (2) anticipate risks, and (3) facilitate decision-making.
arXiv Detail & Related papers (2023-12-04T06:21:22Z) - RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization [49.26510528455664]
We introduce the Risk-sensitive Individual-Global-Max (RIGM) principle as a generalization of the Individual-Global-Max (IGM) and Distributional IGM (DIGM) principles.
We show that RiskQ can obtain promising performance through extensive experiments.
arXiv Detail & Related papers (2023-11-03T07:18:36Z) - SafeAR: Safe Algorithmic Recourse by Risk-Aware Policies [2.291948092032746]
We present a method to compute recourse policies that consider variability in cost.
We show how existing recourse desiderata can fail to capture the risk of higher costs.
arXiv Detail & Related papers (2023-08-23T18:12:11Z) - Distinguishing Risk Preferences using Repeated Gambles [0.0]
Sequences of repeated gambles provide an experimental tool to characterize risk preferences of humans or artificial decision-making agents.
We show that it becomes increasingly difficult to distinguish the risk preferences of agents as their wealth increases.
arXiv Detail & Related papers (2023-08-14T10:27:58Z) - Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits [81.60136088841948]
We propose an algorithm that minimizes the regret over the horizon of time $T$.
The proposed algorithm is applicable to domains such as recommendation systems and transportation.
arXiv Detail & Related papers (2023-01-31T03:49:00Z) - A Survey of Risk-Aware Multi-Armed Bandits [84.67376599822569]
We review various risk measures of interest, and comment on their properties.
We consider algorithms for the regret minimization setting, where the exploration-exploitation trade-off manifests.
We conclude by commenting on persisting challenges and fertile areas for future research.
arXiv Detail & Related papers (2022-05-12T02:20:34Z) - Efficient Risk-Averse Reinforcement Learning [79.61412643761034]
In risk-averse reinforcement learning (RL), the goal is to optimize some risk measure of the returns.
We prove that under certain conditions this inevitably leads to a local-optimum barrier, and propose a soft risk mechanism to bypass it.
We demonstrate improved risk aversion in maze navigation, autonomous driving, and resource allocation benchmarks.
arXiv Detail & Related papers (2022-05-10T19:40:52Z) - Navigating to the Best Policy in Markov Decision Processes [68.8204255655161]
We investigate the active pure exploration problem in Markov Decision Processes.
Agent sequentially selects actions and, from the resulting system trajectory, aims at the best as fast as possible.
arXiv Detail & Related papers (2021-06-05T09:16:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.