Fugu-MT paper translation (abstract): Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

Paper overview: Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

arxiv url: http://arxiv.org/abs/2412.11155v1
Date: Sun, 15 Dec 2024 11:08:58 GMT
Status: translation complete
Last updated in system: 2024-12-17 15:50:00.041982
Title: Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting
Title (reference translation): Partial Identifiability in Inverse Reinforcement Learning for Agents with Non-Exponential Discounting
Authors: Joar Skalse, Alessandro Abate
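Abstract summary: Inverse reinforcement learning aims to infer an agent's preferences from observations of its behaviour. One of the main challenges in IRL is that multiple preferences can lead to the same observed behaviour. In general, IRL is shown to be unable to infer enough information about $R$ to identify the correct optimal policy.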
Reference score (independently computed attention score): 64.13583792391783
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The aim of inverse reinforcement learning (IRL) is to infer an agent's preferences from observing their behaviour. Usually, preferences are modelled as a reward function, $R$, and behaviour is modelled as a policy, $\pi$. One of the central difficulties in IRL is that multiple preferences may lead to the same observed behaviour. That is, $R$ is typically underdetermined by $\pi$, which means that $R$ is only partially identifiable. Recent work has characterised the extent of this partial identifiability for different types of agents, including optimal and Boltzmann-rational agents. However, work so far has only considered agents that discount future reward exponentially: this is a serious limitation, especially given that extensive work in the behavioural sciences suggests that humans are better modelled as discounting hyperbolically. In this work, we newly characterise partial identifiability in IRL for agents with non-exponential discounting: our results are in particular relevant for hyperbolical discounting, but they also more generally apply to agents that use other types of (non-exponential) discounting. We significantly show that generally IRL is unable to infer enough information about $R$ to identify the correct optimal policy, which entails that IRL alone can be insufficient to adequately characterise the preferences of such agents.
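Abstract (reference translation): The aim of inverse reinforcement learning (IRL) is to infer an agent's preferences from observing its behaviour. Usually, preferences are modelled as a reward function, $R$, and behaviour is modelled as a policy, $\pi$. One of the main challenges in IRL is that multiple preferences may lead to the same observed behaviour. That is, $R$ is typically underdetermined by $\pi$. Recent work has characterised the extent of this partial identifiability for various types of agents, including optimal and Boltzmann-rational agents. However, work so far has only considered agents that discount future reward exponentially. This is a serious limitation, especially given that extensive work in the behavioural sciences suggests that humans are better modelled as discounting hyperbolically. This work characterises partial identifiability in IRL for agents with non-exponential discounting; the results are particularly relevant to hyperbolic discounting, but also apply more generally to agents that use other types of non-exponential discounting. It shows that, in general, IRL cannot infer enough information about $R$ to identify the correct optimal policy, so IRL alone can be insufficient to adequately characterise the preferences of such agents.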

Related papers