Fugu-MT Paper Translation (Abstract): Persona-Conditioned Risk Behavior in Large Language Models: A Simulated Gambling Study with GPT-4.1

Paper summary: Persona-Conditioned Risk Behavior in Large Language Models: A Simulated Gambling Study with GPT-4.1

arxiv url: http://arxiv.org/abs/2603.15831v1
Date: Mon, 16 Mar 2026 19:03:19 GMT
Status: Translation complete
Last updated in system: 2026-03-18 17:42:06.956036
Title: Persona-Conditioned Risk Behavior in Large Language Models: A Simulated Gambling Study with GPT-4.1
Authors: Sankalp Dubedy
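Abstract summary: This paper describes a controlled experiment in which GPT-4.1 was assigned one of three socioeconomic personas. The model reproduces key behavioral signatures predicted by Kahneman and Tversky's Prospect Theory.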
Reference score (attention score computed independently by this site): 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are increasingly deployed as autonomous agents in uncertain, sequential decision-making contexts. Yet it remains poorly understood whether the behaviors they exhibit in such environments reflect principled cognitive patterns or simply surface-level prompt mimicry. This paper presents a controlled experiment in which GPT-4.1 was assigned one of three socioeconomic personas (Rich, Middle-income, and Poor) and placed in a structured slot-machine environment with three distinct machine configurations: Fair (50%), Biased Low (35%), and Streak (dynamic probability increasing after consecutive losses). Across 50 independent iterations per condition and 6,950 recorded decisions, we find that the model reproduces key behavioral signatures predicted by Kahneman and Tversky's Prospect Theory without being instructed to do so. The Poor persona played a mean of 37.4 rounds per session (SD=15.5) compared to 1.1 rounds for the Rich persona (SD=0.31), a difference that is highly significant (Kruskal-Wallis H=393.5, p<2.2e-16). Risk scores by persona show large effect sizes (Cohen's d=4.15 for Poor vs Rich). Emotional labels appear to function as post-hoc annotations rather than decision drivers (chi-square=3205.4, Cramer's V=0.39), and belief-updating across rounds is negligible (Spearman rho=0.032 for Poor persona, p=0.016). These findings carry implications for LLM agent design, interpretability research, and the broader question of whether classical cognitive economic biases are implicitly encoded in large-scale pretrained language models.
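
The three machine configurations named in the abstract can be illustrated with a minimal Python sketch. The Fair (50%) and Biased Low (35%) win probabilities come from the abstract. The abstract does not give the Streak machine's parameters, so the base probability, per-loss increment, and cap below are assumptions chosen only for illustration. The class and function names are hypothetical, and this is not the authors' implementation.

```python
import random


class SlotMachine:
    """Hypothetical slot machine whose win chance may grow with a losing streak."""

    def __init__(self, base_p, streak_bonus=0.0, max_p=1.0, rng=None):
        self.base_p = base_p              # win probability with no losing streak
        self.streak_bonus = streak_bonus  # added per consecutive loss (0 = static)
        self.max_p = max_p                # upper bound on the win probability
        self.rng = rng if rng is not None else random.Random()
        self.losses_in_a_row = 0

    def win_probability(self):
        """Current win probability given the running count of consecutive losses."""
        boosted = self.base_p + self.streak_bonus * self.losses_in_a_row
        return min(self.max_p, boosted)

    def pull(self):
        """Play one round; return True on a win and update the losing streak."""
        won = self.rng.random() < self.win_probability()
        self.losses_in_a_row = 0 if won else self.losses_in_a_row + 1
        return won


def make_machine(kind, seed=None):
    """Build one of the three configurations described in the abstract."""
    rng = random.Random(seed)
    if kind == "fair":
        return SlotMachine(0.50, rng=rng)
    if kind == "biased_low":
        return SlotMachine(0.35, rng=rng)
    if kind == "streak":
        # ASSUMPTION: base 0.30, +0.10 per consecutive loss, capped at 0.90.
        # The abstract only says the probability increases after consecutive losses.
        return SlotMachine(0.30, streak_bonus=0.10, max_p=0.90, rng=rng)
    raise ValueError(f"unknown machine kind: {kind!r}")


if __name__ == "__main__":
    for kind in ("fair", "biased_low", "streak"):
        machine = make_machine(kind, seed=0)
        wins = sum(machine.pull() for _ in range(10_000))
        print(f"{kind}: empirical win rate {wins / 10_000:.3f}")
```

Under these assumed parameters, the Streak machine's long-run win rate differs from its base probability because each loss raises the chance of winning the next round. A persona-conditioned agent would call `pull()` once per round and decide after each outcome whether to continue.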
