Fugu-MT 論文翻訳(概要): Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason's Selection Task

論文の概要: Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason's Selection Task

arxiv url: http://arxiv.org/abs/2603.06416v1
Date: Fri, 06 Mar 2026 15:55:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-09 13:17:46.181907
Title: Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason's Selection Task
Title（参考訳）: 大規模言語モデルにおけるDeontic Conditional Reasoningの評価:Wasonの選択課題を事例として
Authors: Hirohiko Abe, Kentaro Ozeki, Risako Ando, Takanobu Morishita, Koji Mineshima, Mitsuhiro Okada,
Abstract要約: 本研究では,大言語モデルの条件推論の領域特異性について,デオン規則の下で検討する。結果は、人間のように、LLMはデオン的なルールでより良い理由を示し、マッチングバイアスのようなエラーを表示する。
参考スコア（独自算出の注目度）: 5.120890045747202
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As large language models (LLMs) advance in linguistic competence, their reasoning abilities are gaining increasing attention. In humans, reasoning often performs well in domain specific settings, particularly in normative rather than purely formal contexts. Although prior studies have compared LLM and human reasoning, the domain specificity of LLM reasoning remains underexplored. In this study, we introduce a new Wason Selection Task dataset that explicitly encodes deontic modality to systematically distinguish deontic from descriptive conditionals, and use it to examine LLMs' conditional reasoning under deontic rules. We further analyze whether observed error patterns are better explained by confirmation bias (a tendency to seek rule-supporting evidence) or by matching bias (a tendency to ignore negation and select items that lexically match elements of the rule). Results show that, like humans, LLMs reason better with deontic rules and display matching-bias-like errors. Together, these findings suggest that the performance of LLMs varies systematically across rule types and that their error patterns can parallel well-known human biases in this paradigm.
Abstract（参考訳）: 大きな言語モデル(LLM)が言語能力に進歩するにつれて、その推論能力はますます注目を集めている。人間では、推論はドメイン固有の設定、特に純粋に形式的な文脈ではなく規範的によく機能する。以前の研究ではLSMとヒトの推論を比較していたが、LSM推論の領域特異性は未解明のままである。本研究では,記述的条件からデオン的条件を体系的に識別するために,デオン的モダリティを明示的に符号化した新しいWason Selection Taskデータセットを導入し,デオン的規則の下でのLLMの条件推論の検証に利用した。さらに,確認バイアス(規則を支持する証拠を求める傾向)や一致バイアス(否定を無視し,規則の要素と語彙的に一致する項目を選択する傾向)により,観察された誤りパターンがよりよく説明されるかを分析する。結果は、人間と同様に、LLMはデオン的なルールでより良い理由を示し、マッチングバイアスのようなエラーを表示する。これらの結果から, LLMの性能は規則の種類によって様々に変化し, それらの誤りパターンは, このパラダイムでよく知られた人間のバイアスを並列に受けられることが示唆された。

論文の概要: Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason's Selection Task

関連論文リスト