Fugu-MT 論文翻訳(概要): Do Machines Think Emotionally? Cognitive Appraisal Analysis of Large Language Models

論文の概要: Do Machines Think Emotionally? Cognitive Appraisal Analysis of Large Language Models

arxiv url: http://arxiv.org/abs/2508.05880v1
Date: Thu, 07 Aug 2025 22:19:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:06.022414
Title: Do Machines Think Emotionally? Cognitive Appraisal Analysis of Large Language Models
Title（参考訳）: 機械は感情的に考えるか? 大規模言語モデルの認知的評価分析
Authors: Sree Bhattacharyya, Lucas Craig, Tharun Dilliraj, Jia Li, James Z. Wang,
Abstract要約: 我々は,大規模言語モデルで暗黙的に使用される内的認知構造を評価するために,感情に対する認知推論に関する大規模ベンチマークを導入する。結果と分析結果から,LLMの多種多様な推論パターンが明らかとなった。
参考スコア（独自算出の注目度）: 13.341709038654198
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Affective Computing has been established as a crucial field of inquiry to advance the holistic development of Artificial Intelligence (AI) systems. Foundation models -- especially Large Language Models (LLMs) -- have been evaluated, trained, or instruction-tuned in several past works, to become better predictors or generators of emotion. Most of these studies, however, approach emotion-related tasks in a supervised manner, assessing or training the capabilities of LLMs using discrete emotion labels associated with stimuli (e.g., text, images, video, audio). Evaluation studies, in particular, have often been limited to standard and superficial emotion-related tasks, such as the recognition of evoked or expressed emotions. In this paper, we move beyond surface-level emotion tasks to investigate how LLMs reason about emotions through cognitive dimensions. Drawing from cognitive appraisal theory, we examine whether LLMs produce coherent and plausible cognitive reasoning when reasoning about emotionally charged stimuli. We introduce a large-scale benchmark on Cognitive Reasoning for Emotions - CoRE - to evaluate internal cognitive structures implicitly used by LLMs for emotional reasoning. Through a plethora of evaluation experiments and analysis, we seek to answer: (a) Are models more likely to implicitly rely on specific cognitive appraisal dimensions?, (b) What cognitive dimensions are important for characterizing specific emotions?, and, (c) Can the internal representations of different emotion categories in LLMs be interpreted through cognitive appraisal dimensions? Our results and analyses reveal diverse reasoning patterns across different LLMs. Our benchmark and code will be made publicly available.
Abstract（参考訳）: Affective Computingは、人工知能(AI)システムの総合的な開発を進めるための重要な調査分野として確立されている。基礎モデル -- 特にLarge Language Models (LLMs) -- は、より優れた予測子や感情のジェネレータとなるために、過去の作業で評価、訓練、あるいは指導された。しかしながら、これらの研究の多くは、刺激(例えば、テキスト、画像、ビデオ、音声)に関連付けられた個別の感情ラベルを用いて、LLMの能力を評価し、訓練することで、教師付き方法で感情関連タスクにアプローチする。特に評価研究は、しばしば、誘発された感情や表現された感情の認識のような、標準的な、表面的な感情に関連するタスクに限られている。本稿では,LLMが認知的次元を通じて感情をどう考えるかを検討するために,表面レベルの感情タスクを超えて行動する。認知評価理論から,LLMが感情に荷担した刺激を推論する際に,コヒーレントで妥当な認知的推論をもたらすかどうかを検討する。本研究では,情緒的推論にLLMが暗黙的に用いている内的認知構造を評価するために,感情に対する認知的推論(CoRE)に関する大規模ベンチマークを導入する。さまざまな評価実験と分析を通じて、私たちは答えを求めます。 (a)モデルは、特定の認知的評価次元に暗黙的に依存する傾向がありますか? ? b)特定の感情を特徴づけるのに、認知的次元が重要であるか。と (c)LLMにおける異なる感情カテゴリーの内部表現は認知的評価次元を通して解釈できるか? 結果と分析結果から,LLMの多種多様な推論パターンが明らかとなった。ベンチマークとコードは公開されます。

論文の概要: Do Machines Think Emotionally? Cognitive Appraisal Analysis of Large Language Models

関連論文リスト