GPT-4 Emulates Average-Human Emotional Cognition from a Third-Person   Perspective
        - URL: http://arxiv.org/abs/2408.13718v1
- Date: Sun, 11 Aug 2024 01:22:09 GMT
- Title: GPT-4 Emulates Average-Human Emotional Cognition from a Third-Person   Perspective
- Authors: Ala N. Tak, Jonathan Gratch, 
- Abstract summary: We first look at carefully crafted emotion-evoking stimuli, originally designed to find patterns of brain neural activity.
We show that GPT-4 is especially accurate in reasoning about such stimuli.
We find that GPT-4's interpretations align more closely with human judgments about the emotions of others than with self-assessments.
- Score: 1.642094639107215
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   This paper extends recent investigations on the emotional reasoning abilities of Large Language Models (LLMs). Current research on LLMs has not directly evaluated the distinction between how LLMs predict the self-attribution of emotions and the perception of others' emotions. We first look at carefully crafted emotion-evoking stimuli, originally designed to find patterns of brain neural activity representing fine-grained inferred emotional attributions of others. We show that GPT-4 is especially accurate in reasoning about such stimuli. This suggests LLMs agree with humans' attributions of others' emotions in stereotypical scenarios remarkably more than self-attributions of emotions in idiosyncratic situations. To further explore this, our second study utilizes a dataset containing annotations from both the author and a third-person perspective. We find that GPT-4's interpretations align more closely with human judgments about the emotions of others than with self-assessments. Notably, conventional computational models of emotion primarily rely on self-reported ground truth as the gold standard. However, an average observer's standpoint, which LLMs appear to have adopted, might be more relevant for many downstream applications, at least in the absence of individual information and adequate safety considerations. 
 
      
        Related papers
        - Beyond Context to Cognitive Appraisal: Emotion Reasoning as a Theory of   Mind Benchmark for Large Language Models [11.255011967393838]
 This study advances beyond surface-level perceptual features to investigate how large language models (LLMs) reason about others' emotional states using contextual information.<n>Grounded in Cognitive Appraisal Theory, we curate a specialized ToM evaluation dataset1 to assess both forward reasoning - from context to emotion- and backward reasoning - from emotion to inferred context.
 arXiv  Detail & Related papers  (2025-05-31T01:18:04Z)
- EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large   Language Models [17.710835703681873]
 We introduce EmotionHallucer, the first benchmark for detecting and analyzing emotion hallucinations in MLLMs.<n>Building on this, we assess emotion hallucinations from two dimensions: emotion psychology knowledge and real-world multimodal perception.<n>We propose the PEP-MEK framework, which yields an average improvement of 9.90% in emotion hallucination detection across selected models.
 arXiv  Detail & Related papers  (2025-05-16T16:14:08Z)
- Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in   Large Language Models [75.85319609088354]
 Sentient Agent as a Judge (SAGE) is an evaluation framework for large language models.<n>SAGE instantiates a Sentient Agent that simulates human-like emotional changes and inner thoughts during interaction.<n>SAGE provides a principled, scalable and interpretable tool for tracking progress toward genuinely empathetic and socially adept language agents.
 arXiv  Detail & Related papers  (2025-05-01T19:06:10Z)
- AI with Emotions: Exploring Emotional Expressions in Large Language   Models [0.0]
 Large Language Models (LLMs) play role-play as agents answering questions with specified emotional states.
Russell's Circumplex model characterizes emotions along the sleepy-activated (arousal) and pleasure-displeasure (valence) axes.
 evaluation showed that the emotional states of the generated answers were consistent with the specifications.
 arXiv  Detail & Related papers  (2025-04-20T18:49:25Z)
- Rethinking Emotion Annotations in the Era of Large Language Models [8.701939656132973]
 We analyze the complexities of emotion annotation in the context of Large Language Models (LLMs)
In our experiments, GPT-4 achieves high ratings in a human evaluation study, painting a more positive picture than previous work.
To harness GPT-4's strength while preserving human perspective, we explore two ways of integrating GPT-4 into emotion annotation pipelines.
 arXiv  Detail & Related papers  (2024-12-10T20:30:51Z)
- AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language   Models [18.482881562645264]
 This study is the first to explore the potential of Large Language Models (LLMs) in recognizing ambiguous emotions.
We design zero-shot and few-shot prompting and incorporate past dialogue as context information for ambiguous emotion recognition.
 arXiv  Detail & Related papers  (2024-09-26T23:25:21Z)
- Think out Loud: Emotion Deducing Explanation in Dialogues [57.90554323226896]
 We propose a new task "Emotion Deducing Explanation in Dialogues" (EDEN)
EDEN recognizes emotion and causes in an explicitly thinking way.
It can help Large Language Models (LLMs) achieve better recognition of emotions and causes.
 arXiv  Detail & Related papers  (2024-06-07T08:58:29Z)
- Enhancing Emotional Generation Capability of Large Language Models via   Emotional Chain-of-Thought [50.13429055093534]
 Large Language Models (LLMs) have shown remarkable performance in various emotion recognition tasks.
We propose the Emotional Chain-of-Thought (ECoT) to enhance the performance of LLMs on various emotional generation tasks.
 arXiv  Detail & Related papers  (2024-01-12T16:42:10Z)
- Language Models (Mostly) Do Not Consider Emotion Triggers When   Predicting Emotion [87.18073195745914]
 We investigate how well human-annotated emotion triggers correlate with features deemed salient in their prediction of emotions.
Using EmoTrigger, we evaluate the ability of large language models to identify emotion triggers.
Our analysis reveals that emotion triggers are largely not considered salient features for emotion prediction models, instead there is intricate interplay between various features and the task of emotion detection.
 arXiv  Detail & Related papers  (2023-11-16T06:20:13Z)
- What's Next in Affective Modeling? Large Language Models [3.0902630634005797]
 GPT-4 performs well across multiple emotion tasks.
It can distinguish emotion theories and come up with emotional stories.
We suggest that LLMs could play an important role in affective modeling.
 arXiv  Detail & Related papers  (2023-10-03T16:39:20Z)
- Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using   EmotionBench [83.41621219298489]
 We evaluate Large Language Models' (LLMs) anthropomorphic capabilities using the emotion appraisal theory from psychology.
We collect a dataset containing over 400 situations that have proven effective in eliciting the eight emotions central to our study.
We conduct a human evaluation involving more than 1,200 subjects worldwide.
 arXiv  Detail & Related papers  (2023-08-07T15:18:30Z)
- Emotional Intelligence of Large Language Models [9.834823298632374]
 Large Language Models (LLMs) have demonstrated remarkable abilities across numerous disciplines.
However, their alignment with human emotions and values, which is critical for real-world applications, has not been systematically evaluated.
Here, we assessed LLMs' Emotional Intelligence (EI), encompassing emotion recognition, interpretation, and understanding.
 arXiv  Detail & Related papers  (2023-07-18T07:49:38Z)
- Large Language Models Understand and Can be Enhanced by Emotional
  Stimuli [53.53886609012119]
 We take the first step towards exploring the ability of Large Language Models to understand emotional stimuli.
Our experiments show that LLMs have a grasp of emotional intelligence, and their performance can be improved with emotional prompts.
Our human study results demonstrate that EmotionPrompt significantly boosts the performance of generative tasks.
 arXiv  Detail & Related papers  (2023-07-14T00:57:12Z)
- Human-Like Intuitive Behavior and Reasoning Biases Emerged in Language
  Models -- and Disappeared in GPT-4 [0.0]
 We show that large language models (LLMs) exhibit behavior that resembles human-like intuition.
We also probe how sturdy the inclination for intuitive-like decision-making is.
 arXiv  Detail & Related papers  (2023-06-13T08:43:13Z)
- A Circular-Structured Representation for Visual Emotion Distribution
  Learning [82.89776298753661]
 We propose a well-grounded circular-structured representation to utilize the prior knowledge for visual emotion distribution learning.
To be specific, we first construct an Emotion Circle to unify any emotional state within it.
On the proposed Emotion Circle, each emotion distribution is represented with an emotion vector, which is defined with three attributes.
 arXiv  Detail & Related papers  (2021-06-23T14:53:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.