Hybrid Curriculum Learning for Emotion Recognition in Conversation
- URL: http://arxiv.org/abs/2112.11718v1
- Date: Wed, 22 Dec 2021 08:02:58 GMT
- Title: Hybrid Curriculum Learning for Emotion Recognition in Conversation
- Authors: Lin Yang, Yi Shen, Yue Mao, Longjun Cai
- Abstract summary: Our framework consists of two curricula: (1) conversation-level curriculum (CC); and (2) utterance-level curriculum (UC)
With the proposed model-agnostic hybrid curriculum learning strategy, we observe significant performance boosts over a wide range of existing ERC models.
- Score: 10.912215835115063
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Emotion recognition in conversation (ERC) aims to detect the emotion label
for each utterance. Motivated by recent studies which have proven that feeding
training examples in a meaningful order rather than considering them randomly
can boost the performance of models, we propose an ERC-oriented hybrid
curriculum learning framework. Our framework consists of two curricula: (1)
conversation-level curriculum (CC); and (2) utterance-level curriculum (UC). In
CC, we construct a difficulty measurer based on "emotion shift" frequency
within a conversation, then the conversations are scheduled in an "easy to
hard" schema according to the difficulty score returned by the difficulty
measurer. For UC, it is implemented from an emotion-similarity perspective,
which progressively strengthens the model's ability in identifying the
confusing emotions. With the proposed model-agnostic hybrid curriculum learning
strategy, we observe significant performance boosts over a wide range of
existing ERC models and we are able to achieve new state-of-the-art results on
four public ERC datasets.
Related papers
- Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer [78.35816158511523]
We present a single-stage emotion recognition approach, employing a Decoupled Subject-Context Transformer (DSCT) for simultaneous subject localization and emotion classification.
We evaluate our single-stage framework on two widely used context-aware emotion recognition datasets, CAER-S and EMOTIC.
arXiv Detail & Related papers (2024-04-26T07:30:32Z) - Context-Aware Siamese Networks for Efficient Emotion Recognition in Conversation [1.1095648823126325]
We propose a way to model the conversational context that we incorporate into a metric learning training strategy.
Using metric learning through a Siamese Network architecture, we achieve 57.71 in macro F1 score for emotion classification in conversation.
arXiv Detail & Related papers (2024-04-17T07:36:40Z) - Watch the Speakers: A Hybrid Continuous Attribution Network for Emotion
Recognition in Conversation With Emotion Disentanglement [8.17164107060944]
Emotion Recognition in Conversation (ERC) has attracted widespread attention in the natural language processing field.
Existing ERC methods face challenges in achieving generalization to diverse scenarios due to insufficient modeling of context.
We present a Hybrid Continuous Attributive Network (HCAN) to address these issues in the perspective of emotional continuation and emotional attribution.
arXiv Detail & Related papers (2023-09-18T14:18:16Z) - Facilitating Multi-turn Emotional Support Conversation with Positive
Emotion Elicitation: A Reinforcement Learning Approach [58.88422314998018]
Emotional support conversation (ESC) aims to provide emotional support (ES) to improve one's mental state.
Existing works stay at fitting grounded responses and responding strategies which ignore the effect on ES and lack explicit goals to guide emotional positive transition.
We introduce a new paradigm to formalize multi-turn ESC as a process of positive emotion elicitation.
arXiv Detail & Related papers (2023-07-16T09:58:44Z) - EmotionIC: emotional inertia and contagion-driven dependency modeling for emotion recognition in conversation [34.24557248359872]
We propose an emotional inertia and contagion-driven dependency modeling approach (EmotionIC) for ERC task.
Our EmotionIC consists of three main components, i.e., Identity Masked Multi-Head Attention (IMMHA), Dialogue-based Gated Recurrent Unit (DiaGRU) and Skip-chain Conditional Random Field (SkipCRF)
Experimental results show that our method can significantly outperform the state-of-the-art models on four benchmark datasets.
arXiv Detail & Related papers (2023-03-20T13:58:35Z) - A Hierarchical Regression Chain Framework for Affective Vocal Burst
Recognition [72.36055502078193]
We propose a hierarchical framework, based on chain regression models, for affective recognition from vocal bursts.
To address the challenge of data sparsity, we also use self-supervised learning (SSL) representations with layer-wise and temporal aggregation modules.
The proposed systems participated in the ACII Affective Vocal Burst (A-VB) Challenge 2022 and ranked first in the "TWO'' and "CULTURE" tasks.
arXiv Detail & Related papers (2023-03-14T16:08:45Z) - Supervised Prototypical Contrastive Learning for Emotion Recognition in
Conversation [25.108385802645163]
We propose a Supervised Prototypical Contrastive Learning (SPCL) loss for the emotion recognition task.
We design a difficulty measure function based on the distance between classes and introduce curriculum learning to alleviate the impact of extreme samples.
We achieve state-of-the-art results on three widely used benchmarks.
arXiv Detail & Related papers (2022-10-17T03:08:23Z) - MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal
Emotion Recognition [118.73025093045652]
We propose a pre-training model textbfMEmoBERT for multimodal emotion recognition.
Unlike the conventional "pre-train, finetune" paradigm, we propose a prompt-based method that reformulates the downstream emotion classification task as a masked text prediction.
Our proposed MEmoBERT significantly enhances emotion recognition performance.
arXiv Detail & Related papers (2021-10-27T09:57:00Z) - Reinforcement Learning for Emotional Text-to-Speech Synthesis with
Improved Emotion Discriminability [82.39099867188547]
Emotional text-to-speech synthesis (ETTS) has seen much progress in recent years.
We propose a new interactive training paradigm for ETTS, denoted as i-ETTS.
We formulate an iterative training strategy with reinforcement learning to ensure the quality of i-ETTS optimization.
arXiv Detail & Related papers (2021-04-03T13:52:47Z) - Dialogue Response Selection with Hierarchical Curriculum Learning [52.3318584971562]
We study the learning of a matching model for dialogue response selection.
Motivated by the recent finding that random negatives are often too trivial to train a reliable model, we propose a hierarchical curriculum learning framework.
arXiv Detail & Related papers (2020-12-29T14:06:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.