Affective Expression Analysis in-the-wild using Multi-Task Temporal
Statistical Deep Learning Model
- URL: http://arxiv.org/abs/2002.09120v3
- Date: Thu, 5 Mar 2020 08:23:46 GMT
- Title: Affective Expression Analysis in-the-wild using Multi-Task Temporal
Statistical Deep Learning Model
- Authors: Nhu-Tai Do, Tram-Tran Nguyen-Quynh and Soo-Hyung Kim
- Abstract summary: We present an affective expression analysis model that deals with the above challenges.
We experimented on Aff-Wild2 dataset, a large-scale dataset for ABAW Challenge.
- Score: 6.024865915538501
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Affective behavior analysis plays an important role in human-computer
interaction, customer marketing, health monitoring. ABAW Challenge and
Aff-Wild2 dataset raise the new challenge for classifying basic emotions and
regression valence-arousal value under in-the-wild environments. In this paper,
we present an affective expression analysis model that deals with the above
challenges. Our approach includes STAT and Temporal Module for fine-tuning
again face feature model. We experimented on Aff-Wild2 dataset, a large-scale
dataset for ABAW Challenge with the annotations for both the categorical and
valence-arousal emotion. We achieved the expression score 0.543 and
valence-arousal score 0.534 on the validation set.
Related papers
- LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content [62.816876067499415]
We propose LiveXiv: a scalable evolving live benchmark based on scientific ArXiv papers.
LiveXiv accesses domain-specific manuscripts at any given timestamp and proposes to automatically generate visual question-answer pairs.
We benchmark multiple open and proprietary Large Multi-modal Models (LMMs) on the first version of our benchmark, showing its challenging nature and exposing the models true abilities.
arXiv Detail & Related papers (2024-10-14T17:51:23Z) - Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction [54.23208041792073]
Aspect Sentiment Quad Prediction (ASQP) aims to predict all quads (aspect term, aspect category, opinion term, sentiment polarity) for a given review.
A key challenge in the ASQP task is the scarcity of labeled data, which limits the performance of existing methods.
We propose a self-training framework with a pseudo-label scorer, wherein a scorer assesses the match between reviews and their pseudo-labels.
arXiv Detail & Related papers (2024-06-26T05:30:21Z) - VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models [57.43276586087863]
Large Vision-Language Models (LVLMs) suffer from hallucination issues, wherein the models generate plausible-sounding but factually incorrect outputs.
Existing benchmarks are often limited in scope, focusing mainly on object hallucinations.
We introduce a multi-dimensional benchmark covering objects, attributes, and relations, with challenging images selected based on associative biases.
arXiv Detail & Related papers (2024-04-22T04:49:22Z) - DPP-Based Adversarial Prompt Searching for Lanugage Models [56.73828162194457]
Auto-regressive Selective Replacement Ascent (ASRA) is a discrete optimization algorithm that selects prompts based on both quality and similarity with determinantal point process (DPP)
Experimental results on six different pre-trained language models demonstrate the efficacy of ASRA for eliciting toxic content.
arXiv Detail & Related papers (2024-03-01T05:28:06Z) - Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach [61.04606493712002]
Susceptibility to misinformation describes the degree of belief in unverifiable claims that is not observable.
Existing susceptibility studies heavily rely on self-reported beliefs.
We propose a computational approach to model users' latent susceptibility levels.
arXiv Detail & Related papers (2023-11-16T07:22:56Z) - Spatial-temporal Transformer for Affective Behavior Analysis [11.10521339384583]
We propose a Transformer with Multi-Head Attention framework to learn the distribution of both the spatial and temporal features.
The results fully demonstrate the effectiveness of our proposed model based on the Aff-Wild2 dataset.
arXiv Detail & Related papers (2023-03-19T04:34:17Z) - Multimodal Feature Extraction and Fusion for Emotional Reaction
Intensity Estimation and Expression Classification in Videos with
Transformers [47.16005553291036]
We present our solutions to the two sub-challenges of Affective Behavior Analysis in the wild (ABAW) 2023.
For the Expression Classification Challenge, we propose a streamlined approach that handles the challenges of classification effectively.
By studying, analyzing, and combining these features, we significantly enhance the model's accuracy for sentiment prediction in a multimodal context.
arXiv Detail & Related papers (2023-03-16T09:03:17Z) - Facial Affective Behavior Analysis Method for 5th ABAW Competition [20.54725479855494]
5th ABAW competition includes three challenges from Aff-Wild2 database.
We construct three different models to solve the corresponding problems to improve the results.
For the experiments of three challenges, we train the models on the provided training data and validate the models on the validation data.
arXiv Detail & Related papers (2023-03-16T08:21:10Z) - Technical Report for Valence-Arousal Estimation in ABAW2 Challenge [20.90072006477564]
We tackle the valence-arousal estimation challenge from ABAW2 ICCV-2021 Competition.
The competition organizers provide an in-the-wild Aff-Wild2 dataset for participants to analyze affective behavior in real-life settings.
We use a two stream model to learn emotion features from appearance and action respectively.
arXiv Detail & Related papers (2021-07-08T15:21:38Z) - Technical Report for Valence-Arousal Estimation on Affwild2 Dataset [0.0]
We tackle the valence-arousal estimation challenge from ABAW FG-2020 Competition.
We use MIMAMO Net citedeng 2020mimamo model to achieve information about micro-motion and macro-motion.
arXiv Detail & Related papers (2021-05-04T14:00:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.