Fugu-MT 論文翻訳(概要): Emotion Recognition from the perspective of Activity Recognition

論文の概要: Emotion Recognition from the perspective of Activity Recognition

arxiv url: http://arxiv.org/abs/2403.16263v1
Date: Sun, 24 Mar 2024 18:53:57 GMT
ステータス: 翻訳完了
システム内更新日: 2024-03-26 16:46:40.340390
Title: Emotion Recognition from the perspective of Activity Recognition
Title（参考訳）: 活動認識の観点からの感情認識
Authors: Savinay Nagendra, Prapti Panigrahi,
Abstract要約: 人間の感情状態、行動、反応を現実世界の環境に適応させることは、潜伏した連続した次元を用いて達成できる。感情認識システムが現実のモバイルおよびコンピューティングデバイスにデプロイされ統合されるためには、世界中の収集されたデータを考慮する必要がある。本稿では,注目機構を備えた新しい3ストリームエンドツーエンドのディープラーニング回帰パイプラインを提案する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Applications of an efficient emotion recognition system can be found in several domains such as medicine, driver fatigue surveillance, social robotics, and human-computer interaction. Appraising human emotional states, behaviors, and reactions displayed in real-world settings can be accomplished using latent continuous dimensions. Continuous dimensional models of human affect, such as those based on valence and arousal are more accurate in describing a broad range of spontaneous everyday emotions than more traditional models of discrete stereotypical emotion categories (e.g. happiness, surprise). Most of the prior work on estimating valence and arousal considers laboratory settings and acted data. But, for emotion recognition systems to be deployed and integrated into real-world mobile and computing devices, we need to consider data collected in the world. Action recognition is a domain of Computer Vision that involves capturing complementary information on appearance from still frames and motion between frames. In this paper, we treat emotion recognition from the perspective of action recognition by exploring the application of deep learning architectures specifically designed for action recognition, for continuous affect recognition. We propose a novel three-stream end-to-end deep learning regression pipeline with an attention mechanism, which is an ensemble design based on sub-modules of multiple state-of-the-art action recognition systems. The pipeline constitutes a novel data pre-processing approach with a spatial self-attention mechanism to extract keyframes. The optical flow of high-attention regions of the face is extracted to capture temporal context. AFEW-VA in-the-wild dataset has been used to conduct comparative experiments. Quantitative analysis shows that the proposed model outperforms multiple standard baselines of both emotion recognition and action recognition models.
Abstract（参考訳）: 効率的な感情認識システムの応用は、医療、運転者の疲労監視、社会ロボティクス、人間とコンピュータの相互作用など、いくつかの領域で見られる。人間の感情状態、行動、反応を現実世界の環境に適応させることは、潜伏した連続した次元を用いて達成できる。原子価や覚醒に基づく人間の感情の連続的な次元モデルは、離散的なステレオタイプ感情カテゴリー(例えば、幸福、驚き)の伝統的なモデルよりも、広範囲の自然の感情を記述する上でより正確である。精度と覚醒を推定する以前の研究のほとんどは、実験室の設定を考慮し、データを処理した。しかし、感情認識システムが現実世界のモバイルおよびコンピューティングデバイスにデプロイされ、統合されるためには、世界中に収集されたデータを考慮する必要がある。アクション認識はコンピュータビジョンの領域であり、静止フレームからの外観とフレーム間の動きの相補的な情報をキャプチャする。本稿では,行動認識に特化して設計された深層学習アーキテクチャを探索し,行動認識の観点から感情認識を扱う。本稿では,複数の動作認識システムのサブモジュールをベースとしたアンサンブル設計であるアテンション機構を備えた,新しい3ストリームエンドツーエンドのディープラーニング回帰パイプラインを提案する。パイプラインは、キーフレームを抽出する空間的自己アテンション機構を備えた、新しいデータ前処理アプローチを構成する。顔の高アテンション領域の光学的流れを抽出し、時間的文脈を捉える。 AFEW-VA in-the-wildデータセットは比較実験に使われている。定量的分析により,提案モデルが感情認識モデルと行動認識モデルの両方の標準ベースラインより優れていることが示された。

関連論文リスト

Multi-modal Mood Reader: Pre-trained Model Empowers Cross-Subject Emotion Recognition [23.505616142198487]
我々は、クロスオブジェクト感情認識のための訓練済みモデルに基づくMultimodal Mood Readerを開発した。このモデルは、大規模データセットの事前学習を通じて、脳波信号の普遍的な潜在表現を学習する。公開データセットに関する大規模な実験は、クロスオブジェクト感情認識タスクにおけるMood Readerの優れたパフォーマンスを示している。
論文参考訳（メタデータ） (2024-05-28T14:31:11Z)
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models [53.31917090073727]
本稿では,音声とテキストのモダリティから,伝達学習モデルと微調整モデルとを融合したニューラルネットワークによる感情認識フレームワークを提案する。本稿では,対話型感情的モーションキャプチャー・データセットにおけるマルチモーダル・アプローチの有効性を評価する。
論文参考訳（メタデータ） (2022-02-16T00:23:42Z)
Affect Analysis in-the-wild: Valence-Arousal, Expressions, Action Units and a Unified Framework [83.21732533130846]
Aff-Wild と Aff-Wild2 の2つである。これは、これらのデータベースで訓練された深層ニューラルネットワークの2つのクラスの設計を示す。インパクト認識を共同で学び、効果的に一般化し、実行することができる新しいマルチタスクおよび全体主義のフレームワークが提示されます。
論文参考訳（メタデータ） (2021-03-29T17:36:20Z)
Cognitive architecture aided by working-memory for self-supervised multi-modal humans recognition [54.749127627191655]
人間パートナーを認識する能力は、パーソナライズされた長期的な人間とロボットの相互作用を構築するための重要な社会的スキルです。ディープラーニングネットワークは最先端の結果を達成し,そのような課題に対処するための適切なツールであることが実証された。 1つの解決策は、ロボットに自己スーパービジョンで直接の感覚データから学習させることである。
論文参考訳（メタデータ） (2021-03-16T13:50:24Z)
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks [82.54695985117783]
In-theld でキャプチャした長いビデオシーケンスを用いて,持続的な感情認識のための最先端のディープラーニングアーキテクチャの適合性を検討する。我々は,2D-CNNと長期記憶ユニットを組み合わせた畳み込みリカレントニューラルネットワークと,2D-CNNモデルの微調整時の重みを膨らませて構築した膨らませた3D-CNNモデルを開発した。
論文参考訳（メタデータ） (2020-11-18T13:42:05Z)
Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition [131.6328804788164]
本稿では,視覚・センサ・モダリティ(動画)における行動認識を強化するためのフレームワーク,Semantics-Aware Adaptive Knowledge Distillation Networks (SAKDN)を提案する。 SAKDNは複数のウェアラブルセンサーを教師のモダリティとして使用し、RGB動画を学生のモダリティとして使用している。
論文参考訳（メタデータ） (2020-09-01T03:38:31Z)
Temporal aggregation of audio-visual modalities for emotion recognition [0.5352699766206808]
本研究では,時間的オフセットの異なる時間的オフセットと時間的ウィンドウからの音声・視覚的モダリティを組み合わせた感情認識のためのマルチモーダル融合手法を提案する。提案手法は,文献と人間の精度評価から,他の手法よりも優れている。
論文参考訳（メタデータ） (2020-07-08T18:44:15Z)
Attention-Oriented Action Recognition for Real-Time Human-Robot Interaction [11.285529781751984]
本稿では,リアルタイムインタラクションの必要性に応えるために,アテンション指向のマルチレベルネットワークフレームワークを提案する。具体的には、プレアテンションネットワークを使用して、低解像度でシーン内のインタラクションに大まかにフォーカスする。他のコンパクトCNNは、抽出されたスケルトンシーケンスをアクション認識用の入力として受信する。
論文参考訳（メタデータ） (2020-07-02T12:41:28Z)
Continuous Emotion Recognition via Deep Convolutional Autoencoder and Support Vector Regressor [70.2226417364135]
マシンはユーザの感情状態を高い精度で認識できることが不可欠である。ディープニューラルネットワークは感情を認識する上で大きな成功を収めている。表情認識に基づく連続的感情認識のための新しいモデルを提案する。
論文参考訳（メタデータ） (2020-01-31T17:47:16Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。