Related papers: Dynamic Stress Detection: A Study of Temporal Progression Modelling of Stress in Speech

URL: http://arxiv.org/abs/2510.08586v1
Date: Thu, 02 Oct 2025 06:30:44 GMT
Title: Dynamic Stress Detection: A Study of Temporal Progression Modelling of Stress in Speech
Authors: Vishakha Lall, Yisi Liu,
Abstract summary: We model stress as a temporally evolving phenomenon influenced by historical emotional state. We propose a dynamic labelling strategy that derives fine-grained stress annotations from emotional labels. Our approach achieves notable accuracy gains on MuSE and StressID over existing baselines.
Score: 1.3320917259299652
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Detecting psychological stress from speech is critical in high-pressure settings. While prior work has leveraged acoustic features for stress detection, most treat stress as a static label. In this work, we model stress as a temporally evolving phenomenon influenced by historical emotional state. We propose a dynamic labelling strategy that derives fine-grained stress annotations from emotional labels and introduce cross-attention-based sequential models, a Unidirectional LSTM and a Transformer Encoder, to capture temporal stress progression. Our approach achieves notable accuracy gains on MuSE (+5%) and StressID (+18%) over existing baselines, and generalises well to a custom real-world dataset. These results highlight the value of modelling stress as a dynamic construct in speech.
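
The abstract does not spell out how stress labels are derived from emotion labels, so the following is only a minimal sketch of the general idea of history-dependent labelling. The emotion-to-stress mapping, the exponential moving average, the `decay` and `threshold` parameters, and the function name `dynamic_stress_labels` are all illustrative assumptions, not the authors' method.

```python
from typing import Dict, List

# Hypothetical mapping from an emotion label to an instantaneous stress
# contribution in [0, 1]. These values are assumptions for illustration
# only; they are not taken from the paper.
EMOTION_TO_STRESS: Dict[str, float] = {
    "calm": 0.0,
    "happy": 0.1,
    "sad": 0.6,
    "angry": 0.9,
    "fearful": 1.0,
}


def dynamic_stress_labels(
    emotions: List[str], decay: float = 0.5, threshold: float = 0.5
) -> List[int]:
    """Return one binary stress label per speech segment.

    The running score is an exponential moving average, so each label
    depends on the current emotion and on the emotional history.
    """
    if not 0.0 <= decay < 1.0:
        raise ValueError("decay must be in [0, 1)")
    labels: List[int] = []
    score = 0.0  # assumed neutral starting state
    for emotion in emotions:
        current = EMOTION_TO_STRESS[emotion]
        # Blend the carried-over history with the current segment.
        score = decay * score + (1.0 - decay) * current
        labels.append(1 if score >= threshold else 0)
    return labels
```

Under these assumptions, `dynamic_stress_labels(["angry", "angry", "calm", "calm"], decay=0.5, threshold=0.3)` returns `[1, 1, 1, 0]`: the first calm segment is still labelled as stressed because of the preceding angry segments, which is the kind of carry-over a static per-segment label cannot express.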

Related papers

Retrieval Heads are Dynamic [101.60087217027949]
Recent studies have identified "retrieval heads" in Large Language Models (LLMs). In this paper, we investigate retrieval heads from a dynamic perspective.
arXiv Detail & Related papers (2026-01-07T02:29:24Z)
DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection [54.209716321122194]
We present DepFlow, a depression-conditioned text-to-speech framework. A Depression Acoustic Camouflage learns speaker- and content-invariant depression embeddings through adversarial training. A flow-matching TTS model with FiLM modulation injects these embeddings into synthesis, enabling control over depressive severity. A prototype-based severity mapping mechanism provides smooth and interpretable manipulation across the depression continuum.
arXiv Detail & Related papers (2026-01-01T10:44:38Z)
StressTest: Can YOUR Speech LM Handle the Stress? [30.973919141559644]
Sentence stress refers to emphasis on words within a spoken utterance to highlight or contrast an idea. We introduce StressTest, a benchmark designed to evaluate models' ability to distinguish between meanings of speech based on the stress pattern. We propose a novel data generation pipeline, and create Stress-17k, a training set that simulates change of meaning implied by stress variation.
arXiv Detail & Related papers (2025-05-28T18:32:56Z)
Dynamic Manipulation of Deformable Objects in 3D: Simulation, Benchmark and Learning Strategy [88.8665000676562]
Prior methods often simplify the problem to low-speed or 2D settings, limiting their applicability to real-world 3D tasks. To mitigate data scarcity, we introduce a novel simulation framework and benchmark grounded in reduced-order dynamics. We propose Dynamics Informed Diffusion Policy (DIDP), a framework that integrates imitation pretraining with physics-informed test-time adaptation.
arXiv Detail & Related papers (2025-05-23T03:28:25Z)
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor Estimation [20.284960134507543]
This study introduces a new task aimed at estimating more specific stressors through users' posts on social media. We propose a novel meta-learning based stressor estimation framework that is enhanced by a meta-knowledge inheritance mechanism. We construct a social media-based stressor estimation dataset that can help train artificial intelligence models to facilitate human well-being.
arXiv Detail & Related papers (2025-05-03T18:12:36Z)
Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior [118.92747171905727]
This paper introduces a novel frequency-based trigger injection model for launching backdoor attacks with multiple triggers on learned image compression models. We design attack objectives tailored to diverse scenarios, including: 1) degrading compression quality in terms of bit-rate and reconstruction accuracy; 2) targeting task-driven measures like face recognition and semantic segmentation. Experiments show that our trigger injection models, combined with minor modifications to encoder parameters, successfully inject multiple backdoors and their triggers into a single compression model.
arXiv Detail & Related papers (2024-12-02T15:58:40Z)
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation [59.81482518924723]
We propose a method for capturing and generating subtle shifts for talking-head generation. We develop a talking-head framework that is capable of generating a variety of emotions with precise control over intensity levels. Experiments and analyses validate the effectiveness of our proposed method.
arXiv Detail & Related papers (2024-09-29T01:02:01Z)
Stressor Type Matters! -- Exploring Factors Influencing Cross-Dataset Generalizability of Physiological Stress Detection [5.304745246313982]
This study explores the generalizability of machine learning models trained on HRV features for binary stress detection. Our findings reveal a crucial factor affecting model generalizability: stressor type. We recommend matching the stressor type when deploying HRV-based stress models in new environments.
arXiv Detail & Related papers (2024-05-06T14:47:48Z)
Investigating the Generalizability of Physiological Characteristics of Anxiety [3.4036712573981607]
We evaluate the generalizability of physiological features that have been shown to be correlated with anxiety and stress to high-arousal emotions. This work is the first cross-corpus evaluation across stress and arousal from ECG and EDA signals, contributing new findings about the generalizability of stress detection.
arXiv Detail & Related papers (2024-01-23T16:49:54Z)
Personalization of Stress Mobile Sensing using Self-Supervised Learning [1.7598252755538808]
Stress is widely recognized as a major contributor to a variety of health issues. Real-time stress prediction can enable digital interventions to immediately react at the onset of stress, helping to avoid many psychological and physiological symptoms such as heart rhythm irregularities. However, major challenges with the prediction of stress using machine learning include the subjectivity and sparseness of the labels, a large feature space, relatively few labels, and a complex nonlinear and subjective relationship between the features and outcomes.
arXiv Detail & Related papers (2023-08-04T22:26:33Z)
Personalized Prediction of Recurrent Stress Events Using Self-Supervised Learning on Multimodal Time-Series Data [1.7598252755538808]
We develop a multimodal personalized stress prediction system using wearable biosignal data. We employ self-supervised learning to pre-train the models on each subject's data. Results suggest that our approach can personalize stress prediction to each user with minimal annotations.
arXiv Detail & Related papers (2023-07-07T00:44:06Z)
Stabilizing Transformer Training by Preventing Attention Entropy Collapse [56.45313891694746]
We investigate the training dynamics of Transformers by examining the evolution of the attention layers. We show that $\sigma$Reparam successfully prevents entropy collapse in the attention layers, promoting more stable training. We conduct experiments with $\sigma$Reparam on image classification, image self-supervised learning, machine translation, speech recognition, and language modeling tasks.
arXiv Detail & Related papers (2023-03-11T03:30:47Z)
Adaptive Feature Alignment for Adversarial Training [56.17654691470554]
CNNs are typically vulnerable to adversarial attacks, which pose a threat to security-sensitive applications. We propose the adaptive feature alignment (AFA) to generate features of arbitrary attacking strengths. Our method is trained to automatically align features of arbitrary attacking strength.
arXiv Detail & Related papers (2021-05-31T17:01:05Z)

This list is automatically generated from the titles and abstracts of the papers on this site.