Motion-Robust Multimodal Fusion of PPG and Accelerometer Signals for Three-Class Heart Rhythm Classification
- URL: http://arxiv.org/abs/2511.00949v1
- Date: Sun, 02 Nov 2025 14:21:55 GMT
- Title: Motion-Robust Multimodal Fusion of PPG and Accelerometer Signals for Three-Class Heart Rhythm Classification
- Authors: Yangyang Zhao, Matti Kaisti, Olli Lahdenoja, Tero Koivisto,
- Abstract summary: Atrial fibrillation (AF) is a leading cause of stroke and mortality, particularly in elderly patients.<n>We introduce RhythmiNet, a residual neural network enhanced with temporal and channel attention modules.<n>The model performs three-class rhythm classification: AF, sinus rhythm (SR) and Other.
- Score: 5.573626931805866
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Atrial fibrillation (AF) is a leading cause of stroke and mortality, particularly in elderly patients. Wrist-worn photoplethysmography (PPG) enables non-invasive, continuous rhythm monitoring, yet suffers from significant vulnerability to motion artifacts and physiological noise. Many existing approaches rely solely on single-channel PPG and are limited to binary AF detection, often failing to capture the broader range of arrhythmias encountered in clinical settings. We introduce RhythmiNet, a residual neural network enhanced with temporal and channel attention modules that jointly leverage PPG and accelerometer (ACC) signals. The model performs three-class rhythm classification: AF, sinus rhythm (SR), and Other. To assess robustness across varying movement conditions, test data are stratified by accelerometer-based motion intensity percentiles without excluding any segments. RhythmiNet achieved a 4.3% improvement in macro-AUC over the PPG-only baseline. In addition, performance surpassed a logistic regression model based on handcrafted HRV features by 12%, highlighting the benefit of multimodal fusion and attention-based learning in noisy, real-world clinical data.
Related papers
- FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRI [44.4791295950757]
We develop an unsupervised anomaly detection (UAD) approach for brain MRI.<n>We conduct the first systematic frequency-domain analysis of pathological signatures.<n>We show that Frequency-Decomposition Preprocessing (FDP) framework can leverage frequency-domain reconstruction for simultaneous pathology suppression and anatomical preservation.
arXiv Detail & Related papers (2025-11-17T02:40:14Z) - Detection of Autonomic Dysreflexia in Individuals With Spinal Cord Injury Using Multimodal Wearable Sensors [2.208475400165877]
Autonomic Dysreflexia (AD) is a potentially life-threatening condition characterized by sudden, severe blood pressure spikes in individuals with spinal cord injury (SCI)<n>This study presents a non-invasive, explainable machine learning framework for detecting AD using multimodal wearable sensors.
arXiv Detail & Related papers (2025-07-23T21:18:23Z) - Masked Autoencoders that Feel the Heart: Unveiling Simplicity Bias for ECG Analyses [24.039917512972977]
We show that ECG models tend to overfit dominant and repetitive patterns, overlooking fine-grained but clinically critical cues, known as Simplicity Bias (SB)<n>We propose a novel method comprising two key components: 1) Temporal-frequency aware Filters to capture temporal-frequency features reflecting the dynamic characteristics of ECG signals, and 2) building on this, Multi-Grained Prototype Reconstruction for coarse and fine representation learning across dual domains, further mitigating SB.
arXiv Detail & Related papers (2025-06-25T03:25:49Z) - Recovering Pulse Waves from Video Using Deep Unrolling and Deep Equilibrium Models [45.94962431110573]
Camera-based monitoring of vital signs, also known as imaging photoplethysmography (i), has seen applications in driver-monitoring, affective computing, and more.<n>We introduce methods that combine signal processing and deep learning methods in an inverse problem.
arXiv Detail & Related papers (2025-03-21T16:11:21Z) - Self-supervised inter-intra period-aware ECG representation learning for detecting atrial fibrillation [41.82319894067087]
We propose an inter-intra period-aware ECG representation learning approach.
Considering ECGs of atrial fibrillation patients exhibit the irregularity in RR intervals and the absence of P-waves, we develop specific pre-training tasks for interperiod and intraperiod representations.
Our approach demonstrates remarkable AUC performances on the BTCH dataset, textiti.e., 0.953/0.996 for paroxysmal/persistent atrial fibrillation detection.
arXiv Detail & Related papers (2024-10-08T10:03:52Z) - SQUWA: Signal Quality Aware DNN Architecture for Enhanced Accuracy in Atrial Fibrillation Detection from Noisy PPG Signals [37.788535094404644]
Atrial fibrillation (AF) significantly increases the risk of stroke, heart disease, and mortality.
Photoplethysmography ( PPG) signals are susceptible to corruption from motion artifacts and other factors often encountered in ambulatory settings.
We propose a novel deep learning model, designed to learn how to retain accurate predictions from partially corrupted PPG.
arXiv Detail & Related papers (2024-04-15T01:07:08Z) - Improving Diffusion Models for ECG Imputation with an Augmented Template
Prior [43.6099225257178]
noisy and poor-quality recordings are a major issue for signals collected using mobile health systems.
Recent studies have explored the imputation of missing values in ECG with probabilistic time-series models.
We present a template-guided denoising diffusion probabilistic model (DDPM), PulseDiff, which is conditioned on an informative prior for a range of health conditions.
arXiv Detail & Related papers (2023-10-24T11:34:15Z) - MMA-RNN: A Multi-level Multi-task Attention-based Recurrent Neural
Network for Discrimination and Localization of Atrial Fibrillation [1.8037893225125925]
This paper proposes the Multi-level Multi-task Attention-based Recurrent Neural Network for three-class discrimination on patients and localization of the exact timing of AF episodes.
The model is designed as an end-to-end framework to enhance information interaction and reduce error accumulation.
arXiv Detail & Related papers (2023-02-07T19:59:55Z) - Generalizing electrocardiogram delineation: training convolutional
neural networks with synthetic data augmentation [63.51064808536065]
Existing databases for ECG delineation are small, being insufficient in size and in the array of pathological conditions they represent.
This article delves has two main contributions. First, a pseudo-synthetic data generation algorithm was developed, based in probabilistically composing ECG traces given "pools" of fundamental segments, as cropped from the original databases, and a set of rules for their arrangement into coherent synthetic traces.
Second, two novel segmentation-based loss functions have been developed, which attempt at enforcing the prediction of an exact number of independent structures and at producing closer segmentation boundaries by focusing on a reduced number of samples.
arXiv Detail & Related papers (2021-11-25T10:11:41Z) - Non-contact PPG Signal and Heart Rate Estimation with Multi-hierarchical
Convolutional Network [12.119293125608976]
Heart rate (HR) are important physiological parameters of the human body.
This study presents an efficient multi-archhierical- convolutional network that can estimate HR from face video clips.
arXiv Detail & Related papers (2021-04-06T03:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.