On Improving PPG-Based Sleep Staging: A Pilot Study
- URL: http://arxiv.org/abs/2508.02689v1
- Date: Wed, 23 Jul 2025 14:11:41 GMT
- Title: On Improving PPG-Based Sleep Staging: A Pilot Study
- Authors: Jiawei Wang, Yu Guan, Chen Chen, Ligang Zhou, Laurence T. Yang, Sai Gu,
- Abstract summary: Photoplethys(mography) sensors are widely adopted in consumer devices, but consistently reliable sleep staging using PPG alone remains a non-trivial challenge.<n>We compare conventional single-stream model with dual-stream cross-attention strategies, based on which complementary information can be learned via PPG and PPG-derived modalities.<n>We found that substantial performance gain can be achieved by combining PPG and its auxiliary information under the dual-stream cross-attention architecture.
- Score: 21.462014806247353
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Sleep monitoring through accessible wearable technology is crucial to improving well-being in ubiquitous computing. Although photoplethysmography(PPG) sensors are widely adopted in consumer devices, achieving consistently reliable sleep staging using PPG alone remains a non-trivial challenge. In this work, we explore multiple strategies to enhance the performance of PPG-based sleep staging. Specifically, we compare conventional single-stream model with dual-stream cross-attention strategies, based on which complementary information can be learned via PPG and PPG-derived modalities such as augmented PPG or synthetic ECG. To study the effectiveness of the aforementioned approaches in four-stage sleep monitoring task, we conducted experiments on the world's largest sleep staging dataset, i.e., the Multi-Ethnic Study of Atherosclerosis(MESA). We found that substantial performance gain can be achieved by combining PPG and its auxiliary information under the dual-stream cross-attention architecture. Source code of this project can be found at https://github.com/DavyWJW/sleep-staging-models
Related papers
- Combining scEEG and PPG for reliable sleep staging using lightweight wearables [30.175969145789896]
Reliable sleep staging remains challenging for lightweight wearable devices such as single-channel electroencephalography (scEEG)<n>scEEG offers direct measurement of cortical activity and serves as the foundation for sleep staging, yet exhibits limited performance on light sleep stages.<n>In this work, we investigate scEEG- PPG fusion for 4-class sleep staging under short-window (30 s - 30 min) constraints.
arXiv Detail & Related papers (2026-02-04T13:00:35Z) - Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition [52.232968183793986]
General Policy Composition (GPC) is a training-free method that enhances performance by combining the distributional scores of multiple pre-trained policies.<n>GPC consistently improves performance and adaptability across a diverse set of tasks.
arXiv Detail & Related papers (2025-10-01T16:05:53Z) - KM-GPT: An Automated Pipeline for Reconstructing Individual Patient Data from Kaplan-Meier Plots [45.53914693601933]
We develop KM-GPT, the first fully automated, AI-powered pipeline for reconstructing IPD directly from Kaplan-Meier plots.<n> KM-GPT integrates advanced image preprocessing, multi-modal reasoning powered by GPT-5, and iterative reconstruction algorithms.<n>Its hybrid reasoning architecture automates the conversion of unstructured information into structured data flows.<n> KM-GPT was rigorously evaluated on synthetic and real-world datasets, consistently demonstrating superior accuracy.
arXiv Detail & Related papers (2025-09-15T00:38:38Z) - Sleep Stage Classification using Multimodal Embedding Fusion from EOG and PSM [0.06282171844772422]
This study introduces a novel approach that leverages ImageBind, a multimodal embedding deep learning model, to integrate PSM data with dual-channel EOG signals for sleep stage classification.<n>Our results demonstrate that fine-tuning ImageBind significantly improves classification accuracy, outperforming existing models.
arXiv Detail & Related papers (2025-06-07T20:18:45Z) - Partitioning Message Passing for Graph Fraud Detection [57.928658584067556]
Label imbalance and homophily-heterophily mixture are the fundamental problems encountered when applying Graph Neural Networks (GNNs) to Graph Fraud Detection (GFD) tasks.<n>Existing GNN-based GFD models are designed to augment graph structure to accommodate the inductive bias of GNNs towards homophily.<n>In our work, we argue that the key to applying GNNs for GFD is not to exclude but to em distinguish neighbors with different labels.
arXiv Detail & Related papers (2024-11-16T11:30:53Z) - MSSC-BiMamba: Multimodal Sleep Stage Classification and Early Diagnosis of Sleep Disorders with Bidirectional Mamba [5.606144017978037]
We develop an automated model for sleep staging and disorder classification to enhance diagnostic accuracy and efficiency.
Considering the characteristics of polysomnography (PSG) multi-lead sleep monitoring, we designed a multimodal sleep state classification model, MSSC-BiMamba.
The model is the first to apply BiMamba to sleep staging with multimodal PSG data, showing substantial gains in computational and memory efficiency.
arXiv Detail & Related papers (2024-05-30T15:16:53Z) - SleepPPG-Net2: Deep learning generalization for sleep staging from photoplethysmography [0.7927502566022343]
Sleep staging is a fundamental component in the diagnosis of sleep disorders and the management of sleep health.
Recent data-driven algorithms for sleep staging have shown high performance on local test sets but lower performance on external datasets.
Sleep-Net2 sets a new standard for staging sleep from raw PPG time-series.
arXiv Detail & Related papers (2024-04-10T09:47:34Z) - Benchmarking Joint Face Spoofing and Forgery Detection with Visual and
Physiological Cues [81.15465149555864]
We establish the first joint face spoofing and detection benchmark using both visual appearance and physiological r cues.
To enhance the r periodicity discrimination, we design a two-branch physiological network using both facial powerful rtemporal signal map and its continuous wavelet transformed counterpart as inputs.
arXiv Detail & Related papers (2022-08-10T15:41:48Z) - Identifying Rhythmic Patterns for Face Forgery Detection and
Categorization [46.21354355137544]
We propose a framework for face forgery detection and categorization consisting of: 1) a Spatial-Temporal Filtering Network (STFNet) for PPG signals, and 2) a Spatial-Temporal Interaction Network (STINet) for constraint and interaction of PPG signals.
With insight into the generation of forgery methods, we further propose intra-source and inter-source blending to boost the performance of the framework.
arXiv Detail & Related papers (2022-07-04T04:57:06Z) - SleepPPG-Net: a deep learning algorithm for robust sleep staging from
continuous photoplethysmography [0.0]
We developed Sleep-Net, a DL model for 4-class sleep staging from the raw PPG time series.
We benchmarked the performance of Sleep-Net against models based on the best-reported state-of-the-art (SOTA) algorithms.
arXiv Detail & Related papers (2022-02-11T16:17:42Z) - Consistency Regularization for Deep Face Anti-Spoofing [69.70647782777051]
Face anti-spoofing (FAS) plays a crucial role in securing face recognition systems.
Motivated by this exciting observation, we conjecture that encouraging feature consistency of different views may be a promising way to boost FAS models.
We enhance both Embedding-level and Prediction-level Consistency Regularization (EPCR) in FAS.
arXiv Detail & Related papers (2021-11-24T08:03:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.