Related papers: PSG-MAE: Robust Multitask Sleep Event Monitoring using Multichannel PSG Reconstruction and Inter-channel Contrastive Learning

URL: http://arxiv.org/abs/2504.13229v1
Date: Thu, 17 Apr 2025 13:43:16 GMT
Title: PSG-MAE: Robust Multitask Sleep Event Monitoring using Multichannel PSG Reconstruction and Inter-channel Contrastive Learning
Authors: Yifei Wang, Qi Liu, Fuli Min, Honghao Wang
Abstract summary: Polysomnography (PSG) signals are essential for studying sleep processes and diagnosing sleep disorders. We propose PSG-MAE, a mask autoencoder based pre-training framework. We show that PSG-MAE effectively captures both temporal details and inter-channel information from PSG signals.
Score: 16.002946830438766
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Polysomnography (PSG) signals are essential for studying sleep processes and diagnosing sleep disorders. Analyzing PSG data through deep neural networks (DNNs) for automated sleep monitoring has become increasingly feasible. However, the limited availability of datasets for certain sleep events often leads to DNNs focusing on a single task with a single-sourced training dataset. As a result, these models struggle to transfer to new sleep events and lack robustness when applied to new datasets. To address these challenges, we propose PSG-MAE, a mask autoencoder (MAE) based pre-training framework. By performing self-supervised learning on a large volume of unlabeled PSG data, PSG-MAE develops a robust feature extraction network that can be broadly applied to various sleep event monitoring tasks. Unlike conventional MAEs, PSG-MAE generates complementary masks across PSG channels, integrates a multichannel signal reconstruction method, and employs a self-supervised inter-channel contrastive learning (ICCL) strategy. This approach enables the encoder to capture temporal features from each channel while simultaneously learning latent relationships between channels, thereby enhancing the utilization of multichannel information. Experimental results show that PSG-MAE effectively captures both temporal details and inter-channel information from PSG signals. When the encoder pre-trained through PSG-MAE is fine-tuned with downstream feature decomposition networks, it achieves an accuracy of 83.7% for sleep staging and 90.45% for detecting obstructive sleep apnea, which highlights the framework's robustness and broad applicability.
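
The complementary masking idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the two-channel setup, the patch length, the 50% mask ratio, the zero-fill masking, and the helper names `complementary_masks` and `apply_mask` are all assumptions made for illustration. The sketch only shows what it means for two channels to receive masks that are exact complements, so that every time patch stays visible in exactly one channel.

```python
import numpy as np


def complementary_masks(num_patches, mask_ratio=0.5, seed=None):
    """Return two boolean masks (True = masked) that are exact complements.

    Hypothetical helper: the paper's actual mask generation may differ.
    """
    rng = np.random.default_rng(seed)
    num_masked = int(round(num_patches * mask_ratio))
    perm = rng.permutation(num_patches)
    mask_a = np.zeros(num_patches, dtype=bool)
    mask_a[perm[:num_masked]] = True
    mask_b = ~mask_a  # patches hidden in channel A are visible in channel B
    return mask_a, mask_b


def apply_mask(signal, mask, patch_len):
    """Zero out the masked patches of a 1-D signal (zero-fill is an assumption)."""
    patches = signal.reshape(-1, patch_len).copy()
    patches[mask] = 0.0
    return patches.reshape(-1)


# Assumed toy input: one 30 s epoch, two channels, sampled at 100 Hz.
rng = np.random.default_rng(0)
epoch = rng.standard_normal((2, 3000))
patch_len = 100  # assumed patch length: 1 s per patch
num_patches = epoch.shape[1] // patch_len

mask_a, mask_b = complementary_masks(num_patches, mask_ratio=0.5, seed=0)
masked_epoch = np.stack([
    apply_mask(epoch[0], mask_a, patch_len),
    apply_mask(epoch[1], mask_b, patch_len),
])
```

In a setup like this, a reconstruction model would have to recover the hidden patches of one channel partly from the other channel, where the same time span is still visible, which is one plausible reading of how complementary masks encourage the use of inter-channel information.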

Related papers

Periodic-MAE: Periodic Video Masked Autoencoder for rPPG Estimation [6.32655874508904]
We propose a method that learns a general representation of periodic signals from unlabeled facial videos by capturing subtle changes in skin tone over time. We evaluate the proposed method on the PURE, U-BFCr, MMPD, and V-BFC4V datasets. Our results demonstrate significant performance improvements, particularly in challenging cross-dataset evaluations.
arXiv Detail & Related papers (2025-06-27T02:18:10Z)
PSDNorm: Test-Time Temporal Normalization for Deep Learning on EEG Signals [63.05435596565677]
PSDNorm is a layer that leverages Monge mapping and temporal context to normalize feature maps in deep learning models. PSDNorm achieves state-of-the-art performance at test time on datasets not seen during training. PSDNorm provides a significant improvement in robustness, achieving markedly higher F1 scores for the 20% hardest subjects.
arXiv Detail & Related papers (2025-03-06T16:20:25Z)
Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid Self-Supervised Learning Framework [2.424910201171407]
This study introduces SynthSleepNet, a multimodal hybrid self-supervised learning framework for analyzing polysomnography (PSG) data. SynthSleepNet effectively integrates masked prediction and contrastive learning to leverage complementary features across multiple modalities. It achieved superior performance compared to state-of-the-art methods across three downstream tasks.
arXiv Detail & Related papers (2025-02-18T10:11:50Z)
Multi-Source and Test-Time Domain Adaptation on Multivariate Signals using Spatio-Temporal Monge Alignment [59.75420353684495]
Machine learning applications on signals such as computer vision or biomedical data often face challenges due to the variability that exists across hardware devices or session recordings. In this work, we propose Spatio-Temporal Monge Alignment (STMA) to mitigate these variabilities. We show that STMA leads to significant and consistent performance gains between datasets acquired with very different settings.
arXiv Detail & Related papers (2024-07-19T13:33:38Z)
ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures [2.3907933297014927]
An Attention Recurrent Neural Network (ARNN) is proposed that can process a large amount of data efficiently and accurately. ARNN cell recurrently applies attention layers along a sequence and has linear complexity with the sequence length. This framework is inspired in part by the attention layer and long short-term memory (LSTM) cells, but it scales this typical cell up by several orders to parallelize for multi-channel EEG signals.
arXiv Detail & Related papers (2024-03-05T19:15:17Z)
Multi-Signal Reconstruction Using Masked Autoencoder From EEG During Polysomnography [24.336598771550157]
Polysomnography (PSG) is an indispensable diagnostic tool in sleep medicine. We propose a novel system capable of reconstructing multi-signal PSG from a single-channel EEG. Our results present promise for the development of more accessible and long-term sleep monitoring systems.
arXiv Detail & Related papers (2023-11-14T02:57:37Z)
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning [53.00683059396803]
Masked image modeling (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images. We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for the optimal image masking ratio and masking strategy. Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z)
Joint Channel Estimation and Feedback with Masked Token Transformers in Massive MIMO Systems [74.52117784544758]
This paper proposes an encoder-decoder based network that unveils the intrinsic frequency-domain correlation within the CSI matrix. The entire encoder-decoder network is utilized for channel compression. Our method outperforms state-of-the-art channel estimation and feedback techniques in joint tasks.
arXiv Detail & Related papers (2023-06-08T06:15:17Z)
Convolutional Monge Mapping Normalization for learning on sleep data [63.22081662149488]
We propose a new method called Convolutional Monge Mapping Normalization (CMMN). CMMN consists of filtering the signals in order to adapt their power spectral density (PSD) to a Wasserstein barycenter estimated on training data. Numerical experiments on sleep EEG data show that CMMN leads to significant and consistent performance gains independent of the neural network architecture.
arXiv Detail & Related papers (2023-05-30T08:24:01Z)
Co-attention Propagation Network for Zero-Shot Video Object Segmentation [91.71692262860323]
Zero-shot video object segmentation (ZS-VOS) aims to segment objects in a video sequence without prior knowledge of these objects. Existing ZS-VOS methods often struggle to distinguish between foreground and background or to keep track of the foreground in complex scenarios. We propose an encoder-decoder-based hierarchical co-attention propagation network (HCPN) capable of tracking and segmenting objects.
arXiv Detail & Related papers (2023-04-08T04:45:48Z)
Convolutional Neural Networks for Sleep Stage Scoring on a Two-Channel EEG Signal [63.18666008322476]
Sleep problems are among the major health problems worldwide. The basic tool used by specialists is the polysomnogram, a collection of different signals recorded during sleep. Specialists have to score the different signals according to one of the standard guidelines.
arXiv Detail & Related papers (2021-03-30T09:59:56Z)
MRNet: a Multi-scale Residual Network for EEG-based Sleep Staging [5.141687309207561]
We propose a new framework, called MRNet, for data-driven sleep staging by integrating a multi-scale feature fusion model and a sequential correction algorithm. EEG signals lose considerable detailed information in network propagation, which affects the representation of deep features. Experimental results demonstrate the competitive performance of our proposed approach on both accuracy and F1 score.
arXiv Detail & Related papers (2021-01-07T13:48:30Z)
Automate Obstructive Sleep Apnea Diagnosis Using Convolutional Neural Networks [4.882119124419393]
This paper presents a CNN architecture with 1D convolutional and FCN layers for classification. The proposed 1D CNN model achieves excellent classification results without manually preprocessing PSG signals.
arXiv Detail & Related papers (2020-06-13T15:35:18Z)

This list is automatically generated from the titles and abstracts of the papers on this site.