Self-similarity Prior Distillation for Unsupervised Remote Physiological
Measurement
- URL: http://arxiv.org/abs/2311.05100v1
- Date: Thu, 9 Nov 2023 02:24:51 GMT
- Title: Self-similarity Prior Distillation for Unsupervised Remote Physiological
Measurement
- Authors: Xinyu Zhang, Weiyu Sun, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie
Yuan, Yingcong Chen
- Abstract summary: We propose a Self-Similarity Prior Distillation (SSPD) framework for unsupervised r estimation.
SSPD capitalizes on the intrinsic self-similarity of cardiac activities.
It achieves comparable or even superior performance compared to state-of-the-art supervised methods.
- Score: 40.68840376187229
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Remote photoplethysmography (rPPG) is a noninvasive technique that aims to
capture subtle variations in facial pixels caused by changes in blood volume
resulting from cardiac activities. Most existing unsupervised methods for rPPG
tasks focus on the contrastive learning between samples while neglecting the
inherent self-similar prior in physiological signals. In this paper, we propose
a Self-Similarity Prior Distillation (SSPD) framework for unsupervised rPPG
estimation, which capitalizes on the intrinsic self-similarity of cardiac
activities. Specifically, we first introduce a physical-prior embedded
augmentation technique to mitigate the effect of various types of noise. Then,
we tailor a self-similarity-aware network to extract more reliable self-similar
physiological features. Finally, we develop a hierarchical self-distillation
paradigm to assist the network in disentangling self-similar physiological
patterns from facial videos. Comprehensive experiments demonstrate that the
unsupervised SSPD framework achieves comparable or even superior performance
compared to the state-of-the-art supervised methods. Meanwhile, SSPD maintains
the lowest inference time and computation cost among end-to-end models. The
source codes are available at https://github.com/LinXi1C/SSPD.
Related papers
- Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation [37.79819260918366]
Continual Test-Time Adaptation (CTTA) aims to adapt the pre-trained model to ever-evolving target domains.
We explore the integration of a Mixture-of-Activation-Sparsity-Experts (MoASE) as an adapter for the CTTA task.
arXiv Detail & Related papers (2024-05-26T08:51:39Z) - rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for
Remote Physiological Measurement [36.54109704201048]
Remote photoplethysmography (r-MAE) is an important technique for perceiving human vital signs.
In this paper, we develop a self-supervised framework for extracting inherent self-similar prior in physiological signals.
We also evaluate the proposed method on two public datasets, namely PURE and UBFC-r.
arXiv Detail & Related papers (2023-06-04T08:53:28Z) - Benchmarking Joint Face Spoofing and Forgery Detection with Visual and
Physiological Cues [81.15465149555864]
We establish the first joint face spoofing and detection benchmark using both visual appearance and physiological r cues.
To enhance the r periodicity discrimination, we design a two-branch physiological network using both facial powerful rtemporal signal map and its continuous wavelet transformed counterpart as inputs.
arXiv Detail & Related papers (2022-08-10T15:41:48Z) - The Way to my Heart is through Contrastive Learning: Remote
Photoplethysmography from Unlabelled Video [10.479541955106328]
The ability to reliably estimate physiological signals from video is a powerful tool in low-cost, pre-clinical health monitoring.
We propose a new approach to remote photoplethysmography (r) - the measurement of blood volume changes from observations of a person's face or skin.
arXiv Detail & Related papers (2021-11-18T15:21:33Z) - On the Robustness of Pretraining and Self-Supervision for a Deep
Learning-based Analysis of Diabetic Retinopathy [70.71457102672545]
We compare the impact of different training procedures for diabetic retinopathy grading.
We investigate different aspects such as quantitative performance, statistics of the learned feature representations, interpretability and robustness to image distortions.
Our results indicate that models from ImageNet pretraining report a significant increase in performance, generalization and robustness to image distortions.
arXiv Detail & Related papers (2021-06-25T08:32:45Z) - A Deep Drift-Diffusion Model for Image Aesthetic Score Distribution
Prediction [68.76594695163386]
We propose a Deep Drift-Diffusion model inspired by psychologists to predict aesthetic score distribution from images.
The DDD model can describe the psychological process of aesthetic perception instead of traditional modeling of the results of assessment.
Our novel DDD model is simple but efficient, which outperforms the state-of-the-art methods in aesthetic score distribution prediction.
arXiv Detail & Related papers (2020-10-15T11:01:46Z) - autoTICI: Automatic Brain Tissue Reperfusion Scoring on 2D DSA Images of
Acute Ischemic Stroke Patients [7.239126951855493]
The Thrombolysis in Cerebral Infarction (TICI) score is an important metric for reperfusion therapy assessment in acute ischemic stroke.
Existing TICI scores are defined in coarse ordinal grades based on visual inspection, leading to inter- and intra-observer variation.
In this work, we present autoTICI, an automatic and quantitative TICI scoring method.
arXiv Detail & Related papers (2020-10-03T22:06:55Z) - Detecting Parkinsonian Tremor from IMU Data Collected In-The-Wild using
Deep Multiple-Instance Learning [59.74684475991192]
Parkinson's Disease (PD) is a slowly evolving neuro-logical disease that affects about 1% of the population above 60 years old.
PD symptoms include tremor, rigidity and braykinesia.
We present a method for automatically identifying tremorous episodes related to PD, based on IMU signals captured via a smartphone device.
arXiv Detail & Related papers (2020-05-06T09:02:30Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.