Remote Photoplethysmography from Low Resolution videos: An end-to-end
solution using Efficient ConvNets
- URL: http://arxiv.org/abs/2208.06817v1
- Date: Sun, 14 Aug 2022 10:04:25 GMT
- Title: Remote Photoplethysmography from Low Resolution videos: An end-to-end
solution using Efficient ConvNets
- Authors: Bharath Ramakrishnan, Ruijia Deng
- Abstract summary: We propose to use efficient convolutional networks to accurately measure the heart rate of user from low resolution facial videos.
To ensure that we are able to obtain the heart rate in real time, we compress the deep learning model by pruning it, thereby reducing its memory footprint.
- Score: 0.2768955853144218
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Measurement of the cardiac pulse from facial video has become an interesting
pursuit of research over the last few years. This is mainly due to the
increasing importance of obtaining the heart rate of an individual in a
non-invasive manner, which can be highly useful for applications in gaming and
the medical industry. Another instrumental area of research over the past few
years has been the advent of Deep Learning and using Deep Neural networks to
enhance task performance. In this work, we propose to use efficient
convolutional networks to accurately measure the heart rate of user from low
resolution facial videos. Furthermore, to ensure that we are able to obtain the
heart rate in real time, we compress the deep learning model by pruning it,
thereby reducing its memory footprint. We benchmark the performance of our
approach on the MAHNOB dataset and compare its performance across multiple
approaches.
Related papers
- Remote Heart Rate Monitoring in Smart Environments from Videos with
Self-supervised Pre-training [28.404118669462772]
We introduce a solution that utilizes self-supervised contrastive learning for the estimation of remote photoplethys (mography) and heart rate monitoring.
We propose the use of 3 spatial and 3 temporal augmentations for training an encoder through a contrastive framework, followed by utilizing the late-intermediate embeddings of the encoder for remote PPG and heart rate estimation.
arXiv Detail & Related papers (2023-10-23T22:41:04Z) - Camera-Based HRV Prediction for Remote Learning Environments [4.074837550066978]
Restoring blood volume pulse signals from facial videos is a challenging task that involves a series of preprocessing, image algorithms, and postprocessing to restore waveforms.
The challenge in obtaining HRV indices through r is the necessity for algorithms to precisely predict the BVP peak positions.
In this paper, we collected the Remote Learning Affect and Physiology (RLAP) dataset, which includes over 32 hours of highly synchronized video and labels from 58 subjects.
Using the RLAP dataset, we trained a new model called Seq-r, it is a model based on one-dimensional convolution, and experimental results reveal
arXiv Detail & Related papers (2023-05-07T02:26:00Z) - Differentiable Frequency-based Disentanglement for Aerial Video Action
Recognition [56.91538445510214]
We present a learning algorithm for human activity recognition in videos.
Our approach is designed for UAV videos, which are mainly acquired from obliquely placed dynamic cameras.
We conduct extensive experiments on the UAV Human dataset and the NEC Drone dataset.
arXiv Detail & Related papers (2022-09-15T22:16:52Z) - Visual Heart Rate Estimation from RGB Facial Video using Spectral
Reflectance [0.23438564092609357]
We propose a reliable HR estimation framework using the spectral reflectance of the user, which makes it robust to motion and illumination disturbances.
We employ deep learning-based frameworks such as Faster RCNNs to perform face detection as opposed to the Viola Jones algorithm employed by previous approaches.
arXiv Detail & Related papers (2022-08-09T04:34:04Z) - A Supervised Learning Approach for Robust Health Monitoring using Face
Videos [32.157163136267954]
Non-contact, device-free human sensing methods can eliminate the need for specialized heart and blood pressure monitoring equipment.
In this paper, we used a non-contact method that only requires face videos recorded using commercially-available webcams.
The proposed approach used facial recognition to detect the face in each frame of the video using facial landmarks.
arXiv Detail & Related papers (2021-01-30T22:03:16Z) - Searching for Efficient Architecture for Instrument Segmentation in
Robotic Surgery [58.63306322525082]
Most applications rely on accurate real-time segmentation of high-resolution surgical images.
We design a light-weight and highly-efficient deep residual architecture which is tuned to perform real-time inference of high-resolution images.
arXiv Detail & Related papers (2020-07-08T21:38:29Z) - Motion Pyramid Networks for Accurate and Efficient Cardiac Motion
Estimation [51.72616167073565]
We propose Motion Pyramid Networks, a novel deep learning-based approach for accurate and efficient cardiac motion estimation.
We predict and fuse a pyramid of motion fields from multiple scales of feature representations to generate a more refined motion field.
We then use a novel cyclic teacher-student training strategy to make the inference end-to-end and further improve the tracking performance.
arXiv Detail & Related papers (2020-06-28T21:03:19Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z) - LRTD: Long-Range Temporal Dependency based Active Learning for Surgical
Workflow Recognition [67.86810761677403]
We propose a novel active learning method for cost-effective surgical video analysis.
Specifically, we propose a non-local recurrent convolutional network (NL-RCNet), which introduces non-local block to capture the long-range temporal dependency.
We validate our approach on a large surgical video dataset (Cholec80) by performing surgical workflow recognition task.
arXiv Detail & Related papers (2020-04-21T09:21:22Z) - Optimised Convolutional Neural Networks for Heart Rate Estimation and
Human Activity Recognition in Wrist Worn Sensing Applications [3.8137985834223507]
We provide improved heart rate and human activity recognition simultaneously at low sample rates.
This simplifies hardware design and reduces costs and power budgets.
We apply two deep learning pipelines, one for human activity recognition and one for heart rate estimation.
arXiv Detail & Related papers (2020-03-30T11:44:58Z) - Deep-HR: Fast Heart Rate Estimation from Face Video Under Realistic
Conditions [62.68522031911656]
We propose a simple yet efficient approach to benefit the advantages of the Deep Neural Network (DNN) by simplifying HR estimation from a complex task to learning from very correlated representation to HR.
To be more accurate and work well on low-quality videos, two deep encoder-decoder networks are trained to refine the output of FE.
Experimental results on HR-D and MAHNOB datasets confirm that our method could run as a real-time method and estimate the average HR better than state-of-the-art ones.
arXiv Detail & Related papers (2020-02-12T07:00:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.