Deep-HR: Fast Heart Rate Estimation from Face Video Under Realistic
Conditions
- URL: http://arxiv.org/abs/2002.04821v1
- Date: Wed, 12 Feb 2020 07:00:07 GMT
- Title: Deep-HR: Fast Heart Rate Estimation from Face Video Under Realistic
Conditions
- Authors: Mohammad Sabokrou, Masoud Pourreza, Xiaobai Li, Mahmood Fathy, Guoying
Zhao
- Abstract summary: We propose a simple yet efficient approach to benefit the advantages of the Deep Neural Network (DNN) by simplifying HR estimation from a complex task to learning from very correlated representation to HR.
To be more accurate and work well on low-quality videos, two deep encoder-decoder networks are trained to refine the output of FE.
Experimental results on HR-D and MAHNOB datasets confirm that our method could run as a real-time method and estimate the average HR better than state-of-the-art ones.
- Score: 62.68522031911656
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents a novel method for remote heart rate (HR) estimation.
Recent studies have proved that blood pumping by the heart is highly correlated
to the intense color of face pixels, and surprisingly can be utilized for
remote HR estimation. Researchers successfully proposed several methods for
this task, but making it work in realistic situations is still a challenging
problem in computer vision community. Furthermore, learning to solve such a
complex task on a dataset with very limited annotated samples is not
reasonable. Consequently, researchers do not prefer to use the deep learning
approaches for this problem. In this paper, we propose a simple yet efficient
approach to benefit the advantages of the Deep Neural Network (DNN) by
simplifying HR estimation from a complex task to learning from very correlated
representation to HR. Inspired by previous work, we learn a component called
Front-End (FE) to provide a discriminative representation of face videos,
afterward a light deep regression auto-encoder as Back-End (BE) is learned to
map the FE representation to HR. Regression task on the informative
representation is simple and could be learned efficiently on limited training
samples. Beside of this, to be more accurate and work well on low-quality
videos, two deep encoder-decoder networks are trained to refine the output of
FE. We also introduce a challenging dataset (HR-D) to show that our method can
efficiently work in realistic conditions. Experimental results on HR-D and
MAHNOB datasets confirm that our method could run as a real-time method and
estimate the average HR better than state-of-the-art ones.
Related papers
- The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback [12.388205905012423]
Reinforcement learning from human feedback has contributed to performance improvements in large language models.
We formulate RLHF as the contextual dueling bandit problem and assume a common linear representation.
We prove that to achieve $varepsilon-$optimal, the sample complexity of the source tasks can be significantly reduced.
arXiv Detail & Related papers (2024-05-18T08:29:15Z) - Sample Efficient Reinforcement Learning from Human Feedback via Active
Exploration [29.935758027209292]
Preference-based feedback is important for many applications in reinforcement learning.
In this work, we take advantage of the fact that one can often choose contexts to obtain human feedback.
We show that our method is able to reach better performance with fewer samples of human preferences than multiple baselines.
arXiv Detail & Related papers (2023-12-01T00:54:02Z) - 360$^\circ$ High-Resolution Depth Estimation via Uncertainty-aware Structural Knowledge Transfer [8.988255747467333]
To predict high-resolution (HR) omnidirectional depth map, existing methods typically leverage HR omnidirectional image (ODI) as the input via fully-supervised learning.
In this paper, we explore for the first time to estimate the HR omnidirectional depth directly from a low-resolution (LR) ODI, when no HR depth GT map is available.
Our key idea is to transfer the scene structural knowledge from the HR image modality and the corresponding LR depth maps to achieve the goal of HR depth estimation without any extra inference cost.
arXiv Detail & Related papers (2023-04-17T03:24:21Z) - EVM-CNN: Real-Time Contactless Heart Rate Estimation from Facial Video [9.664590078212441]
A new framework is introduced to remotely estimate the heart rate (HR) under realistic conditions by combining spatial and temporal filtering and a convolutional neural network.
Our proposed approach shows better performance compared with the benchmark on the MMSE-HR dataset in terms of both the average HR estimation and short-time HR estimation.
arXiv Detail & Related papers (2022-12-25T15:25:15Z) - Few-Shot Preference Learning for Human-in-the-Loop RL [13.773589150740898]
Motivated by the success of meta-learning, we pre-train preference models on prior task data and quickly adapt them for new tasks using only a handful of queries.
We reduce the amount of online feedback needed to train manipulation policies in Meta-World by 20$times$, and demonstrate the effectiveness of our method on a real Franka Panda Robot.
arXiv Detail & Related papers (2022-12-06T23:12:26Z) - A Practical Contrastive Learning Framework for Single-Image
Super-Resolution [51.422185656787285]
We investigate contrastive learning-based single image super-resolution from two perspectives.
We propose a practical contrastive learning framework for SISR, named PCL-SR.
Compared with existing benchmark methods, we re-train them by our proposed PCL-SR framework and achieve superior performance.
arXiv Detail & Related papers (2021-11-27T15:42:12Z) - FasterPose: A Faster Simple Baseline for Human Pose Estimation [65.8413964785972]
We propose a design paradigm for cost-effective network with LR representation for efficient pose estimation, named FasterPose.
We study the training behavior of FasterPose, and formulate a novel regressive cross-entropy (RCE) loss function for accelerating the convergence.
Compared with the previously dominant network of pose estimation, our method reduces 58% of the FLOPs and simultaneously gains 1.3% improvement of accuracy.
arXiv Detail & Related papers (2021-07-07T13:39:08Z) - Few-Cost Salient Object Detection with Adversarial-Paced Learning [95.0220555274653]
This paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only.
We name this task as the few-cost salient object detection and propose an adversarial-paced learning (APL)-based framework to facilitate the few-cost learning scenario.
arXiv Detail & Related papers (2021-04-05T14:15:49Z) - Best-Buddy GANs for Highly Detailed Image Super-Resolution [71.13466303340192]
We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input.
Most methods along this line rely on a predefined single-LR-single-HR mapping, which is not flexible enough for the SISR task.
We propose best-buddy GANs (Beby-GAN) for rich-detail SISR. Relaxing the immutable one-to-one constraint, we allow the estimated patches to dynamically seek the best supervision.
arXiv Detail & Related papers (2021-03-29T02:58:27Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.