LFPS-Net: a lightweight fast pulse simulation network for BVP estimation
- URL: http://arxiv.org/abs/2206.12558v1
- Date: Sat, 25 Jun 2022 05:24:52 GMT
- Title: LFPS-Net: a lightweight fast pulse simulation network for BVP estimation
- Authors: Jialiang Zhuang, Yun Zhang, Yuheng Chen, Xiujuan Zheng
- Abstract summary: Heart rate estimation based on remote photoplethysmography plays an important role in several specific scenarios, such as health monitoring and fatigue detection.
Existing methods are committed to taking the average of the predicted HRs of multiple overlapping video clips as the final results for the 30-second facial video.
We propose a lightweight fast pulse simulation network (LFPS-Net), pursuing the best accuracy within a very limited computational and time budget.
- Score: 4.631302854901082
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Heart rate estimation based on remote photoplethysmography plays an important
role in several specific scenarios, such as health monitoring and fatigue
detection. Existing well-established methods are committed to taking the
average of the predicted HRs of multiple overlapping video clips as the final
results for the 30-second facial video. Although these methods with hundreds of
layers and thousands of channels are highly accurate and robust, they require
enormous computational budget and a 30-second wait time, which greatly limits
the application of the algorithms to scale. Under these cicumstacnces, We
propose a lightweight fast pulse simulation network (LFPS-Net), pursuing the
best accuracy within a very limited computational and time budget, focusing on
common mobile platforms, such as smart phones. In order to suppress the noise
component and get stable pulse in a short time, we design a multi-frequency
modal signal fusion mechanism, which exploits the theory of time-frequency
domain analysis to separate multi-modal information from complex signals. It
helps proceeding network learn the effective fetures more easily without adding
any parameter. In addition, we design a oversampling training strategy to solve
the problem caused by the unbalanced distribution of dataset. For the 30-second
facial videos, our proposed method achieves the best results on most of the
evaluation metrics for estimating heart rate or heart rate variability compared
to the best available papers. The proposed method can still obtain very
competitive results by using a short-time (~15-second) facail video.
Related papers
- Batch-FPM: Random batch-update multi-parameter physical Fourier ptychography neural network [0.933064392528114]
Fourier Ptychographic Microscopy (FPM) is a computational imaging technique that enables high-resolution imaging over a large field of view.
We propose a fast and robust FPM reconstruction method based on physical neural networks with batch update gradient descent (SGD) optimization strategy.
Our method has better convergence performance even for low signal-to-noise ratio data sets, such as low exposure time dark-field images.
arXiv Detail & Related papers (2024-08-25T09:24:18Z) - Match and Locate: low-frequency monocular odometry based on deep feature
matching [0.65268245109828]
We introduce a novel approach for the robotic odometry which only requires a single camera.
The approach is based on matching image features between the consecutive frames of the video stream using deep feature matching models.
We evaluate the performance of the approach in the AISG-SLA Visual Localisation Challenge and find that while being computationally efficient and easy to implement our method shows competitive results.
arXiv Detail & Related papers (2023-11-16T17:32:58Z) - Time Scale Network: A Shallow Neural Network For Time Series Data [18.46091267922322]
Time series data is often composed of information at multiple time scales.
Deep learning strategies exist to capture this information, but many make networks larger, require more data, are more demanding to compute, and are difficult to interpret.
We present a minimal, computationally efficient Time Scale Network combining the translation and dilation sequence used in discrete wavelet transforms with traditional convolutional neural networks and back-propagation.
arXiv Detail & Related papers (2023-11-10T16:39:55Z) - Fast Event-based Optical Flow Estimation by Triplet Matching [13.298845944779108]
Event cameras offer advantages over traditional cameras (low latency, high dynamic range, low power, etc.)
Optical flow estimation methods that work on packets of events trade off speed for accuracy.
We propose a novel optical flow estimation scheme based on triplet matching.
arXiv Detail & Related papers (2022-12-23T09:12:16Z) - Taking ROCKET on an Efficiency Mission: Multivariate Time Series
Classification with LightWaveS [3.5786621294068373]
We present LightWaveS, a framework for accurate multivariate time series classification.
It employs just 2.5% of the ROCKET features, while achieving accuracy comparable to recent deep learning models.
We show that we achieve speedup ranging from 9x to 65x compared to ROCKET during inference on an edge device.
arXiv Detail & Related papers (2022-04-04T10:52:20Z) - Deep Impulse Responses: Estimating and Parameterizing Filters with Deep
Networks [76.830358429947]
Impulse response estimation in high noise and in-the-wild settings is a challenging problem.
We propose a novel framework for parameterizing and estimating impulse responses based on recent advances in neural representation learning.
arXiv Detail & Related papers (2022-02-07T18:57:23Z) - Self-Supervised Learning of Perceptually Optimized Block Motion
Estimates for Video Compression [50.48504867843605]
We propose a search-free block motion estimation framework using a multi-stage convolutional neural network.
We deploy the multi-scale structural similarity (MS-SSIM) loss function to optimize the perceptual quality of the motion compensated predicted frames.
arXiv Detail & Related papers (2021-10-05T03:38:43Z) - Sample and Computation Redistribution for Efficient Face Detection [137.19388513633484]
Training data sampling and computation distribution strategies are the keys to efficient and accurate face detection.
scrfdf34 outperforms the best competitor, TinaFace, by $3.86%$ (AP at hard set) while being more than emph3$times$ faster on GPUs with VGA-resolution images.
arXiv Detail & Related papers (2021-05-10T23:51:14Z) - FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation [81.76975488010213]
Dense optical flow estimation plays a key role in many robotic vision tasks.
Current networks often occupy large number of parameters and require heavy computation costs.
Our proposed FastFlowNet works in the well-known coarse-to-fine manner with following innovations.
arXiv Detail & Related papers (2021-03-08T03:09:37Z) - Robust Unsupervised Video Anomaly Detection by Multi-Path Frame
Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z) - APQ: Joint Search for Network Architecture, Pruning and Quantization
Policy [49.3037538647714]
We present APQ for efficient deep learning inference on resource-constrained hardware.
Unlike previous methods that separately search the neural architecture, pruning policy, and quantization policy, we optimize them in a joint manner.
With the same accuracy, APQ reduces the latency/energy by 2x/1.3x over MobileNetV2+HAQ.
arXiv Detail & Related papers (2020-06-15T16:09:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.