EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals
Measurement
- URL: http://arxiv.org/abs/2110.04447v1
- Date: Sat, 9 Oct 2021 03:51:26 GMT
- Title: EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals
Measurement
- Authors: Xin Liu, Brian L. Hill, Ziheng Jiang, Shwetak Patel, Daniel McDuff
- Abstract summary: We propose two novel neural models for camera-based physiological measurement called EfficientPhys.
Our models achieve state-of-the-art accuracy on three public datasets.
- Score: 5.435325323159416
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Camera-based physiological measurement is a growing field with neural models
providing state-the-art-performance. Prior research have explored various
``end-to-end'' models; however these methods still require several
preprocessing steps. These additional operations are often non-trivial to
implement making replication and deployment difficult and can even have a
higher computational budget than the ``core'' network itself. In this paper, we
propose two novel and efficient neural models for camera-based physiological
measurement called EfficientPhys that remove the need for face detection,
segmentation, normalization, color space transformation or any other
preprocessing steps. Using an input of raw video frames, our models achieve
state-of-the-art accuracy on three public datasets. We show that this is the
case whether using a transformer or convolutional backbone. We further evaluate
the latency of the proposed networks and show that our most light weight
network also achieves a 33% improvement in efficiency.
Related papers
- Query Quantized Neural SLAM [25.72309707436261]
We propose query quantized neural SLAM to reduce variations of input for easier and faster overfitting a frame.
We report visual and numerical comparisons on widely used benchmarks to show our superiority over the latest methods in both reconstruction and camera tracking.
arXiv Detail & Related papers (2024-12-21T04:08:18Z) - Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation [3.6813810514531085]
We introduce a novel 3D medical image segmentation model called EM-Net. Inspired by its success, we introduce a novel Mamba-based 3D medical image segmentation model called EM-Net.
Comprehensive experiments on two challenging multi-organ datasets with other state-of-the-art (SOTA) algorithms show that our method exhibits better segmentation accuracy while requiring nearly half the parameter size of SOTA models and 2x faster training speed.
arXiv Detail & Related papers (2024-09-26T09:34:33Z) - SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical
Image Segmentation [0.0]
We propose a simple yet effective UNet-Transformer (seUNet-Trans) model for medical image segmentation.
In our approach, the UNet model is designed as a feature extractor to generate multiple feature maps from the input images.
By leveraging the UNet architecture and the self-attention mechanism, our model not only retains the preservation of both local and global context information but also is capable of capturing long-range dependencies between input elements.
arXiv Detail & Related papers (2023-10-16T01:13:38Z) - Sample Less, Learn More: Efficient Action Recognition via Frame Feature
Restoration [59.6021678234829]
We propose a novel method to restore the intermediate features for two sparsely sampled and adjacent video frames.
With the integration of our method, the efficiency of three commonly used baselines has been improved by over 50%, with a mere 0.5% reduction in recognition accuracy.
arXiv Detail & Related papers (2023-07-27T13:52:42Z) - Neural Lens Modeling [50.57409162437732]
NeuroLens is a neural lens model for distortion and vignetting that can be used for point projection and ray casting.
It can be used to perform pre-capture calibration using classical calibration targets, and can later be used to perform calibration or refinement during 3D reconstruction.
The model generalizes across many lens types and is trivial to integrate into existing 3D reconstruction and rendering systems.
arXiv Detail & Related papers (2023-04-10T20:09:17Z) - An Adversarial Active Sampling-based Data Augmentation Framework for
Manufacturable Chip Design [55.62660894625669]
Lithography modeling is a crucial problem in chip design to ensure a chip design mask is manufacturable.
Recent developments in machine learning have provided alternative solutions in replacing the time-consuming lithography simulations with deep neural networks.
We propose a litho-aware data augmentation framework to resolve the dilemma of limited data and improve the machine learning model performance.
arXiv Detail & Related papers (2022-10-27T20:53:39Z) - GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction [50.248694764703714]
Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction.
These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization.
We propose Greedy LEarning for Accelerated MRI reconstruction, an efficient training strategy for high-dimensional imaging settings.
arXiv Detail & Related papers (2022-07-18T06:01:29Z) - SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection
for Autonomous Driving [0.0]
Spiking Neural Networks are a new neural network design approach that promises tremendous improvements in power efficiency, computation efficiency, and processing latency.
We first illustrate the applicability of spiking neural networks to a complex deep learning task namely Lidar based 3D object detection for automated driving.
arXiv Detail & Related papers (2022-06-06T20:05:17Z) - Searching for Efficient Architecture for Instrument Segmentation in
Robotic Surgery [58.63306322525082]
Most applications rely on accurate real-time segmentation of high-resolution surgical images.
We design a light-weight and highly-efficient deep residual architecture which is tuned to perform real-time inference of high-resolution images.
arXiv Detail & Related papers (2020-07-08T21:38:29Z) - Deep Medical Image Analysis with Representation Learning and
Neuromorphic Computing [1.43494686131174]
We present a capsule network that explicitly learns a representation robust to rotation and affine transformation.
Second, we leverage the latest domain adaptation techniques to achieve a new state-of-the-art accuracy.
Third, we design a spiking neural network trained on the Intel Loihi neuromorphic chip.
arXiv Detail & Related papers (2020-05-11T20:56:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.