End-to-End Latency Optimization of Multi-view 3D Reconstruction for
Disaster Response
- URL: http://arxiv.org/abs/2304.01488v1
- Date: Tue, 4 Apr 2023 03:04:44 GMT
- Title: End-to-End Latency Optimization of Multi-view 3D Reconstruction for
Disaster Response
- Authors: Xiaojie Zhang, Mingjun Li, Andrew Hilton, Amitangshu Pal, Soumyabrata
Dey, Saptarshi Debroy
- Abstract summary: Multi-view Stereo (MVS) based 3D reconstruction applications are exceedingly time consuming, especially when run on such computationally constrained mobile edge devices.
In this paper, we aim to design a latency optimized MVS algorithm pipeline, with the objective to best balance the end-to-end latency and reconstruction quality.
- Score: 3.471012855429593
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In order to plan rapid response during disasters, first responder agencies
often adopt `bring your own device' (BYOD) model with inexpensive mobile edge
devices (e.g., drones, robots, tablets) for complex video analytics
applications, e.g., 3D reconstruction of a disaster scene. Unlike simpler video
applications, widely used Multi-view Stereo (MVS) based 3D reconstruction
applications (e.g., openMVG/openMVS) are exceedingly time consuming, especially
when run on such computationally constrained mobile edge devices. Additionally,
reducing the reconstruction latency of such inherently sequential algorithms is
challenging as unintelligent, application-agnostic strategies can drastically
degrade the reconstruction (i.e., application outcome) quality making them
useless. In this paper, we aim to design a latency optimized MVS algorithm
pipeline, with the objective to best balance the end-to-end latency and
reconstruction quality by running the pipeline on a collaborative mobile edge
environment. The overall optimization approach is two-pronged where: (a)
application optimizations introduce data-level parallelism by splitting the
pipeline into high frequency and low frequency reconstruction components and
(b) system optimizations incorporate task-level parallelism to the pipelines by
running them opportunistically on available resources with online quality
control in order to balance both latency and quality. Our evaluation on a
hardware testbed using publicly available datasets shows upto ~54% reduction in
latency with negligible loss (~4-7%) in reconstruction quality.
Related papers
- Low-Resource Video Super-Resolution using Memory, Wavelets, and Deformable Convolutions [3.018928786249079]
Video super-resolution (VSR) remains a formidable challenge in its adoption for deployment on resource-constrained edge devices.
We propose a novel lightweight and parameter-efficient neural architecture for VSR that achieves state-of-the-art reconstruction accuracy with just 2.3 million parameters.
arXiv Detail & Related papers (2025-02-03T20:46:15Z) - Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass [68.78222900840132]
We propose Fast 3D Reconstruction (Fast3R), a novel multi-view generalization to DUSt3R that achieves efficient and scalable 3D reconstruction by processing many views in parallel.
Fast3R demonstrates state-of-the-art performance, with significant improvements in inference speed and reduced error accumulation.
arXiv Detail & Related papers (2025-01-23T18:59:55Z) - VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment [63.21396416244634]
VideoLifter is a novel video-to-3D pipeline that leverages a local-to-global strategy on a fragment basis.
It significantly accelerates the reconstruction process, reducing training time by over 82% while holding better visual quality than current SOTA methods.
arXiv Detail & Related papers (2025-01-03T18:52:36Z) - Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Taming 3DGS: High-Quality Radiance Fields with Limited Resources [50.92437599516609]
3D Gaussian Splatting (3DGS) has transformed novel-view synthesis with its fast, interpretable, and high-fidelity rendering.
We tackle the challenges of training and rendering 3DGS models on a budget.
We derive faster, numerically equivalent solutions for gradient computation and attribute updates.
arXiv Detail & Related papers (2024-06-21T20:44:23Z) - PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System
Co-design [16.76965926088238]
PipeRAG is a novel algorithm-system co-design approach to reduce generation latency and enhance generation quality.
Our evaluation shows that PipeRAG achieves up to 2.6$times$ speedup in end-to-end generation latency while improving generation quality.
arXiv Detail & Related papers (2024-03-08T21:09:20Z) - Residual Multiplicative Filter Networks for Multiscale Reconstruction [24.962697695403037]
We introduce a new coordinate network architecture and training scheme that enables coarse-to-fine optimization with fine-grained control over the frequency support of learned reconstructions.
We demonstrate how these modifications enable multiscale optimization for coarse-to-fine fitting to natural images.
We then evaluate our model on synthetically generated datasets for the the problem of single-particle cryo-EM reconstruction.
arXiv Detail & Related papers (2022-06-01T20:16:28Z) - Real-Time GPU-Accelerated Machine Learning Based Multiuser Detection for
5G and Beyond [70.81551587109833]
nonlinear beamforming filters can significantly outperform linear approaches in stationary scenarios with massive connectivity.
One of the main challenges comes from the real-time implementation of these algorithms.
This paper explores the acceleration of APSM-based algorithms through massive parallelization.
arXiv Detail & Related papers (2022-01-13T15:20:45Z) - Architecture Aware Latency Constrained Sparse Neural Networks [35.50683537052815]
In this paper, we design an architecture aware latency constrained sparse framework to prune and accelerate CNN models.
We also propose a novel sparse convolution algorithm for efficient computation.
Our system-algorithm co-design framework can achieve much better frontier among network accuracy and latency on resource-constrained mobile devices.
arXiv Detail & Related papers (2021-09-01T03:41:31Z) - Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture
and Pruning Search [64.80878113422824]
We propose an automatic search framework that derives sparse super-resolution (SR) models with high image quality while satisfying the real-time inference requirement.
With the proposed framework, we are the first to achieve real-time SR inference (with only tens of milliseconds per frame) for implementing 720p resolution with competitive image quality.
arXiv Detail & Related papers (2021-08-18T06:47:31Z) - A New Backbone for Hyperspectral Image Reconstruction [90.48427561874402]
3D hyperspectral image (HSI) reconstruction refers to inverse process of snapshot compressive imaging.
Proposal is for a Spatial/Spectral Invariant Residual U-Net, namely SSI-ResU-Net.
We show that SSI-ResU-Net achieves competing performance with over 77.3% reduction in terms of floating-point operations.
arXiv Detail & Related papers (2021-08-17T16:20:51Z) - Multi-Exit Semantic Segmentation Networks [78.44441236864057]
We propose a framework for converting state-of-the-art segmentation models to MESS networks.
specially trained CNNs that employ parametrised early exits along their depth to save during inference on easier samples.
We co-optimise the number, placement and architecture of the attached segmentation heads, along with the exit policy, to adapt to the device capabilities and application-specific requirements.
arXiv Detail & Related papers (2021-06-07T11:37:03Z) - Real-time Non-line-of-sight Imaging with Two-step Deep Remapping [0.0]
Non-line-of-sight (NLOS) imaging takes the indirect light into account.
Most solutions employ a transient scanning process, followed by a back-projection based algorithm to reconstruct the NLOS scenes.
Here we propose a new NLOS solution to address the above defects, with innovations on both detection equipment and reconstruction algorithm.
arXiv Detail & Related papers (2021-01-26T00:08:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.