BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo
Surgical Image Matching
- URL: http://arxiv.org/abs/2205.03133v1
- Date: Fri, 6 May 2022 10:50:49 GMT
- Title: BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo
Surgical Image Matching
- Authors: Jingwei Song, Qiuchen Zhu, Jianyu Lin and Maani Ghaffari
- Abstract summary: This paper proposes the first CPU-level real-time prior-free stereo matching algorithm for general MIS tasks.
We achieve an average 17 Hz on 640*480 images with a single-core CPU (i5-9400) for surgical images.
It has similar or higher accuracy and fewer outliers than the baseline ELAS in MIS, while it is 4-5 times faster.
- Score: 2.990820994368054
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In stereoscope-based Minimally Invasive Surgeries (MIS), dense stereo
matching plays an indispensable role in 3D shape recovery, AR, VR, and
navigation tasks. Although numerous Deep Neural Network (DNN) approaches are
proposed, the conventional prior-free approaches are still popular in the
industry because of the lack of open-source annotated data set and the
limitation of the task-specific pre-trained DNNs. Among the prior-free stereo
matching algorithms, there is no successful real-time algorithm in none GPU
environment for MIS. This paper proposes the first CPU-level real-time
prior-free stereo matching algorithm for general MIS tasks. We achieve an
average 17 Hz on 640*480 images with a single-core CPU (i5-9400) for surgical
images. Meanwhile, it achieves slightly better accuracy than the popular ELAS.
The patch-based fast disparity searching algorithm is adopted for the rectified
stereo images. A coarse-to-fine Bayesian probability and a spatial Gaussian
mixed model were proposed to evaluate the patch probability at different
scales. An optional probability density function estimation algorithm was
adopted to quantify the prediction variance. Extensive experiments demonstrated
the proposed method's capability to handle ambiguities introduced by the
textureless surfaces and the photometric inconsistency from the non-Lambertian
reflectance and dark illumination. The estimated probability managed to balance
the confidences of the patches for stereo images at different scales. It has
similar or higher accuracy and fewer outliers than the baseline ELAS in MIS,
while it is 4-5 times faster. The code and the synthetic data sets are
available at https://github.com/JingweiSong/BDIS-v2.
Related papers
- Generalization of pixel-wise phase estimation by CNN and improvement of
phase-unwrapping by MRF optimization for one-shot 3D scan [0.621405559652172]
Active stereo technique using single pattern projection, a.k.a. one-shot 3D scan, have drawn a wide attention from industry, medical purposes, etc.
One severe drawback of one-shot 3D scan is sparse reconstruction.
We propose a pixel-wise technique for one-shot scan, which is applicable to any types of static pattern if the pattern is regular and periodic.
arXiv Detail & Related papers (2023-09-26T10:45:04Z) - Curvature regularization for Non-line-of-sight Imaging from
Under-sampled Data [5.591221518341613]
Non-line-of-sight (NLOS) imaging aims to reconstruct the three-dimensional hidden scenes from the data measured in the line-of-sight.
We propose novel NLOS reconstruction models based on curvature regularization.
We evaluate the proposed algorithms on both synthetic and real datasets.
arXiv Detail & Related papers (2023-01-01T14:10:43Z) - Compound Batch Normalization for Long-tailed Image Classification [77.42829178064807]
We propose a compound batch normalization method based on a Gaussian mixture.
It can model the feature space more comprehensively and reduce the dominance of head classes.
The proposed method outperforms existing methods on long-tailed image classification.
arXiv Detail & Related papers (2022-12-02T07:31:39Z) - Robust and accurate depth estimation by fusing LiDAR and Stereo [8.85338187686374]
We propose a precision and robust method for fusing the LiDAR and stereo cameras.
This method fully combines the advantages of the LiDAR and stereo camera.
We evaluate the proposed pipeline on the KITTI benchmark.
arXiv Detail & Related papers (2022-07-13T11:55:15Z) - SiPRNet: End-to-End Learning for Single-Shot Phase Retrieval [8.820823270160695]
convolutional neural networks (CNN) have played important roles in various image reconstruction tasks.
In this paper, we design a novel CNN structure, named SiPRNet, to recover a signal from a single Fourier intensity measurement.
The proposed approach consistently outperforms other CNN-based and traditional optimization-based methods in single-shot maskless phase retrieval.
arXiv Detail & Related papers (2022-05-23T16:24:52Z) - Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction [138.04956118993934]
We propose a novel Transformer-based method, coarse-to-fine sparse Transformer (CST)
CST embedding HSI sparsity into deep learning for HSI reconstruction.
In particular, CST uses our proposed spectra-aware screening mechanism (SASM) for coarse patch selecting. Then the selected patches are fed into our customized spectra-aggregation hashing multi-head self-attention (SAH-MSA) for fine pixel clustering and self-similarity capturing.
arXiv Detail & Related papers (2022-03-09T16:17:47Z) - Unsupervised Single Image Super-resolution Under Complex Noise [60.566471567837574]
This paper proposes a model-based unsupervised SISR method to deal with the general SISR task with unknown degradations.
The proposed method can evidently surpass the current state of the art (SotA) method (about 1dB PSNR) not only with a slighter model (0.34M vs. 2.40M) but also faster speed.
arXiv Detail & Related papers (2021-07-02T11:55:40Z) - Bayesian dense inverse searching algorithm for real-time stereo matching
in minimally invasive surgery [1.2074552857379273]
This paper reports a CPU-level real-time stereo matching method for surgical images (10 Hz on 640 * 480 image with a single core of i5-9400)
The proposed method is built on the fast ''dense inverse searching'' algorithm, which estimates the disparity of the stereo images.
Experiments indicate that the estimated depth has higher accuracy and fewer outliers than the baseline methods in the surgical scenario.
arXiv Detail & Related papers (2021-06-14T02:26:27Z) - Displacement-Invariant Cost Computation for Efficient Stereo Matching [122.94051630000934]
Deep learning methods have dominated stereo matching leaderboards by yielding unprecedented disparity accuracy.
But their inference time is typically slow, on the order of seconds for a pair of 540p images.
We propose a emphdisplacement-invariant cost module to compute the matching costs without needing a 4D feature volume.
arXiv Detail & Related papers (2020-12-01T23:58:16Z) - Human Body Model Fitting by Learned Gradient Descent [48.79414884222403]
We propose a novel algorithm for the fitting of 3D human shape to images.
We show that this algorithm is fast (avg. 120ms convergence), robust to dataset, and achieves state-of-the-art results on public evaluation datasets.
arXiv Detail & Related papers (2020-08-19T14:26:47Z) - Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance
Disparity Estimation [51.17232267143098]
We propose a novel system named Disp R-CNN for 3D object detection from stereo images.
We use a statistical shape model to generate dense disparity pseudo-ground-truth without the need of LiDAR point clouds.
Experiments on the KITTI dataset show that, even when LiDAR ground-truth is not available at training time, Disp R-CNN achieves competitive performance and outperforms previous state-of-the-art methods by 20% in terms of average precision.
arXiv Detail & Related papers (2020-04-07T17:48:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.