Related papers: Deep Iterative 2D/3D Registration

URL: http://arxiv.org/abs/2107.10004v1
Date: Wed, 21 Jul 2021 10:51:29 GMT
Title: Deep Iterative 2D/3D Registration
Authors: Srikrishna Jaganathan, Jian Wang, Anja Borsdorf, Karthik Shetty, Andreas Maier
Abstract summary: We propose a novel Deep Learning driven 2D/3D registration framework that can be used end-to-end for iterative registration tasks. We accomplish this by learning the update step of the 2D/3D registration framework using Point-to-Plane Correspondences. Our proposed method achieves an average runtime of around 8s, a mean re-projection distance error of 0.60 $\pm$ 0.40 mm with a success ratio of 97 percent and a capture range of 60 mm.
Score: 9.813316061451392
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep Learning-based 2D/3D registration methods are highly robust but often lack the necessary registration accuracy for clinical application. A refinement step using the classical optimization-based 2D/3D registration method applied in combination with Deep Learning-based techniques can provide the required accuracy. However, it also increases the runtime. In this work, we propose a novel Deep Learning driven 2D/3D registration framework that can be used end-to-end for iterative registration tasks without relying on any further refinement step. We accomplish this by learning the update step of the 2D/3D registration framework using Point-to-Plane Correspondences. The update step is learned using iterative residual refinement-based optical flow estimation, in combination with the Point-to-Plane correspondence solver embedded as a known operator. Our proposed method achieves an average runtime of around 8s, a mean re-projection distance error of 0.60 $\pm$ 0.40 mm with a success ratio of 97 percent and a capture range of 60 mm. The combination of high registration accuracy, high robustness, and fast runtime makes our solution ideal for clinical applications.

Related papers

Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline [64.42938561167402]
We propose an online 3D reconstruction method using 3D Gaussian-based SLAM, combined with a feed-forward recurrent prediction module. This approach replaces slow test-time optimization with fast network inference, significantly improving tracking speed. Our method achieves performance on par with the state-of-the-art SplaTAM, while reducing tracking time by more than 90%.
arXiv Detail & Related papers (2025-08-06T16:16:58Z)
DELTAv2: Accelerating Dense 3D Tracking [79.63990337419514]
We propose a novel algorithm for accelerating dense long-term 3D point tracking in videos. We introduce a coarse-to-fine strategy that begins tracking with a small subset of points and progressively expands the set of tracked trajectories. The newly added trajectories are initialized using a learnable module, which is trained end-to-end alongside the tracking network.
arXiv Detail & Related papers (2025-08-02T03:15:47Z)
Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration [1.8352113484137624]
This paper presents an approach for improving 2D/3D pelvis registration in optimization-based pose estimators. We find that even a coarse initializer greatly improves pose estimator accuracy, and improves overall computational efficiency.
arXiv Detail & Related papers (2025-03-10T18:42:13Z)
DELTA: Dense Efficient Long-range 3D Tracking for any video [82.26753323263009]
We introduce DELTA, a novel method that efficiently tracks every pixel in 3D space, enabling accurate motion estimation across entire videos. Our approach leverages a joint global-local attention mechanism for reduced-resolution tracking, followed by a transformer-based upsampler to achieve high-resolution predictions. Our method provides a robust solution for applications requiring fine-grained, long-term motion tracking in 3D space.
arXiv Detail & Related papers (2024-10-31T17:59:01Z)
Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss [9.287932323337163]
3D particle tracking velocimetry (PTV) is a key technique for analyzing turbulent flow. Deep learning-based methods have achieved impressive accuracy in dual-frame fluid motion estimation. We introduce a new method that is completely self-supervised and notably outperforms its fully-supervised counterparts.
arXiv Detail & Related papers (2024-10-15T18:00:00Z)
DynaWeightPnP: Toward global real-time 3D-2D solver in PnP without correspondences [7.191124861153032]
This paper addresses a special Perspective-n-Point (PnP) problem: estimating the optimal pose to align 3D and 2D shapes in real-time without correspondences. Experiments were conducted on a typical case, that is, a 3D-2D centerline registration task within Endovascular Image-Guided Interventions. Results demonstrated that the proposed algorithm achieves registration processing rates of 60 Hz (without post-refinement) and 31 Hz (with post-refinement) with competitive accuracy comparable to existing methods.
arXiv Detail & Related papers (2024-09-27T05:31:33Z)
P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders [34.64343313442465]
Pre-training in 3D is pivotal for advancing 3D perception tasks. However, the scarcity of clean 3D data poses significant challenges for scaling 3D pre-training efforts. We introduce an innovative self-supervised pre-training framework. Our method achieves state-of-the-art performance in 3D classification, detection, and few-shot learning.
arXiv Detail & Related papers (2024-08-19T13:59:53Z)
3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking [15.330384668966806]
State-of-the-art 3D multi-object tracking (MOT) approaches typically rely on non-learned model-based algorithms such as Kalman Filter. We propose 3DMOTFormer, a learned geometry-based 3D MOT framework building upon the transformer architecture. Our approach achieves 71.2% and 68.2% AMOTA on the nuScenes validation and test split, respectively.
arXiv Detail & Related papers (2023-08-12T19:19:58Z)
UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM [60.575435353047304]
We present an uncertainty learning framework for dense neural simultaneous localization and mapping (SLAM). We propose an online framework for sensor uncertainty estimation that can be trained in a self-supervised manner from only 2D input data.
arXiv Detail & Related papers (2023-06-19T16:26:25Z)
Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation [87.54604263202941]
We propose a tiny deep neural network of which partial layers are iteratively exploited for refining its previous estimations. We employ learned gating criteria to decide whether to exit from the weight-sharing loop, allowing per-sample adaptation in our model. Our method consistently outperforms state-of-the-art 2D/3D hand pose estimation approaches in terms of both accuracy and efficiency for widely used benchmarks.
arXiv Detail & Related papers (2021-11-11T23:31:34Z)
6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping [0.19686770963118383]
Real-time robotic grasping is a priority target for highly advanced autonomous systems. This paper proposes a novel method with a 2-stage approach that combines a fast 2D object recognition using a deep neural network. The proposed solution has the potential to perform robustly in real-time applications that require both efficiency and accuracy.
arXiv Detail & Related papers (2021-11-11T15:36:55Z)
Learning the Update Operator for 2D/3D Image Registration [10.720342813316531]
A preoperative volume can be overlaid on the 2D images using 2D/3D image registration. Deep learning-based 2D/3D registration methods have shown promising results by improving computational efficiency and robustness. We show an improvement of 1.8 times in terms of registration accuracy for the update step prediction compared to learning without the known operator.
arXiv Detail & Related papers (2021-02-04T19:52:59Z)
Human Body Model Fitting by Learned Gradient Descent [48.79414884222403]
We propose a novel algorithm for the fitting of 3D human shape to images. We show that this algorithm is fast (avg. 120ms convergence), robust to dataset, and achieves state-of-the-art results on public evaluation datasets.
arXiv Detail & Related papers (2020-08-19T14:26:47Z)
Learning 3D-3D Correspondences for One-shot Partial-to-partial Registration [66.41922513553367]
We show that learning-based partial-to-partial registration can be achieved in a one-shot manner. We propose an Optimal Transport layer able to account for occluded points thanks to the use of bins. The resulting OPRNet framework outperforms the state of the art on standard benchmarks.
arXiv Detail & Related papers (2020-06-08T12:35:47Z)
3DSSD: Point-based 3D Single Stage Object Detector [61.67928229961813]
We present a point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency. Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well.
arXiv Detail & Related papers (2020-02-24T12:01:58Z)
