Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping
- URL: http://arxiv.org/abs/2501.19319v1
- Date: Fri, 31 Jan 2025 17:15:34 GMT
- Title: Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping
- Authors: Yiming Huang, Beilei Cui, Long Bai, Zhen Chen, Jinlin Wu, Zhen Li, Hongbin Liu, Hongliang Ren
- Abstract summary: Endo-2DTAM is a real-time endoscopic SLAM system with 2D Gaussian Splatting (2DGS). Our robust tracking module combines point-to-point and point-to-plane distance metrics. Our mapping module utilizes normal consistency and depth distortion to enhance surface reconstruction quality.
- Score: 12.027762278121052
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract:   Simultaneous Localization and Mapping (SLAM) is essential for precise surgical interventions and robotic tasks in minimally invasive procedures. While recent advancements in 3D Gaussian Splatting (3DGS) have improved SLAM with high-quality novel view synthesis and fast rendering, these systems struggle with accurate depth and surface reconstruction due to multi-view inconsistencies. Simply incorporating SLAM and 3DGS leads to mismatches between the reconstructed frames. In this work, we present Endo-2DTAM, a real-time endoscopic SLAM system with 2D Gaussian Splatting (2DGS) to address these challenges. Endo-2DTAM incorporates a surface normal-aware pipeline, which consists of tracking, mapping, and bundle adjustment modules for geometrically accurate reconstruction. Our robust tracking module combines point-to-point and point-to-plane distance metrics, while the mapping module utilizes normal consistency and depth distortion to enhance surface reconstruction quality. We also introduce a pose-consistent strategy for efficient and geometrically coherent keyframe sampling. Extensive experiments on public endoscopic datasets demonstrate that Endo-2DTAM achieves an RMSE of $1.87\pm 0.63$ mm for depth reconstruction of surgical scenes while maintaining computationally efficient tracking, high-quality visual appearance, and real-time rendering. Our code will be released at github.com/lastbasket/Endo-2DTAM. 
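
The abstract says the tracking module combines point-to-point and point-to-plane distance metrics. The following is a minimal NumPy sketch of what such a combination could look like, not the authors' implementation. It assumes the point correspondences and target surface normals are already given, and the function names and the weights `w_point` and `w_plane` are hypothetical, since this listing does not give the paper's actual loss formulation.

```python
import numpy as np


def point_to_point(src, dst):
    """Mean squared Euclidean distance between matched points, each of shape (N, 3)."""
    diff = src - dst
    return float(np.mean(np.sum(diff * diff, axis=1)))


def point_to_plane(src, dst, dst_normals):
    """Mean squared distance from each source point to the tangent plane
    at its matched target point. dst_normals holds unit normals, shape (N, 3)."""
    diff = src - dst
    signed = np.sum(diff * dst_normals, axis=1)  # projection onto the normal
    return float(np.mean(signed ** 2))


def combined_tracking_residual(src, dst, dst_normals, w_point=1.0, w_plane=1.0):
    """Weighted sum of both metrics (the weights are illustrative assumptions)."""
    return (w_point * point_to_point(src, dst)
            + w_plane * point_to_plane(src, dst, dst_normals))


if __name__ == "__main__":
    # Three target points on the z = 0 plane with normals pointing along +z.
    dst = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
    normals = np.tile([0.0, 0.0, 1.0], (3, 1))

    slide = dst + np.array([0.3, 0.0, 0.0])  # motion within the surface
    lift = dst + np.array([0.0, 0.0, 0.2])   # motion off the surface

    print("slide:", point_to_point(slide, dst), point_to_plane(slide, dst, normals))
    print("lift: ", point_to_point(lift, dst), point_to_plane(lift, dst, normals))
```

In this toy case, sliding along the surface is penalized only by the point-to-point term, while lifting off the surface is penalized by both. That difference is the usual motivation for pairing the two metrics when surface normals are available.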
 
      
Related papers
        - Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline [64.42938561167402]
 We propose an online 3D reconstruction method using 3D Gaussian-based SLAM, combined with a feed-forward recurrent prediction module.
This approach replaces slow test-time optimization with fast network inference, significantly improving tracking speed.
Our method achieves performance on par with the state-of-the-art SplaTAM, while reducing tracking time by more than 90%.
 arXiv  Detail & Related papers  (2025-08-06T16:16:58Z)
- EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting [7.7956059927002705]
 We introduce optical flow loss as a geometric constraint, which effectively constrains both the 3D structure of the scene and the camera motion.
In addition, to improve scene representation in the SLAM system, we improve the 3DGS refinement strategy by focusing on viewpoints corresponding to Keyframes.
Our method outperforms existing state-of-the-art methods in novel view synthesis and pose estimation.
 arXiv  Detail & Related papers  (2025-06-26T16:06:46Z)
- GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field [18.520468059548865]
 GSFF-SLAM is a novel dense semantic SLAM system based on 3D Gaussian Splatting.
Our method supports semantic reconstruction using various forms of 2D priors, particularly sparse and noisy signals.
When utilizing 2D ground truth priors, GSFF-SLAM achieves state-of-the-art semantic segmentation performance with 95.03% mIoU.
 arXiv  Detail & Related papers  (2025-04-28T01:21:35Z)
- T-3DGS: Removing Transient Objects for 3D Scene Reconstruction [83.05271859398779]
 Transient objects in video sequences can significantly degrade the quality of 3D scene reconstructions.
We propose T-3DGS, a novel framework that robustly filters out transient distractors during 3D reconstruction using Gaussian Splatting.
 arXiv  Detail & Related papers  (2024-11-29T07:45:24Z)
- GausSurf: Geometry-Guided 3D Gaussian Splatting for Surface Reconstruction [79.42244344704154]
 GausSurf employs geometry guidance from multi-view consistency in texture-rich areas and normal priors in texture-less areas of a scene.
Our method surpasses state-of-the-art methods in terms of reconstruction quality and computation time.
 arXiv  Detail & Related papers  (2024-11-29T03:54:54Z)
- GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering [83.69136534797686]
 We present GUS-IR, a novel framework designed to address the inverse rendering problem for complicated scenes featuring rough and glossy surfaces.
This paper starts by analyzing and comparing two prominent shading techniques popularly used for inverse rendering, forward shading and deferred shading.
We propose a unified shading solution that combines the advantages of both techniques for better decomposition.
 arXiv  Detail & Related papers  (2024-11-12T01:51:05Z)
- SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction [18.074890506856114]
 We present SurgicalGS, a dynamic 3D Gaussian Splatting framework specifically designed for surgical scene reconstruction with improved geometric accuracy.
Our approach first initialises a Gaussian point cloud using depth priors, employing binary motion masks to identify pixels with significant depth variations and fusing point clouds from depth maps across frames for initialisation.
We use the Flexible Deformation Model to represent the dynamic scene and introduce a normalised depth regularisation loss along with an unsupervised depth smoothness constraint to ensure more accurate geometric reconstruction.
 arXiv  Detail & Related papers  (2024-10-11T22:46:46Z)
- GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization [1.4466437171584356]
 We propose a two-stage procedure that integrates dense and robust keypoint descriptors from the lightweight XFeat feature extractor into 3DGS.
In the second stage, the initial pose estimate is refined by minimizing the rendering-based photometric warp loss.
 Benchmarking on widely used indoor and outdoor datasets demonstrates improvements over recent neural rendering-based localization methods.
 arXiv  Detail & Related papers  (2024-09-24T23:18:32Z)
- Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis [11.236094544193605]
 Conventional geometry-based SLAM systems lack dense 3D reconstruction capabilities.
We propose a real-time RGB-D SLAM system that incorporates a novel view synthesis technique, 3D Gaussian Splatting.
 arXiv  Detail & Related papers  (2024-08-10T21:23:08Z)
- SMORE: Simultaneous Map and Object REconstruction [66.66729715211642]
 We present a method for dynamic surface reconstruction of large-scale urban scenes from LiDAR.
We take a holistic perspective and optimize a compositional model of a dynamic scene that decomposes the world into rigidly-moving objects and the background.
 arXiv  Detail & Related papers  (2024-06-19T23:53:31Z)
- MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo [54.00987996368157]
 We present MVSGaussian, a new generalizable 3D Gaussian representation approach derived from Multi-View Stereo (MVS).
MVSGaussian achieves real-time rendering with better synthesis quality for each scene.
 arXiv  Detail & Related papers  (2024-05-20T17:59:30Z)
- MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements [59.70107451308687]
 We show for the first time that using 3D Gaussians for map representation with unposed camera images and inertial measurements can enable accurate SLAM.
Our method, MM3DGS, addresses the limitations of prior neural radiance field-based representations by enabling faster rendering, scale awareness, and improved trajectory tracking.
We also release a multi-modal dataset, UT-MM, collected from a mobile robot equipped with a camera and an inertial measurement unit.
 arXiv  Detail & Related papers  (2024-04-01T04:57:41Z)
- 2D Gaussian Splatting for Geometrically Accurate Radiance Fields [50.056790168812114]
 3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking.
We present 2D Gaussian Splatting (2DGS), a novel approach to model and reconstruct geometrically accurate radiance fields from multi-view images.
We demonstrate that our differentiable renderer allows for noise-free and detailed geometry reconstruction while maintaining competitive appearance quality, fast training speed, and real-time rendering.
 arXiv  Detail & Related papers  (2024-03-26T17:21:24Z)
- Gaussian Splatting SLAM [16.3858380078553]
 We present the first application of 3D Gaussian Splatting in monocular SLAM.
Our method runs live at 3fps, unifying the required representation for accurate tracking, mapping, and high-quality rendering.
Several innovations are required to continuously reconstruct 3D scenes with high fidelity from a live camera.
 arXiv  Detail & Related papers  (2023-12-11T18:19:04Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     