Related papers: A Fourier-enhanced multi-modal 3D small object optical mark recognition and positioning method for percutaneous abdominal puncture surgical navigation

A Fourier-enhanced multi-modal 3D small object optical mark recognition and positioning method for percutaneous abdominal puncture surgical navigation

URL: http://arxiv.org/abs/2404.08990v1
Date: Sat, 13 Apr 2024 12:28:40 GMT
Title: A Fourier-enhanced multi-modal 3D small object optical mark recognition and positioning method for percutaneous abdominal puncture surgical navigation
Authors: Zezhao Guo, Yanzhong Guo, Zhanfang Zhao,
Abstract summary: This paper proposes a muti-modal 3D small object medical marker detection method, which identifies the center of a small single ring as the needle insertion point. The experimental results show this novel method achieves high-precision and high-stability positioning.
Score: 0.27309692684728604
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Navigation for thoracoabdominal puncture surgery is used to locate the needle entry point on the patient's body surface. The traditional reflective ball navigation method is difficult to position the needle entry point on the soft, irregular, smooth chest and abdomen. Due to the lack of clear characteristic points on the body surface using structured light technology, it is difficult to identify and locate arbitrary needle insertion points. Based on the high stability and high accuracy requirements of surgical navigation, this paper proposed a novel method, a muti-modal 3D small object medical marker detection method, which identifies the center of a small single ring as the needle insertion point. Moreover, this novel method leverages Fourier transform enhancement technology to augment the dataset, enrich image details, and enhance the network's capability. The method extracts the Region of Interest (ROI) of the feature image from both enhanced and original images, followed by generating a mask map. Subsequently, the point cloud of the ROI from the depth map is obtained through the registration of ROI point cloud contour fitting. In addition, this method employs Tukey loss for optimal precision. The experimental results show this novel method proposed in this paper not only achieves high-precision and high-stability positioning, but also enables the positioning of any needle insertion point.

Related papers

Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques [91.26187560114381]
Feature matching is a cornerstone task in computer vision, essential for applications such as image retrieval, stereo matching, 3D reconstruction, and SLAM.<n>This survey comprehensively reviews modality-based feature matching, exploring traditional handcrafted methods and contemporary deep learning approaches.
arXiv Detail & Related papers (2025-07-30T15:56:36Z)
BCRNet: Enhancing Landmark Detection in Laparoscopic Liver Surgery via Bezier Curve Refinement [14.918845671238737]
BCRNet is a novel framework that significantly enhances landmark detection in laparoscopic liver surgery.<n>The framework starts with a Multi-modal Feature Extraction (MFE) module designed to robustly capture semantic features.<n>BCRNet outperforms state-of-the-art methods, achieving significant performance improvements.
arXiv Detail & Related papers (2025-06-18T09:00:08Z)
High-precision surgical navigation using speckle structured light-based thoracoabdominal puncture robot [0.27309692684728604]
During puncture robotic surgical navigation, the needle insertion point is positioned on the patient's chest and abdomen body surface. Traditional reflective ball tracking method is difficult to apply. This paper designs and experiments a method that is different from previous reflective ball optical markers. It is based on a speckle structured light camera to identify the patient's body surface and fit it into a hollow ring with a diameter of 24mm.
arXiv Detail & Related papers (2024-05-06T08:59:51Z)
EyeLS: Shadow-Guided Instrument Landing System for Intraocular Target Approaching in Robotic Eye Surgery [51.05595735405451]
Robotic ophthalmic surgery is an emerging technology to facilitate high-precision interventions such as retina penetration in subretinal injection and removal of floating tissues in retinal detachment. Current image-based methods cannot effectively estimate the needle tip's trajectory towards both retinal and floating targets. We propose to use the shadow positions of the target and the instrument tip to estimate their relative depth position. Our method succeeds target approaching on a retina model, and achieves an average depth error of 0.0127 mm and 0.3473 mm for floating and retinal targets respectively in the surgical simulator.
arXiv Detail & Related papers (2023-11-15T09:11:37Z)
Deep learning network to correct axial and coronal eye motion in 3D OCT retinal imaging [65.47834983591957]
We propose deep learning based neural networks to correct axial and coronal motion artifacts in OCT based on a single scan. The experimental result shows that the proposed method can effectively correct motion artifacts and achieve smaller error than other methods.
arXiv Detail & Related papers (2023-05-27T03:55:19Z)
Robust Landmark-based Stent Tracking in X-ray Fluoroscopy [10.917460255497227]
We propose an end-to-end deep learning framework for single stent tracking. It consists of three hierarchical modules: U-Net based landmark detection, ResNet based stent proposal and feature extraction. Experiments show that our method performs significantly better in detection compared with the state-of-the-art point-based tracking models.
arXiv Detail & Related papers (2022-07-20T14:20:03Z)
Comparison of Depth Estimation Setups from Stereo Endoscopy and Optical Tracking for Point Measurements [1.1084983279967584]
To support minimally-invasive mitral valve repair, quantitative measurements from the valve can be obtained using an infra-red tracked stylus. Hand-eye calibration is required that links both coordinate systems and is a prerequisite to project the points onto the image plane. A complementary approach to this is to use a vision-based endoscopic stereo-setup to detect and triangulate points of interest, to obtain the 3D coordinates. Preliminary results indicate that 3D landmark estimation, either labeled manually or through partly automated detection with a deep learning approach, provides more accurate triangulated depth measurements when performed with a tailored image-based method than
arXiv Detail & Related papers (2022-01-26T10:15:46Z)
SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA) Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling. In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z)
Attention Aware Wavelet-based Detection of Morphed Face Images [18.22557507385582]
We propose a wavelet-based morph detection methodology which adopts an end-to-end trainable soft attention mechanism. We evaluate performance of the proposed framework using three datasets, VISAPP17, LMA, and MorGAN.
arXiv Detail & Related papers (2021-06-29T19:29:19Z)
Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices. With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset. The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z)
Tattoo tomography: Freehand 3D photoacoustic image reconstruction with an optical pattern [49.240017254888336]
Photoacoustic tomography (PAT) is a novel imaging technique that can resolve both morphological and functional tissue properties. A current drawback is the limited field-of-view provided by the conventionally applied 2D probes. We present a novel approach to 3D reconstruction of PAT data that does not require an external tracking system.
arXiv Detail & Related papers (2020-11-10T09:27:56Z)
Weakly-supervised Learning For Catheter Segmentation in 3D Frustum Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method. The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z)
Reconstruction and Quantification of 3D Iris Surface for Angle-Closure Glaucoma Detection in Anterior Segment OCT [42.797124360552715]
We propose a novel framework for reconstruction and quantification of 3D iris surface from AS- OCT imagery. We consider it to be the first work to detect angle-closure glaucoma by means of 3D representation. We show that 3D-based representation achieves better performance in angle-closure glaucoma detection than does 2D-based feature.
arXiv Detail & Related papers (2020-06-09T10:56:50Z)

This list is automatically generated from the titles and abstracts of the papers in this site.