Topology-Constrained Learning for Efficient Laparoscopic Liver Landmark Detection
- URL: http://arxiv.org/abs/2507.00519v1
- Date: Tue, 01 Jul 2025 07:35:36 GMT
- Title: Topology-Constrained Learning for Efficient Laparoscopic Liver Landmark Detection
- Authors: Ruize Cui, Jiaan Zhang, Jialun Pei, Kai Wang, Pheng-Ann Heng, Jing Qin,
- Abstract summary: Liver landmarks provide crucial anatomical guidance to the surgeon during laparoscopic liver surgery.<n>TopoNet is a novel topology-constrained learning framework for laparoscopic liver landmark detection.<n>Our framework adopts a snake-CNN dual-path encoder to simultaneously capture detailed RGB texture information and depth-informed topological structures.
- Score: 46.2391319253146
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Liver landmarks provide crucial anatomical guidance to the surgeon during laparoscopic liver surgery to minimize surgical risk. However, the tubular structural properties of landmarks and dynamic intraoperative deformations pose significant challenges for automatic landmark detection. In this study, we introduce TopoNet, a novel topology-constrained learning framework for laparoscopic liver landmark detection. Our framework adopts a snake-CNN dual-path encoder to simultaneously capture detailed RGB texture information and depth-informed topological structures. Meanwhile, we propose a boundary-aware topology fusion (BTF) module, which adaptively merges RGB-D features to enhance edge perception while preserving global topology. Additionally, a topological constraint loss function is embedded, which contains a center-line constraint loss and a topological persistence loss to ensure homotopy equivalence between predictions and labels. Extensive experiments on L3D and P2ILF datasets demonstrate that TopoNet achieves outstanding accuracy and computational complexity, highlighting the potential for clinical applications in laparoscopic liver surgery. Our code will be available at https://github.com/cuiruize/TopoNet.
Related papers
- PIVOTS: Aligning unseen Structures using Preoperative to Intraoperative Volume-To-Surface Registration for Liver Navigation [11.658316634846697]
PIVOTS is a neural network that takes point clouds as input for deformation prediction.<n>We train the neural network on synthetic data simulated from a biomechanical simulation pipeline.<n>Results demonstrate superior registration performance compared to baseline methods.
arXiv Detail & Related papers (2025-07-27T16:01:26Z) - BCRNet: Enhancing Landmark Detection in Laparoscopic Liver Surgery via Bezier Curve Refinement [14.918845671238737]
BCRNet is a novel framework that significantly enhances landmark detection in laparoscopic liver surgery.<n>The framework starts with a Multi-modal Feature Extraction (MFE) module designed to robustly capture semantic features.<n>BCRNet outperforms state-of-the-art methods, achieving significant performance improvements.
arXiv Detail & Related papers (2025-06-18T09:00:08Z) - Myocardial Region-guided Feature Aggregation Net for Automatic Coronary artery Segmentation and Stenosis Assessment using Coronary Computed Tomography Angiography [13.885760158090692]
Myocardial Region-guided Feature Aggregation Net is a novel U-shaped dual-encoder architecture that integrates anatomical prior knowledge to enhance robustness in coronary artery segmentation.<n>Our framework incorporates three key innovations: (1) a Myocardial Region-guided Module that directs attention to coronary regions via bridging expansion and multi-scale feature fusion, (2) a Residual Feature Extraction Module that combines parallel spatial channel attention with residual blocks to enhance local-global feature discrimination, and (3) a Multi-scale Feature Fusion Module for adaptive aggregation of hierarchical vascular features.
arXiv Detail & Related papers (2025-04-27T16:43:52Z) - Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection [50.388465935739376]
Liver registration by overlaying preoperative 3D models onto intraoperative 2D frames can assist surgeons in perceiving the spatial anatomy of the liver clearly for a higher surgical success rate.<n>Existing registration methods rely heavily on anatomical landmark-based, which encounter two major limitations.<n>We propose a landmark-free preoperative-to-intraoperative registration framework utilizing effective self-supervised learning.
arXiv Detail & Related papers (2025-04-21T14:55:57Z) - Geo-UNet: A Geometrically Constrained Neural Framework for Clinical-Grade Lumen Segmentation in Intravascular Ultrasound [7.760705377465734]
Current segmentation networks like the UNet lack the precision needed for clinical adoption in IVUS.
We propose the Geo-UNet framework to address these issues via a design informed by the geometry of the segmentation task.
The efficacy of our framework on a venous IVUS dataset is shown against state-of-the-art models.
arXiv Detail & Related papers (2024-08-09T02:55:25Z) - FS-Net: Full Scale Network and Adaptive Threshold for Improving Extraction of Micro-Retinal Vessel Structures [4.507779218329283]
Segmenting retinal vessels presents unique challenges.<n>Recent neural network approaches struggle to balance local and global properties.<n>We propose a comprehensive micro-vessel extraction mechanism based on an encoder-decoder neural network architecture.
arXiv Detail & Related papers (2023-11-14T10:32:17Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - Fuzzy Attention Neural Network to Tackle Discontinuity in Airway
Segmentation [67.19443246236048]
Airway segmentation is crucial for the examination, diagnosis, and prognosis of lung diseases.
Some small-sized airway branches (e.g., bronchus and terminaloles) significantly aggravate the difficulty of automatic segmentation.
This paper presents an efficient method for airway segmentation, comprising a novel fuzzy attention neural network and a comprehensive loss function.
arXiv Detail & Related papers (2022-09-05T16:38:13Z) - Topological obstructions in neural networks learning [67.8848058842671]
We study global properties of the loss gradient function flow.
We use topological data analysis of the loss function and its Morse complex to relate local behavior along gradient trajectories with global properties of the loss surface.
arXiv Detail & Related papers (2020-12-31T18:53:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.