Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network
- URL: http://arxiv.org/abs/2111.04733v1
- Date: Mon, 8 Nov 2021 07:57:30 GMT
- Title: Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network
- Authors: Jiacheng Wang, Yueming Jin, Shuntian Cai, Hongzhi Xu, Pheng-Ann Heng,
Jing Qin, Liansheng Wang
- Abstract summary: We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
- Score: 51.44506007844284
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a novel shape-aware relation network for accurate and real-time
landmark detection in endoscopic submucosal dissection (ESD) surgery. This task
is of great clinical significance but extremely challenging due to bleeding,
lighting reflection, and motion blur in the complicated surgical environment.
Compared with existing solutions, which either neglect geometric relationships
among targeting objects or capture the relationships by using complicated
aggregation schemes, the proposed network is capable of achieving satisfactory
accuracy while maintaining real-time performance by taking full advantage of
the spatial relations among landmarks. We first devise an algorithm to
automatically generate relation keypoint heatmaps, which are able to
intuitively represent the prior knowledge of spatial relations among landmarks
without using any extra manual annotation efforts. We then develop two
complementary regularization schemes to progressively incorporate the prior
knowledge into the training process. While one scheme introduces pixel-level
regularization by multi-task learning, the other integrates global-level
regularization by harnessing a newly designed grouped consistency evaluator,
which adds relation constraints to the proposed network in an adversarial
manner. Both schemes are beneficial to the model in training, and can be
readily unloaded in inference to achieve real-time detection. We establish a
large in-house dataset of ESD surgery for esophageal cancer to validate the
effectiveness of our proposed method. Extensive experimental results
demonstrate that our approach outperforms state-of-the-art methods in terms of
accuracy and efficiency, achieving better detection results faster. Promising
results on two downstream applications further corroborate the great potential
of our method in ESD clinical practice.
Related papers
- AD-Net: Attention-based dilated convolutional residual network with guided decoder for robust skin lesion segmentation [0.0]
This research presents a robust approach utilizing a dilated convolutional residual network.
It incorporates an attention-based spatial feature enhancement block (ASFEB) and employs a guided decoder strategy.
The effectiveness of the proposed AD-Net was evaluated using four public benchmark datasets.
arXiv Detail & Related papers (2024-09-09T08:21:17Z) - Augmentation is AUtO-Net: Augmentation-Driven Contrastive Multiview
Learning for Medical Image Segmentation [3.1002416427168304]
This thesis focuses on retinal blood vessel segmentation tasks.
It provides an extensive literature review of deep learning-based medical image segmentation approaches.
It proposes a novel efficient, simple multiview learning framework.
arXiv Detail & Related papers (2023-11-02T06:31:08Z) - PREM: A Simple Yet Effective Approach for Node-Level Graph Anomaly
Detection [65.24854366973794]
Node-level graph anomaly detection (GAD) plays a critical role in identifying anomalous nodes from graph-structured data in domains such as medicine, social networks, and e-commerce.
We introduce a simple method termed PREprocessing and Matching (PREM for short) to improve the efficiency of GAD.
Our approach streamlines GAD, reducing time and memory consumption while maintaining powerful anomaly detection capabilities.
arXiv Detail & Related papers (2023-10-18T02:59:57Z) - Phase-Specific Augmented Reality Guidance for Microscopic Cataract
Surgery Using Long-Short Spatiotemporal Aggregation Transformer [14.568834378003707]
Phaemulsification cataract surgery (PCS) is a routine procedure using a surgical microscope.
PCS guidance systems extract valuable information from surgical microscopic videos to enhance proficiency.
Existing PCS guidance systems suffer from non-phasespecific guidance, leading to redundant visual information.
We propose a novel phase-specific augmented reality (AR) guidance system, which offers tailored AR information corresponding to the recognized surgical phase.
arXiv Detail & Related papers (2023-09-11T02:56:56Z) - Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via
Optimization Trajectory Distillation [73.83178465971552]
The success of automated medical image analysis depends on large-scale and expert-annotated training sets.
Unsupervised domain adaptation (UDA) has been raised as a promising approach to alleviate the burden of labeled data collection.
We propose optimization trajectory distillation, a unified approach to address the two technical challenges from a new perspective.
arXiv Detail & Related papers (2023-07-27T08:58:05Z) - ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement [80.94378602238432]
We propose an efficient structure named Correspondence Efficient Transformer (ECO-TR) by finding correspondences in a coarse-to-fine manner.
To achieve this, multiple transformer blocks are stage-wisely connected to gradually refine the predicted coordinates.
Experiments on various sparse and dense matching tasks demonstrate the superiority of our method in both efficiency and effectiveness against existing state-of-the-arts.
arXiv Detail & Related papers (2022-09-25T13:05:33Z) - Learn Fine-grained Adaptive Loss for Multiple Anatomical Landmark
Detection in Medical Images [15.7026400415269]
We propose a novel learning-to-learn framework for landmark detection.
Our proposed framework is general and shows the potential to improve the efficiency of anatomical landmark detection.
arXiv Detail & Related papers (2021-05-19T13:39:18Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - A Coupled Manifold Optimization Framework to Jointly Model the
Functional Connectomics and Behavioral Data Spaces [5.382679710017696]
We propose a coupled manifold optimization framework which projects fMRI data onto a low dimensional matrix manifold common to the cohort.
The patient specific loadings simultaneously map onto a behavioral measure of interest via a second, non-linear, manifold.
We validate our framework on resting state fMRI from fifty-eight patients with Autism Spectrum Disorder.
arXiv Detail & Related papers (2020-07-03T20:12:51Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.