Improved Heatmap-based Landmark Detection
- URL: http://arxiv.org/abs/2110.05676v1
- Date: Tue, 12 Oct 2021 01:41:01 GMT
- Title: Improved Heatmap-based Landmark Detection
- Authors: Huifeng Yao, Ziyu Guo, Yatao Zhang, Xiaomeng Li
- Abstract summary: This paper proposes a landmark detection network for detecting sutures in endoscopic pictures.
It solves the problem of a variable number of suture points in the images.
This paper performed the tests using a simulated dataset of 2708 photos and a real dataset of 2376 images.
- Score: 9.0766494606964
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Mitral valve repair is a very difficult operation, often requiring
experienced surgeons. The doctor will insert a prosthetic ring to aid in the
restoration of heart function. The location of the prosthesis' sutures is
critical. Obtaining and studying them during the procedure is a valuable
learning experience for new surgeons. This paper proposes a landmark detection
network for detecting sutures in endoscopic pictures, which solves the problem
of a variable number of suture points in the images. Because there are two
datasets, one from the simulated domain and the other from real intraoperative
data, this work uses cycleGAN to interconvert the images from the two domains
to obtain a larger dataset and a better score on real intraoperative data. This
paper performed the tests using a simulated dataset of 2708 photos and a real
dataset of 2376 images. The mean sensitivity on the simulated dataset is about
75.64% and the precision is about 73.62%. The mean sensitivity on the real
dataset is about 50.23% and the precision is about 62.76%. The data is from the
AdaptOR MICCAI Challenge 2021, which can be found at
https://zenodo.org/record/4646979\#.YO1zLUxCQ2x.
Related papers
- ISLES 2024: The first longitudinal multimodal multi-center real-world dataset in (sub-)acute stroke [2.7919032539697444]
Stroke remains a leading cause of global morbidity and mortality, placing a heavy socioeconomic burden.
To develop machine learning algorithms that can extract meaningful and reproducible models of brain function from stroke images.
Our dataset is the first to offer comprehensive longitudinal stroke data, including acute CT imaging with angiography and perfusion, follow-up MRI at 2-9 days, and acute and longitudinal clinical data up to a three-month outcome.
arXiv Detail & Related papers (2024-08-20T18:59:52Z) - CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images.
The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism.
We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z) - Cross-Dataset Adaptation for Instrument Classification in Cataract
Surgery Videos [54.1843419649895]
State-of-the-art models, which perform this task well on a particular dataset, perform poorly when tested on another dataset.
We propose a novel end-to-end Unsupervised Domain Adaptation (UDA) method called the Barlow Adaptor.
In addition, we introduce a novel loss called the Barlow Feature Alignment Loss (BFAL) which aligns features across different domains.
arXiv Detail & Related papers (2023-07-31T18:14:18Z) - Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive
Cancer Surgery [6.0097646269887965]
In surgical oncology, it is challenging for surgeons to identify lymph nodes and completely resect cancer.
A novel tethered laparoscopic gamma detector is used to localize a preoperatively injected radiotracer.
Gamma activity visualization is challenging to present to the operator because the probe is non-imaging and it does not visibly indicate the activity on the tissue surface.
arXiv Detail & Related papers (2023-07-07T15:33:49Z) - Semantic segmentation of surgical hyperspectral images under geometric
domain shifts [69.91792194237212]
We present the first analysis of state-of-the-art semantic segmentation networks in the presence of geometric out-of-distribution (OOD) data.
We also address generalizability with a dedicated augmentation technique termed "Organ Transplantation"
Our scheme improves on the SOA DSC by up to 67 % (RGB) and 90 % (HSI) and renders performance on par with in-distribution performance on real OOD test data.
arXiv Detail & Related papers (2023-03-20T09:50:07Z) - Recurrence With Correlation Network for Medical Image Registration [66.63200823918429]
We present Recurrence with Correlation Network (RWCNet), a medical image registration network with multi-scale features and a cost volume layer.
We demonstrate that these architectural features improve medical image registration accuracy in two image registration datasets.
arXiv Detail & Related papers (2023-02-05T02:41:46Z) - Data-Efficient Vision Transformers for Multi-Label Disease
Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images.
ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present.
Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z) - Point detection through multi-instance deep heatmap regression for
sutures in endoscopy [0.8937790536664091]
We formulate the suture detection task as a multi-instance deep heatmap regression problem.
We introduce the novel use of a 2D Gaussian layer followed by a differentiable 2D spatial Soft-Argmax layer to function as a local non-maximum suppression.
The proposed model shows an improvement over the baseline in the intra-operative and the simulator domains.
arXiv Detail & Related papers (2021-11-16T13:45:23Z) - Mutually improved endoscopic image synthesis and landmark detection in
unpaired image-to-image translation [0.9322743017642274]
The CycleGAN framework allows for unsupervised image-to-image translation of unpaired data.
In a scenario of surgical training on a physical surgical simulator, this method can be used to transform endoscopic images of phantoms into images which more closely resemble the intra-operative appearance of the same surgical target structure.
We show that a task defined on sparse landmark labels improves consistency of synthesis by the generator network in both domains.
arXiv Detail & Related papers (2021-07-14T19:09:50Z) - Y-Net for Chest X-Ray Preprocessing: Simultaneous Classification of
Geometry and Segmentation of Annotations [70.0118756144807]
This work introduces a general pre-processing step for chest x-ray input into machine learning algorithms.
A modified Y-Net architecture based on the VGG11 encoder is used to simultaneously learn geometric orientation and segmentation of radiographs.
Results were evaluated by expert clinicians, with acceptable geometry in 95.8% and annotation mask in 96.2%, compared to 27.0% and 34.9% respectively in control images.
arXiv Detail & Related papers (2020-05-08T02:16:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.