Searching for Efficient Architecture for Instrument Segmentation in
Robotic Surgery
- URL: http://arxiv.org/abs/2007.04449v1
- Date: Wed, 8 Jul 2020 21:38:29 GMT
- Title: Searching for Efficient Architecture for Instrument Segmentation in
Robotic Surgery
- Authors: Daniil Pakhomov, Nassir Navab
- Abstract summary: Most applications rely on accurate real-time segmentation of high-resolution surgical images.
We design a light-weight and highly-efficient deep residual architecture which is tuned to perform real-time inference of high-resolution images.
- Score: 58.63306322525082
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Segmentation of surgical instruments is an important problem in
robot-assisted surgery: it is a crucial step towards full instrument pose
estimation and is directly used for masking of augmented reality overlays
during surgical procedures. Most applications rely on accurate real-time
segmentation of high-resolution surgical images. While previous research
focused primarily on methods that deliver high accuracy segmentation masks,
majority of them can not be used for real-time applications due to their
computational cost. In this work, we design a light-weight and highly-efficient
deep residual architecture which is tuned to perform real-time inference of
high-resolution images. To account for reduced accuracy of the discovered
light-weight deep residual network and avoid adding any additional
computational burden, we perform a differentiable search over dilation rates
for residual units of our network. We test our discovered architecture on the
EndoVis 2017 Robotic Instruments dataset and verify that our model is the
state-of-the-art in terms of speed and accuracy tradeoff with a speed of up to
125 FPS on high resolution images.
Related papers
- SEGSRNet for Stereo-Endoscopic Image Super-Resolution and Surgical Instrument Segmentation [0.5461938536945723]
SEGSRNet addresses the challenge of precisely identifying surgical instruments in low-resolution stereo endoscopic images.
Our innovative framework enhances image clarity and segmentation accuracy by applying state-of-the-art super-resolution techniques before segmentation.
arXiv Detail & Related papers (2024-04-20T09:27:05Z) - CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images.
The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism.
We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z) - An Automated Real-Time Approach for Image Processing and Segmentation of Fluoroscopic Images and Videos Using a Single Deep Learning Network [2.752817022620644]
The potential of using machine learning for image segmentation in total knee lies in its ability to improve segmentation accuracy, automate the process, and provide real-time assistance to surgeons.
This paper proposes a methodology to use deep learning for robust real-time total knee image segmentation.
The deep learning model, trained on a large dataset, demonstrates outstanding performance in accurately segmenting both the implanted femur and tibia.
arXiv Detail & Related papers (2024-01-23T05:00:02Z) - Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane [58.871015937204255]
We introduce Fast Orthogonal Plane (plane) for the reconstruction of deformable tissues.
We conceptualize surgical procedures as 4D volumes, and break them down into static and dynamic fields comprised of neural planes.
This factorization iscretizes four-dimensional space, leading to a decreased memory usage and faster optimization.
arXiv Detail & Related papers (2023-12-23T13:27:50Z) - Sculpting Efficiency: Pruning Medical Imaging Models for On-Device
Inference [13.403419873964422]
We highlight the excess operational complexity in a suboptimally configured ML model from prior work.
Our results show a compression rate of 1148x with minimal loss in quality.
We consider avenues for future research in streamlining for clinical researchers to develop models quicker and better suited for real-world use.
arXiv Detail & Related papers (2023-09-10T17:34:14Z) - LDMRes-Net: Enabling Efficient Medical Image Segmentation on IoT and
Edge Platforms [9.626726110488386]
We propose a lightweight dual-multiscale residual block-based computational neural network tailored for medical image segmentation on IoT and edge platforms.
LDMRes-Net overcomes limitations with its remarkably low number of learnable parameters (0.072M), making it highly suitable for resource-constrained devices.
arXiv Detail & Related papers (2023-06-09T10:34:18Z) - Neural LerPlane Representations for Fast 4D Reconstruction of Deformable
Tissues [52.886545681833596]
LerPlane is a novel method for fast and accurate reconstruction of surgical scenes under a single-viewpoint setting.
LerPlane treats surgical procedures as 4D volumes and factorizes them into explicit 2D planes of static and dynamic fields.
LerPlane shares static fields, significantly reducing the workload of dynamic tissue modeling.
arXiv Detail & Related papers (2023-05-31T14:38:35Z) - A parameter refinement method for Ptychography based on Deep Learning
concepts [55.41644538483948]
coarse parametrisation in propagation distance, position errors and partial coherence frequently menaces the experiment viability.
A modern Deep Learning framework is used to correct autonomously the setup incoherences, thus improving the quality of a ptychography reconstruction.
We tested our system on both synthetic datasets and also on real data acquired at the TwinMic beamline of the Elettra synchrotron facility.
arXiv Detail & Related papers (2021-05-18T10:15:17Z) - Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery [10.562627972607892]
We show that it may be possible to use robot kinematic data coupled with laparoscopic images to alleviate the labelling problem.
We propose a new deep learning based model for parallel processing of both laparoscopic and simulation images.
arXiv Detail & Related papers (2020-07-17T16:33:33Z) - Real-time Semantic Segmentation with Fast Attention [94.88466483540692]
We propose a novel architecture for semantic segmentation of high-resolution images and videos in real-time.
The proposed architecture relies on our fast spatial attention, which is a simple yet efficient modification of the popular self-attention mechanism.
We show that results on multiple datasets demonstrate superior performance with better accuracy and speed compared to existing approaches.
arXiv Detail & Related papers (2020-07-07T22:37:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.