End-to-End Real-time Catheter Segmentation with Optical Flow-Guided
Warping during Endovascular Intervention
- URL: http://arxiv.org/abs/2006.09117v1
- Date: Tue, 16 Jun 2020 12:53:27 GMT
- Title: End-to-End Real-time Catheter Segmentation with Optical Flow-Guided
Warping during Endovascular Intervention
- Authors: Anh Nguyen, Dennis Kundrat, Giulio Dagnino, Wenqiang Chi, Mohamed E.
M. K. Abdelaziz, Yao Guo, YingLiang Ma, Trevor M. Y. Kwok, Celia Riga, and
Guang-Zhong Yang
- Abstract summary: We present FW-Net, an end-to-end and real-time deep learning framework for endovascular intervention.
We show that by effectively learning temporal continuity, the network can successfully segment and track the catheters in real-time sequences using only raw ground-truth for training.
- Score: 26.467626509096043
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate real-time catheter segmentation is an important pre-requisite for
robot-assisted endovascular intervention. Most of the existing learning-based
methods for catheter segmentation and tracking are only trained on small-scale
datasets or synthetic data due to the difficulties of ground-truth annotation.
Furthermore, the temporal continuity in intraoperative imaging sequences is not
fully utilised. In this paper, we present FW-Net, an end-to-end and real-time
deep learning framework for endovascular intervention. The proposed FW-Net has
three modules: a segmentation network with encoder-decoder architecture, a flow
network to extract optical flow information, and a novel flow-guided warping
function to learn the frame-to-frame temporal continuity. We show that by
effectively learning temporal continuity, the network can successfully segment
and track the catheters in real-time sequences using only raw ground-truth for
training. Detailed validation results confirm that our FW-Net outperforms
state-of-the-art techniques while achieving real-time performance.
Related papers
- Real-time guidewire tracking and segmentation in intraoperative x-ray [52.51797358201872]
We propose a two-stage deep learning framework for real-time guidewire segmentation and tracking.
In the first stage, a Yolov5 detector is trained, using the original X-ray images as well as synthetic ones, to output the bounding boxes of possible target guidewires.
In the second stage, a novel and efficient network is proposed to segment the guidewire in each detected bounding box.
arXiv Detail & Related papers (2024-04-12T20:39:19Z) - CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images.
The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism.
We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z) - Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation [1.3444601218847545]
The nnU-Net framework excelled in semantic segmentation analyzing single frames without temporal information.
Optical flow (OF) is a tool commonly used in video tasks to estimate motion and represent it in a single frame, containing temporal information.
This work seeks to employ OF maps as an additional input to the nnU-Net architecture to improve its performance in the surgical instrument segmentation task.
arXiv Detail & Related papers (2024-03-15T11:36:26Z) - Leveraging Frequency Domain Learning in 3D Vessel Segmentation [50.54833091336862]
In this study, we leverage Fourier domain learning as a substitute for multi-scale convolutional kernels in 3D hierarchical segmentation models.
We show that our novel network achieves remarkable dice performance (84.37% on ASACA500 and 80.32% on ImageCAS) in tubular vessel segmentation tasks.
arXiv Detail & Related papers (2024-01-11T19:07:58Z) - Abstract Flow for Temporal Semantic Segmentation on the Permutohedral
Lattice [27.37701107719647]
We extend a backbone LatticeNet to process temporal point cloud data.
We propose a new module called Abstract Flow which allows the network to match parts of the scene with similar abstract features.
We obtain state-of-the-art results on the Semantic KITTI dataset that contains LiDAR scans from real urban environments.
arXiv Detail & Related papers (2022-03-29T12:14:31Z) - Temporally Constrained Neural Networks (TCNN): A framework for
semi-supervised video semantic segmentation [5.0754434714665715]
We present Temporally Constrained Neural Networks (TCNN), a semi-supervised framework used for video semantic segmentation of surgical videos.
In this work, we show that autoencoder networks can be used to efficiently provide both spatial and temporal supervisory signals.
We demonstrate that lower-dimensional representations of predicted masks can be leveraged to provide a consistent improvement on both sparsely labeled datasets.
arXiv Detail & Related papers (2021-12-27T18:06:12Z) - Efficient Global-Local Memory for Real-time Instrument Segmentation of
Robotic Surgical Video [53.14186293442669]
We identify two important clues for surgical instrument perception, including local temporal dependency from adjacent frames and global semantic correlation in long-range duration.
We propose a novel dual-memory network (DMNet) to relate both global and local-temporal knowledge.
Our method largely outperforms the state-of-the-art works on segmentation accuracy while maintaining a real-time speed.
arXiv Detail & Related papers (2021-09-28T10:10:14Z) - Simulation-to-Real domain adaptation with teacher-student learning for
endoscopic instrument segmentation [1.1047993346634768]
We introduce a teacher-student learning approach that learns jointly from annotated simulation data and unlabeled real data.
Empirical results on three datasets highlight the effectiveness of the proposed framework.
arXiv Detail & Related papers (2021-03-02T09:30:28Z) - Real-Time Segmentation of Non-Rigid Surgical Tools based on Deep
Learning and Tracking [12.408997542491152]
Real-time tool segmentation is an essential component in computer-assisted surgical systems.
We propose a novel real-time automatic method based on Fully Convolutional Networks (FCN) and optical flow tracking.
arXiv Detail & Related papers (2020-09-07T11:06:14Z) - A Prospective Study on Sequence-Driven Temporal Sampling and Ego-Motion
Compensation for Action Recognition in the EPIC-Kitchens Dataset [68.8204255655161]
Action recognition is one of the top-challenging research fields in computer vision.
ego-motion recorded sequences have become of important relevance.
The proposed method aims to cope with it by estimating this ego-motion or camera motion.
arXiv Detail & Related papers (2020-08-26T14:44:45Z) - Learning Motion Flows for Semi-supervised Instrument Segmentation from
Robotic Surgical Video [64.44583693846751]
We study the semi-supervised instrument segmentation from robotic surgical videos with sparse annotations.
By exploiting generated data pairs, our framework can recover and even enhance temporal consistency of training sequences.
Results show that our method outperforms the state-of-the-art semisupervised methods by a large margin.
arXiv Detail & Related papers (2020-07-06T02:39:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.