Using spatial-temporal ensembles of convolutional neural networks for
lumen segmentation in ureteroscopy
- URL: http://arxiv.org/abs/2104.01985v1
- Date: Mon, 5 Apr 2021 16:24:32 GMT
- Title: Using spatial-temporal ensembles of convolutional neural networks for
lumen segmentation in ureteroscopy
- Authors: Jorge F. Lazo, Aldo Marzullo, Sara Moccia, Michele Catellani, Benoit
Rosa, Michel de Mathelin, Elena De Momi
- Abstract summary: This paper presents an automatic method based on Convolutional Neural Networks (CNNs)
The proposed method is based on an ensemble of 4 parallel CNNs to simultaneously process single and multi-frame information.
- Score: 11.457020223521605
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Purpose: Ureteroscopy is an efficient endoscopic minimally invasive technique
for the diagnosis and treatment of upper tract urothelial carcinoma (UTUC).
During ureteroscopy, the automatic segmentation of the hollow lumen is of
primary importance, since it indicates the path that the endoscope should
follow. In order to obtain an accurate segmentation of the hollow lumen, this
paper presents an automatic method based on Convolutional Neural Networks
(CNNs).
Methods: The proposed method is based on an ensemble of 4 parallel CNNs to
simultaneously process single and multi-frame information. Of these, two
architectures are taken as core-models, namely U-Net based in residual
blocks($m_1$) and Mask-RCNN($m_2$), which are fed with single still-frames
$I(t)$. The other two models ($M_1$, $M_2$) are modifications of the former
ones consisting on the addition of a stage which makes use of 3D Convolutions
to process temporal information. $M_1$, $M_2$ are fed with triplets of frames
($I(t-1)$, $I(t)$, $I(t+1)$) to produce the segmentation for $I(t)$.
Results: The proposed method was evaluated using a custom dataset of 11
videos (2,673 frames) which were collected and manually annotated from 6
patients. We obtain a Dice similarity coefficient of 0.80, outperforming
previous state-of-the-art methods.
Conclusion: The obtained results show that spatial-temporal information can
be effectively exploited by the ensemble model to improve hollow lumen
segmentation in ureteroscopic images. The method is effective also in presence
of poor visibility, occasional bleeding, or specular reflections.
Related papers
- Intraoperative Registration by Cross-Modal Inverse Neural Rendering [61.687068931599846]
We present a novel approach for 3D/2D intraoperative registration during neurosurgery via cross-modal inverse neural rendering.
Our approach separates implicit neural representation into two components, handling anatomical structure preoperatively and appearance intraoperatively.
We tested our method on retrospective patients' data from clinical cases, showing that our method outperforms state-of-the-art while meeting current clinical standards for registration.
arXiv Detail & Related papers (2024-09-18T13:40:59Z) - Acute ischemic stroke lesion segmentation in non-contrast CT images
using 3D convolutional neural networks [0.0]
We propose an automatic algorithm aimed at volumetric segmentation of acute ischemic stroke lesion in non-contrast computed tomography brain 3D images.
Our deep-learning approach is based on the popular 3D U-Net convolutional neural network architecture.
arXiv Detail & Related papers (2023-01-17T10:39:39Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - A Neural Ordinary Differential Equation Model for Visualizing Deep
Neural Network Behaviors in Multi-Parametric MRI based Glioma Segmentation [3.1435638364138105]
We develop a neural ordinary differential equation (ODE) model for visualizing deep neural network (DNN) during multi-parametric MRI (mp-MRI) based glioma segmentation.
All neural ODE models successfully illustrated image dynamics as expected.
arXiv Detail & Related papers (2022-03-01T17:16:41Z) - Multiple Sclerosis Lesions Segmentation using Attention-Based CNNs in
FLAIR Images [0.2578242050187029]
Multiple Sclerosis (MS) is an autoimmune, and demyelinating disease that leads to lesions in the central nervous system.
Up to now a multitude of multimodality automatic biomedical approaches is used to segment lesions.
Authors propose a method employing just one modality (FLAIR image) to segment MS lesions accurately.
arXiv Detail & Related papers (2022-01-05T21:37:43Z) - Automatic size and pose homogenization with spatial transformer network
to improve and accelerate pediatric segmentation [51.916106055115755]
We propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN)
Our architecture is composed of three sequential modules that are estimated together during training.
We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners.
arXiv Detail & Related papers (2021-07-06T14:50:03Z) - Automatic CT Segmentation from Bounding Box Annotations using
Convolutional Neural Networks [2.554905387213585]
The proposed method is composed of two steps: 1) generating pseudo masks with bounding box annotations by k-means clustering, and 2) iteratively training a 3D U-Net convolutional neural network as a segmentation model.
For liver, spleen and kidney segmentation, it achieved an accuracy of 95.19%, 92.11%, and 91.45%, respectively.
arXiv Detail & Related papers (2021-05-29T14:48:16Z) - A self-supervised learning strategy for postoperative brain cavity
segmentation simulating resections [46.414990784180546]
Convolutional neural networks (CNNs) are the state-of-the-art image segmentation technique.
CNNs require large annotated datasets for training.
Self-supervised learning strategies can leverage unlabeled data for training.
arXiv Detail & Related papers (2021-05-24T12:27:06Z) - Multi-modal segmentation of 3D brain scans using neural networks [0.0]
Deep convolutional neural networks are trained to segment 3D MRI (MPRAGE, DWI, FLAIR) and CT scans.
segmentation quality is quantified using the Dice metric for a total of 27 anatomical structures.
arXiv Detail & Related papers (2020-08-11T09:13:54Z) - A Novel Approach for Correcting Multiple Discrete Rigid In-Plane Motions
Artefacts in MRI Scans [63.28835187934139]
We propose a novel method for removing motion artefacts using a deep neural network with two input branches.
The proposed method can be applied to artefacts generated by multiple movements of the patient.
arXiv Detail & Related papers (2020-06-24T15:25:11Z) - Appearance Learning for Image-based Motion Estimation in Tomography [60.980769164955454]
In tomographic imaging, anatomical structures are reconstructed by applying a pseudo-inverse forward model to acquired signals.
Patient motion corrupts the geometry alignment in the reconstruction process resulting in motion artifacts.
We propose an appearance learning approach recognizing the structures of rigid motion independently from the scanned object.
arXiv Detail & Related papers (2020-06-18T09:49:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.