LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in
Cataract Surgery Videos
- URL: http://arxiv.org/abs/2107.00875v1
- Date: Fri, 2 Jul 2021 07:27:29 GMT
- Title: LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in
Cataract Surgery Videos
- Authors: Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch,
Stephanie Sarny, Yosuf El-Shabrawi, Klaus Schoeffmann
- Abstract summary: A critical complication after cataract surgery is the dislocation of the lens implant leading to vision deterioration and eye trauma.
We propose an end-to-end recurrent neural network to recognize the lens-implantation phase and a novel semantic segmentation network to segment the lens and pupil after the implantation phase.
- Score: 6.743968799949719
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: A critical complication after cataract surgery is the dislocation of the lens
implant leading to vision deterioration and eye trauma. In order to reduce the
risk of this complication, it is vital to discover the risk factors during the
surgery. However, studying the relationship between lens dislocation and its
suspicious risk factors using numerous videos is a time-extensive procedure.
Hence, the surgeons demand an automatic approach to enable a larger-scale and,
accordingly, more reliable study. In this paper, we propose a novel framework
as the major step towards lens irregularity detection. In particular, we
propose (I) an end-to-end recurrent neural network to recognize the
lens-implantation phase and (II) a novel semantic segmentation network to
segment the lens and pupil after the implantation phase. The phase recognition
results reveal the effectiveness of the proposed surgical phase recognition
approach. Moreover, the segmentation results confirm the proposed segmentation
network's effectiveness compared to state-of-the-art rival approaches.
Related papers
- Predicting Postoperative Intraocular Lens Dislocation in Cataract
Surgery via Deep Learning [5.40411016117853]
A critical yet unpredictable complication following cataract surgery is intraocular lens dislocation.
We develop and evaluate the first fully-automatic framework for the computation of lens unfolding delay, rotation, and instability during surgery.
We exploit a large-scale dataset of cataract surgery videos featuring four intraocular lens brands.
arXiv Detail & Related papers (2023-12-06T10:27:15Z) - Action Recognition in Video Recordings from Gynecologic Laparoscopy [4.002010889177872]
Action recognition is a prerequisite for many applications in laparoscopic video analysis.
In this study, we design and evaluate a CNN-RNN architecture as well as a customized training-inference framework.
arXiv Detail & Related papers (2023-11-30T16:15:46Z) - EyeLS: Shadow-Guided Instrument Landing System for Intraocular Target
Approaching in Robotic Eye Surgery [51.05595735405451]
Robotic ophthalmic surgery is an emerging technology to facilitate high-precision interventions such as retina penetration in subretinal injection and removal of floating tissues in retinal detachment.
Current image-based methods cannot effectively estimate the needle tip's trajectory towards both retinal and floating targets.
We propose to use the shadow positions of the target and the instrument tip to estimate their relative depth position.
Our method succeeds target approaching on a retina model, and achieves an average depth error of 0.0127 mm and 0.3473 mm for floating and retinal targets respectively in the surgical simulator.
arXiv Detail & Related papers (2023-11-15T09:11:37Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Learning-Based Keypoint Registration for Fetoscopic Mosaicking [65.02392513942533]
In Twin-to-Twin Transfusion Syndrome (TTTS), abnormal vascular anastomoses in the monochorionic placenta can produce uneven blood flow between the two fetuses.
We propose a learning-based framework for in-vivo fetoscopy frame registration for field-of-view expansion.
arXiv Detail & Related papers (2022-07-26T21:21:12Z) - Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z) - MTCD: Cataract Detection via Near Infrared Eye Images [69.62768493464053]
cataract is a common eye disease and one of the leading causes of blindness and vision impairment.
We present a novel algorithm for cataract detection using near-infrared eye images.
Deep learning-based eye segmentation and multitask network classification networks are presented.
arXiv Detail & Related papers (2021-10-06T08:10:28Z) - Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action
Localization [7.235239641693831]
In cataract surgery, the operation is performed with the help of a microscope. Since the microscope enables watching real-time surgery by up to two people only, a major part of surgical training is conducted using the recorded videos.
To optimize the training procedure with the video content, the surgeons require an automatic relevance detection approach.
In this paper, a three- module framework is proposed to detect and classify the relevant phase segments in cataract videos.
arXiv Detail & Related papers (2021-04-29T12:01:08Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z) - TeCNO: Surgical Phase Recognition with Multi-Stage Temporal
Convolutional Networks [43.95869213955351]
We propose a Multi-Stage Temporal Convolutional Network (MS-TCN) that performs hierarchical prediction refinement for surgical phase recognition.
Our method is thoroughly evaluated on two datasets of laparoscopic cholecystectomy videos with and without the use of additional surgical tool information.
arXiv Detail & Related papers (2020-03-24T10:12:30Z) - Towards Augmented Reality-based Suturing in Monocular Laparoscopic
Training [0.5707453684578819]
The paper proposes an Augmented Reality environment with quantitative and qualitative visual representations to enhance laparoscopic training outcomes performed on a silicone pad.
This is enabled by a multi-task supervised deep neural network which performs multi-class segmentation and depth map prediction.
The network achieves a dice score of 0.67 for surgical needle segmentation, 0.81 for needle holder instrument segmentation and a mean absolute error of 6.5 mm for depth estimation.
arXiv Detail & Related papers (2020-01-19T19:59:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.