Analysis of Macula on Color Fundus Images Using Heightmap Reconstruction
Through Deep Learning
- URL: http://arxiv.org/abs/2012.14140v1
- Date: Mon, 28 Dec 2020 08:21:55 GMT
- Title: Analysis of Macula on Color Fundus Images Using Heightmap Reconstruction
Through Deep Learning
- Authors: Peyman Tahghighi, Reza A.Zoroofi, Sare Safi, Alireza Ramezani
- Abstract summary: We propose a novel architecture for the generator which enhances the details and the quality of output by progressive refinement and the use of deep supervision.
The proposed method can provide additional information for ophthalmologists for diagnosis.
- Score: 5.935761705025763
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: For medical diagnosis based on retinal images, a clear understanding of 3D
structure is often required but due to the 2D nature of images captured, we
cannot infer that information. However, by utilizing 3D reconstruction methods,
we can recover the height information of the macula area on a fundus image
which can be helpful for diagnosis and screening of macular disorders. Recent
approaches have used shading information for heightmap prediction but their
output was not accurate since they ignored the dependency between nearby pixels
and only utilized shading information. Additionally, other methods were
dependent on the availability of more than one image of the retina which is not
available in practice. In this paper, motivated by the success of Conditional
Generative Adversarial Networks(cGANs) and deeply supervised networks, we
propose a novel architecture for the generator which enhances the details and
the quality of output by progressive refinement and the use of deep supervision
to reconstruct the height information of macula on a color fundus image.
Comparisons on our own dataset illustrate that the proposed method outperforms
all of the state-of-the-art methods in image translation and medical image
translation on this particular task. Additionally, perceptual studies also
indicate that the proposed method can provide additional information for
ophthalmologists for diagnosis.
Related papers
- A 3D Facial Reconstruction Evaluation Methodology: Comparing Smartphone Scans with Deep Learning Based Methods Using Geometry and Morphometry Criteria [60.865754842465684]
Three-dimensional (3D) facial shape analysis has gained interest due to its potential clinical applications.
High cost of advanced 3D facial acquisition systems limits their widespread use, driving the development of low-cost acquisition and reconstruction methods.
This study introduces a novel evaluation methodology that goes beyond traditional geometry-based benchmarks by integrating morphometric shape analysis techniques.
arXiv Detail & Related papers (2025-02-13T15:47:45Z) - Autoregressive Sequence Modeling for 3D Medical Image Representation [48.706230961589924]
We introduce a pioneering method for learning 3D medical image representations through an autoregressive sequence pre-training framework.
Our approach various 3D medical images based on spatial, contrast, and semantic correlations, treating them as interconnected visual tokens within a token sequence.
arXiv Detail & Related papers (2024-09-13T10:19:10Z) - Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review [0.0]
Deep learning techniques have been used to improve image quality in cone-beam computed tomography (CBCT)
We provide an overview of deep learning techniques that have successfully been shown to reduce artifacts in 3D, as well as in time-resolved (4D) CBCT.
One of the key findings of this work is an observed trend towards the use of generative models including GANs and score-based or diffusion models.
arXiv Detail & Related papers (2024-03-27T13:46:01Z) - SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven
Generative Adversarial Networks [6.624839896733912]
This paper presents a new self-driven generative adversarial network model (SdCT-GAN) for reconstruction of 3D CT images.
It is motivated to pay more attention to image details by introducing a novel auto-encoder structure in the discriminator.
LPIPS evaluation metric is adopted that can quantitatively evaluate the fine contours and textures of reconstructed images better than the existing ones.
arXiv Detail & Related papers (2023-09-10T08:16:02Z) - Multi-View Vertebra Localization and Identification from CT Images [57.56509107412658]
We propose a multi-view vertebra localization and identification from CT images.
We convert the 3D problem into a 2D localization and identification task on different views.
Our method can learn the multi-view global information naturally.
arXiv Detail & Related papers (2023-07-24T14:43:07Z) - 3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary
Nodules Applied in Computed Tomography [32.775884701366465]
We introduce the image matting into the 3D scenes and use the alpha matte, i.e., a soft mask, to describe lesions in a 3D medical image.
To address this issue, we conduct a comprehensive study of 3D matting, including both traditional and deep-learning-based methods.
We propose the first end-to-end deep 3D matting network and implement a solid 3D medical image matting benchmark.
arXiv Detail & Related papers (2022-10-11T02:40:18Z) - Attentive Symmetric Autoencoder for Brain MRI Segmentation [56.02577247523737]
We propose a novel Attentive Symmetric Auto-encoder based on Vision Transformer (ViT) for 3D brain MRI segmentation tasks.
In the pre-training stage, the proposed auto-encoder pays more attention to reconstruct the informative patches according to the gradient metrics.
Experimental results show that our proposed attentive symmetric auto-encoder outperforms the state-of-the-art self-supervised learning methods and medical image segmentation models.
arXiv Detail & Related papers (2022-09-19T09:43:19Z) - 3D Matting: A Soft Segmentation Method Applied in Computed Tomography [26.25446145993599]
Three-dimensional (3D) images, such as CT, MRI, and PET, are common in medical imaging applications and important in clinical diagnosis.
Semantic ambiguity is a typical feature of many medical image labels.
In 2D medical images, using soft masks instead of binary masks generated by image matting to characterize lesions can provide rich semantic information.
arXiv Detail & Related papers (2022-09-16T10:18:59Z) - Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation [111.89519571205778]
In this work, we propose an alternative domain-adaptive approach to depth estimation.
Our novel two-step structure first trains a depth estimation network with labeled synthetic images in a supervised manner.
The results of our experiments show that the proposed method improves the network's performance on real images by a considerable margin.
arXiv Detail & Related papers (2021-09-24T08:11:34Z) - Generative Adversarial U-Net for Domain-free Medical Image Augmentation [49.72048151146307]
The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing.
In this paper, we develop a novel generative method named generative adversarial U-Net.
Our newly designed model is domain-free and generalizable to various medical images.
arXiv Detail & Related papers (2021-01-12T23:02:26Z) - Heightmap Reconstruction of Macula on Color Fundus Images Using
Conditional Generative Adversarial Networks [5.419608513284394]
Fundus images which are one the most common screening modalities for retina diagnosis lack this information due to their 2D nature.
We propose a novel architecture for the generator which enhances the details in a sequence of steps.
arXiv Detail & Related papers (2020-09-03T12:13:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.