DeepEyeNet: Generating Medical Report for Retinal Images
- URL: http://arxiv.org/abs/2509.12534v1
- Date: Tue, 16 Sep 2025 00:18:56 GMT
- Title: DeepEyeNet: Generating Medical Report for Retinal Images
- Authors: Jia-Hong Huang,
- Abstract summary: The increasing prevalence of retinal diseases poses a significant challenge to the healthcare system.<n>Traditional methods of generating medical reports from retinal images rely on manual interpretation.<n>This thesis investigates the potential of Artificial Intelligence to automate medical report generation for retinal images.
- Score: 4.957002348970864
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The increasing prevalence of retinal diseases poses a significant challenge to the healthcare system, as the demand for ophthalmologists surpasses the available workforce. This imbalance creates a bottleneck in diagnosis and treatment, potentially delaying critical care. Traditional methods of generating medical reports from retinal images rely on manual interpretation, which is time-consuming and prone to errors, further straining ophthalmologists' limited resources. This thesis investigates the potential of Artificial Intelligence (AI) to automate medical report generation for retinal images. AI can quickly analyze large volumes of image data, identifying subtle patterns essential for accurate diagnosis. By automating this process, AI systems can greatly enhance the efficiency of retinal disease diagnosis, reducing doctors' workloads and enabling them to focus on more complex cases. The proposed AI-based methods address key challenges in automated report generation: (1) A multi-modal deep learning approach captures interactions between textual keywords and retinal images, resulting in more comprehensive medical reports; (2) Improved methods for medical keyword representation enhance the system's ability to capture nuances in medical terminology; (3) Strategies to overcome RNN-based models' limitations, particularly in capturing long-range dependencies within medical descriptions; (4) Techniques to enhance the interpretability of the AI-based report generation system, fostering trust and acceptance in clinical practice. These methods are rigorously evaluated using various metrics and achieve state-of-the-art performance. This thesis demonstrates AI's potential to revolutionize retinal disease diagnosis by automating medical report generation, ultimately improving clinical efficiency, diagnostic accuracy, and patient care.
Related papers
- Intelligent Healthcare Imaging Platform: A VLM-Based Framework for Automated Medical Image Analysis and Clinical Report Generation [0.0]
This work presents an intelligent multimodal framework for medical image analysis that leverages Vision-Language Models (VLMs)<n>The framework integrates Google Gemini 2.5 Flash for automated tumor detection and clinical report generation across multiple imaging modalities including CT, MRI, X-ray, and Ultrasound.
arXiv Detail & Related papers (2025-09-16T23:15:44Z) - Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications [59.721265428780946]
Large Language Models (LLMs) in medicine have enabled impressive capabilities, yet a critical gap remains in their ability to perform systematic, transparent, and verifiable reasoning.<n>This paper provides the first systematic review of this emerging field.<n>We propose a taxonomy of reasoning enhancement techniques, categorized into training-time strategies and test-time mechanisms.
arXiv Detail & Related papers (2025-08-01T14:41:31Z) - Deep Learning for Ophthalmology: The State-of-the-Art and Future Trends [7.893548922956548]
The emergence of artificial intelligence (AI) has marked a new era in the realm of ophthalmology.<n>This review explores the cutting-edge applications of deep learning (DL) across a range of ocular conditions.
arXiv Detail & Related papers (2025-01-07T18:53:14Z) - Automated Retinal Image Analysis and Medical Report Generation through Deep Learning [3.4447129363520337]
The increasing prevalence of retinal diseases poses a significant challenge to the healthcare system.
Traditional methods of generating medical reports from retinal images rely on manual interpretation.
This thesis investigates the potential of Artificial Intelligence to automate medical report generation for retinal images.
arXiv Detail & Related papers (2024-08-14T07:47:25Z) - Algorithm-based diagnostic application for diabetic retinopathy
detection [0.0]
Diabetic retinopathy is a growing health problem worldwide and is a leading cause of visual impairment and blindness.
Recent research in the field of diabetic retinopathy diagnosis is using advanced technologies, such as analysis of images obtained by ophthalmoscopy.
This paper describes an automatic DR diagnosis method that includes processing and analysis of ophthalmoscopic images of the eye.
arXiv Detail & Related papers (2023-12-01T12:09:06Z) - Radiology Report Generation Using Transformers Conditioned with
Non-imaging Data [55.17268696112258]
This paper proposes a novel multi-modal transformer network that integrates chest x-ray (CXR) images and associated patient demographic information.
The proposed network uses a convolutional neural network to extract visual features from CXRs and a transformer-based encoder-decoder network that combines the visual features with semantic text embeddings of patient demographic information.
arXiv Detail & Related papers (2023-11-18T14:52:26Z) - Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges [58.32937972322058]
"Medico automatic polyp segmentation (Medico 2020)" and "MedAI: Transparency in Medical Image (MedAI 2021)" competitions.
We present a comprehensive summary and analyze each contribution, highlight the strength of the best-performing methods, and discuss the possibility of clinical translations of such methods into the clinic.
arXiv Detail & Related papers (2023-07-30T16:08:45Z) - Explainable Artificial Intelligence in Retinal Imaging for the detection
of Systemic Diseases [0.0]
This study aims to evaluate an explainable staged grading process without using deep Convolutional Neural Networks (CNNs) directly.
We have proposed a clinician-in-the-loop assisted intelligent workflow that performs a retinal vascular assessment on the fundus images.
The semiautomatic methodology aims to have a federated approach to AI in healthcare applications with more inputs and interpretations from clinicians.
arXiv Detail & Related papers (2022-12-14T07:00:31Z) - Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation [116.87918100031153]
We propose a Cross-modal clinical Graph Transformer (CGT) for ophthalmic report generation (ORG)
CGT injects clinical relation triples into the visual features as prior knowledge to drive the decoding procedure.
Experiments on the large-scale FFA-IR benchmark demonstrate that the proposed CGT is able to outperform previous benchmark methods.
arXiv Detail & Related papers (2022-06-04T13:16:30Z) - Robust and Efficient Medical Imaging with Self-Supervision [80.62711706785834]
We present REMEDIS, a unified representation learning strategy to improve robustness and data-efficiency of medical imaging AI.
We study a diverse range of medical imaging tasks and simulate three realistic application scenarios using retrospective data.
arXiv Detail & Related papers (2022-05-19T17:34:18Z) - An Interpretable Multiple-Instance Approach for the Detection of
referable Diabetic Retinopathy from Fundus Images [72.94446225783697]
We propose a machine learning system for the detection of referable Diabetic Retinopathy in fundus images.
By extracting local information from image patches and combining it efficiently through an attention mechanism, our system is able to achieve high classification accuracy.
We evaluate our approach on publicly available retinal image datasets, in which it exhibits near state-of-the-art performance.
arXiv Detail & Related papers (2021-03-02T13:14:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.