Fugu-MT 論文翻訳(概要): Enhanced SegNet with Integrated Grad-CAM for Interpretable Retinal Layer Segmentation in OCT Images

論文の概要: Enhanced SegNet with Integrated Grad-CAM for Interpretable Retinal Layer Segmentation in OCT Images

arxiv url: http://arxiv.org/abs/2509.07795v1
Date: Tue, 09 Sep 2025 14:31:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-10 14:38:27.357104
Title: Enhanced SegNet with Integrated Grad-CAM for Interpretable Retinal Layer Segmentation in OCT Images
Title（参考訳）: OCT画像における網膜層セグメンテーションのためのGrad-CAMを用いた拡張SegNet
Authors: S M Asiful Islam Saky, Ugyen Tshering,
Abstract要約: そこで本研究では,SegNetによる網膜層分割の自動化と解釈のためのディープラーニングフレームワークを提案する。改良されたプール戦略を含むアーキテクチャ上の革新は、ノイズの多いOCT画像からの特徴抽出を強化する。 Grad-CAMの可視化では、解剖学的に関連のある領域が強調され、セグメンテーションと臨床バイオマーカーが一致した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Optical Coherence Tomography (OCT) is essential for diagnosing conditions such as glaucoma, diabetic retinopathy, and age-related macular degeneration. Accurate retinal layer segmentation enables quantitative biomarkers critical for clinical decision-making, but manual segmentation is time-consuming and variable, while conventional deep learning models often lack interpretability. This work proposes an improved SegNet-based deep learning framework for automated and interpretable retinal layer segmentation. Architectural innovations, including modified pooling strategies, enhance feature extraction from noisy OCT images, while a hybrid loss function combining categorical cross-entropy and Dice loss improves performance for thin and imbalanced retinal layers. Gradient-weighted Class Activation Mapping (Grad-CAM) is integrated to provide visual explanations, allowing clinical validation of model decisions. Trained and validated on the Duke OCT dataset, the framework achieved 95.77% validation accuracy, a Dice coefficient of 0.9446, and a Jaccard Index (IoU) of 0.8951. Class-wise results confirmed robust performance across most layers, with challenges remaining for thinner boundaries. Grad-CAM visualizations highlighted anatomically relevant regions, aligning segmentation with clinical biomarkers and improving transparency. By combining architectural improvements, a customized hybrid loss, and explainable AI, this study delivers a high-performing SegNet-based framework that bridges the gap between accuracy and interpretability. The approach offers strong potential for standardizing OCT analysis, enhancing diagnostic efficiency, and fostering clinical trust in AI-driven ophthalmic tools.
Abstract（参考訳）: 光コヒーレンス・トモグラフィ(OCT)は緑内障、糖尿病網膜症、加齢に伴う黄斑変性などの病態の診断に必須である。正確な網膜層セグメンテーションは、臨床的な意思決定に重要な量的バイオマーカーを可能にするが、手動セグメンテーションは時間と可変であり、従来のディープラーニングモデルは解釈可能性に欠けることが多い。この研究は、自動化および解釈可能な網膜層セグメンテーションのための改良されたSegNetベースのディープラーニングフレームワークを提案する。改良されたプール戦略を含むアーキテクチャ上の革新は、ノイズの多いOCT画像からの特徴抽出を強化し、カテゴリ横断エントロピーとDice損失を組み合わせたハイブリッド損失関数は、薄く不均衡な網膜層の性能を向上させる。勾配重み付きクラス活性化マッピング(Grad-CAM)は視覚的説明を提供するために統合されており、モデル決定の臨床的検証を可能にする。 Duke OCTデータセットでトレーニングと検証が行われ、95.77%の検証精度、Dice係数0.9446、Jaccard Index(IoU)0.8951を達成した。クラスワイドの結果、ほとんどの層で堅牢なパフォーマンスが確認された。 Grad-CAMの可視化では、解剖学的に関連のある領域が強調され、セグメンテーションと臨床バイオマーカーが一致し、透明性が向上した。アーキテクチャの改善、カスタマイズされたハイブリッド損失、説明可能なAIを組み合わせることで、この研究は精度と解釈可能性のギャップを埋める高いパフォーマンスのSegNetベースのフレームワークを提供する。このアプローチは、OCT分析の標準化、診断効率の向上、AI駆動眼科ツールにおける臨床信頼の促進に強力な可能性を提供する。

論文の概要: Enhanced SegNet with Integrated Grad-CAM for Interpretable Retinal Layer Segmentation in OCT Images

関連論文リスト