A Closer Look at Edema Area Segmentation in SD-OCT Images Using Adversarial Framework
- URL: http://arxiv.org/abs/2508.18790v1
- Date: Tue, 26 Aug 2025 08:16:48 GMT
- Title: A Closer Look at Edema Area Segmentation in SD-OCT Images Using Adversarial Framework
- Authors: Yuhui Tao, Yizhe Zhang, Qiang Chen,
- Abstract summary: We propose an off-the-shelf adversarial framework for edema area (EA) segmentation with a novel layer-structure-guided post-processing step and a test-time-adaptation strategy.<n>Our framework reframes the dense EA prediction task as one of confirming points between the EA contour and retinal layers, result-ing in predictions that better align with the shape prior to EA.
- Score: 9.612454002801242
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The development of artificial intelligence models for macular edema (ME) analy-sis always relies on expert-annotated pixel-level image datasets which are expen-sive to collect prospectively. While anomaly-detection-based weakly-supervised methods have shown promise in edema area (EA) segmentation task, their per-formance still lags behind fully-supervised approaches. In this paper, we leverage the strong correlation between EA and retinal layers in spectral-domain optical coherence tomography (SD-OCT) images, along with the update characteristics of weakly-supervised learning, to enhance an off-the-shelf adversarial framework for EA segmentation with a novel layer-structure-guided post-processing step and a test-time-adaptation (TTA) strategy. By incorporating additional retinal lay-er information, our framework reframes the dense EA prediction task as one of confirming intersection points between the EA contour and retinal layers, result-ing in predictions that better align with the shape prior of EA. Besides, the TTA framework further helps address discrepancies in the manifestations and presen-tations of EA between training and test sets. Extensive experiments on two pub-licly available datasets demonstrate that these two proposed ingredients can im-prove the accuracy and robustness of EA segmentation, bridging the gap between weakly-supervised and fully-supervised models.
Related papers
- GEM-TFL: Bridging Weak and Full Supervision for Forgery Localization through EM-Guided Decomposition and Temporal Refinement [24.929199892659636]
Temporal Forgery Localization aims to precisely identify manipulated segments within videos or audio streams, providing interpretable evidence for multimedia forensics and security.<n>Most existing TFL methods rely on dense frame-level labels in a fully supervised manner, but Weakly Supervised TFL (WS-TFL) reduces labeling cost by learning only from binary video-level labels.<n>We propose GEM-TFL, a two-phase classification-regression framework that effectively bridges the supervision gap between training and inference.
arXiv Detail & Related papers (2026-03-05T12:07:26Z) - RefLSM: Linearized Structural-Prior Reflectance Model for Medical Image Segmentation and Bias-Field Correction [10.716406019360441]
We propose a novel variational Reflectance-based Level Set Model (RefLSM) for medical image segmentation.<n>RefLSM explicitly integrates Retinex-inspired reflectance decomposition into the segmentation framework.<n>We show that RefLSM achieves superior segmentation accuracy, robustness, and computational efficiency compared to state-of-the-art level set methods.
arXiv Detail & Related papers (2025-12-08T06:06:29Z) - Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery [3.1716746406651457]
This work introduces a sequence-based drug-target interaction framework that integrates structural priors into protein representations.<n>The model achieves state-of-the-art performance on Human and BioSNAP datasets and remains competitive on BindingDB.
arXiv Detail & Related papers (2025-09-18T09:38:46Z) - Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance [0.0]
We present a benchmark of diffusion models for human face generation on a small-scale CelebAMask-HQ dataset.<n>Our study compares UNet and DiT architectures for unconditional generation and explores LoRA-based fine-tuning of pretrained Stable Diffusion models.
arXiv Detail & Related papers (2025-08-13T14:27:47Z) - ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction [57.930531826380836]
This work explores whether a foundational segmentation model can address label scarcity in the pixel-level vision task as an annotator for unlabeled images.<n>We propose ConformalSAM, a novel SSSS framework which first calibrates the foundation model using the target domain's labeled data and then filters out unreliable pixel labels of unlabeled data.
arXiv Detail & Related papers (2025-07-21T17:02:57Z) - SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning [23.984926906083473]
Panoptic Scene Graph Generation (PSG) integrates instance segmentation with relation understanding to capture pixel-level structural relationships in complex scenes.<n>Recent approaches leveraging pre-trained vision-language models (VLMs) have significantly improved performance in the open-vocabulary setting.<n>We propose SPADE (SPatial-Aware Denoising-nEtwork) framework -- a novel approach for open-vocabulary PSG.
arXiv Detail & Related papers (2025-07-08T09:03:24Z) - CRIA: A Cross-View Interaction and Instance-Adapted Pre-training Framework for Generalizable EEG Representations [52.251569042852815]
CRIA is an adaptive framework that utilizes variable-length and variable-channel coding to achieve a unified representation of EEG data across different datasets.<n>The model employs a cross-attention mechanism to fuse temporal, spectral, and spatial features effectively.<n> Experimental results on the Temple University EEG corpus and the CHB-MIT dataset show that CRIA outperforms existing methods with the same pre-training conditions.
arXiv Detail & Related papers (2025-06-19T06:31:08Z) - An Intra- and Cross-frame Topological Consistency Scheme for Semi-supervised Atherosclerotic Coronary Plaque Segmentation [9.902034502749501]
Coronary atherosclerosis Analysis (CAA) relies on the analysis of vessel cross-section images reconstructed via Curved Planar Reformation.<n>This task presents significant challenges due to the indistinct boundaries and structures of plaques and blood vessels.<n>We propose a novel dual-consistency semi-supervised framework that integrates Intra-frame Topological Consistency (ITC) and Cross-frame Topological Consistency (CTC)<n>Our method surpasses existing semi-supervised methods and approaches the performance of supervised methods on CAA.
arXiv Detail & Related papers (2025-01-14T05:23:42Z) - Understanding and Guiding Weakly Supervised Entity Alignment with Potential Isomorphism Propagation [31.558938631213074]
We present a propagation perspective to analyze weakly supervised EA.
We show that aggregation-based EA models seek propagation operators for pairwise entity similarities.
We develop a general EA framework, PipEA, incorporating this operator to improve the accuracy of every type of aggregation-based model.
arXiv Detail & Related papers (2024-02-05T14:06:15Z) - Separate-and-Enhance: Compositional Finetuning for Text2Image Diffusion
Models [58.46926334842161]
This work illuminates the fundamental reasons for such misalignment, pinpointing issues related to low attention activation scores and mask overlaps.
We propose two novel objectives, the Separate loss and the Enhance loss, that reduce object mask overlaps and maximize attention scores.
Our method diverges from conventional test-time-adaptation techniques, focusing on finetuning critical parameters, which enhances scalability and generalizability.
arXiv Detail & Related papers (2023-12-10T22:07:42Z) - Mixed Graph Contrastive Network for Semi-Supervised Node Classification [63.924129159538076]
We propose a novel graph contrastive learning method, termed Mixed Graph Contrastive Network (MGCN)<n>In our method, we improve the discriminative capability of the latent embeddings by an unperturbed augmentation strategy and a correlation reduction mechanism.<n>By combining the two settings, we extract rich supervision information from both the abundant nodes and the rare yet valuable labeled nodes for discriminative representation learning.
arXiv Detail & Related papers (2022-06-06T14:26:34Z) - Test-time Adaptation with Slot-Centric Models [63.981055778098444]
Slot-TTA is a semi-supervised scene decomposition model that at test time is adapted per scene through gradient descent on reconstruction or cross-view synthesis objectives.
We show substantial out-of-distribution performance improvements against state-of-the-art supervised feed-forward detectors, and alternative test-time adaptation methods.
arXiv Detail & Related papers (2022-03-21T17:59:50Z) - A Macro-Micro Weakly-supervised Framework for AS-OCT Tissue Segmentation [33.684182783291064]
Primary angle closure glaucoma (PACG) is the leading cause of irreversible blindness among Asian people.
The proposed framework consists of two models which provide reliable guidance for each other.
Experiments on the publicly available AGE dataset demonstrate that the proposed framework outperforms the state-of-the-art semi-/weakly-supervised methods.
arXiv Detail & Related papers (2020-07-20T11:26:32Z) - Unsupervised Instance Segmentation in Microscopy Images via Panoptic
Domain Adaptation and Task Re-weighting [86.33696045574692]
We propose a Cycle Consistency Panoptic Domain Adaptive Mask R-CNN (CyC-PDAM) architecture for unsupervised nuclei segmentation in histopathology images.
We first propose a nuclei inpainting mechanism to remove the auxiliary generated objects in the synthesized images.
Secondly, a semantic branch with a domain discriminator is designed to achieve panoptic-level domain adaptation.
arXiv Detail & Related papers (2020-05-05T11:08:26Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.