An Interpretable Local Editing Model for Counterfactual Medical Image Generation
- URL: http://arxiv.org/abs/2603.00423v1
- Date: Sat, 28 Feb 2026 02:48:15 GMT
- Title: An Interpretable Local Editing Model for Counterfactual Medical Image Generation
- Authors: Hyungi Min, Taeseung You, Hangyeul Lee, Yeongjae Cho, Sungzoon Cho
- Abstract summary: InstructX2X is a novel interpretable local editing model for counterfactual medical image generation featuring Region-Specific Editing. Our model successfully generates high-quality counterfactual chest X-ray images along with interpretable explanations.
- Score: 11.263626235904995
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Counterfactual medical image generation has emerged as a critical tool for enhancing AI-driven systems in the medical domain by answering "what-if" questions. However, existing approaches face two fundamental limitations. First, they fail to prevent unintended modifications, resulting in collateral changes in demographic attributes when only disease features should be affected. Second, they lack interpretability in their editing process, which significantly limits their utility in real-world medical applications. To address these limitations, we present InstructX2X, a novel interpretable local editing model for counterfactual medical image generation featuring Region-Specific Editing. This approach restricts modifications to specific regions, effectively preventing unintended changes while simultaneously providing a Guidance Map that offers inherently interpretable visual explanations of the editing process. Additionally, we introduce MIMIC-EDIT-INSTRUCTION, a dataset for counterfactual medical image generation derived from expert-verified medical VQA pairs. Through extensive experiments, InstructX2X achieves state-of-the-art performance across all major evaluation metrics. Our model successfully generates high-quality counterfactual chest X-ray images along with interpretable explanations.
Related papers
- MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware Prompts [70.64143198545031]
We propose MedREK, a retrieval-based editing framework that integrates a shared query-key module for precise matching with an attention-based prompt encoder for informative guidance.
Our results on various medical benchmarks demonstrate that MedREK achieves superior performance across different core metrics.
arXiv Detail & Related papers (2025-10-15T12:50:33Z) - MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing [14.713122814049806]
MedEBench is a benchmark designed to diagnose reliability in text-guided medical image editing.
MedEBench consists of 1,182 clinically curated image-prompt pairs covering 70 distinct editing tasks and 13 anatomical regions.
arXiv Detail & Related papers (2025-06-02T17:43:01Z) - Causal Disentanglement for Robust Long-tail Medical Image Generation [80.15257897500578]
We propose a novel medical image generation framework, which generates independent pathological and structural features.
We leverage a diffusion model guided by pathological findings to model pathological features, enabling the generation of diverse counterfactual images.
arXiv Detail & Related papers (2025-04-20T01:54:18Z) - Interactive Tumor Progression Modeling via Sketch-Based Image Editing [54.47725383502915]
We propose SkEditTumor, a sketch-based diffusion model for controllable tumor progression editing.
By leveraging sketches as structural priors, our method enables precise modifications of tumor regions while maintaining structural integrity and visual realism.
Our contributions include a novel integration of sketches with diffusion models for medical image editing, fine-grained control over tumor progression visualization, and extensive validation across multiple datasets, setting a new benchmark in the field.
arXiv Detail & Related papers (2025-03-10T00:04:19Z) - MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI [2.4557713325522914]
We propose MedEdit, a conditional diffusion model for medical image editing.
MedEdit induces pathology in specific areas while balancing the modeling of disease effects and preserving the integrity of the original scan.
We believe this work will enable counterfactual image editing research to further advance the development of realistic and clinically useful imaging tools.
arXiv Detail & Related papers (2024-07-21T21:19:09Z) - RadEdit: stress-testing biomedical vision models via diffusion image editing [45.43408333243842]
This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models.
Existing editing methods can produce undesirable changes, with spurious correlations learned due to the co-occurrence of disease and treatment interventions.
We introduce a new editing method RadEdit that uses multiple masks, if present, to constrain changes and ensure consistency in the edited images.
arXiv Detail & Related papers (2023-12-20T09:27:41Z) - Generative Residual Attention Network for Disease Detection [51.60842580044539]
We present a novel approach for disease generation in X-rays using conditional generative adversarial learning.
We generate a corresponding radiology image in a target domain while preserving the identity of the patient.
We then use the generated X-ray image in the target domain to augment our training to improve the detection performance.
arXiv Detail & Related papers (2021-10-25T14:15:57Z) - Variational Topic Inference for Chest X-Ray Report Generation [102.04931207504173]
Report generation for medical imaging promises to reduce workload and assist diagnosis in clinical practice.
Recent work has shown that deep learning models can successfully caption natural images.
We propose variational topic inference for automatic report generation.
arXiv Detail & Related papers (2021-07-15T13:34:38Z) - Uncertainty-Guided Progressive GANs for Medical Image Translation [37.95176881950121]
Image-to-image translation plays a vital role in tackling various medical imaging tasks.
We propose an uncertainty-guided progressive learning scheme for image-to-image translation.
We demonstrate the efficacy of our model on three challenging medical image translation tasks.
arXiv Detail & Related papers (2021-06-29T16:26:12Z) - Cross Chest Graph for Disease Diagnosis with Structural Relational Reasoning [2.7148274921314615]
Locating lesions is important in the computer-aided diagnosis of X-ray images.
General weakly-supervised methods have failed to consider the characteristics of X-ray images.
We propose the Cross-chest Graph (CCG), which improves the performance of automatic lesion detection.
arXiv Detail & Related papers (2021-01-22T08:24:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.