Related papers: A Hybrid AI-based and Rule-based Approach to DICOM De-identification: A Solution for the MIDI-B Challenge

A Hybrid AI-based and Rule-based Approach to DICOM De-identification: A Solution for the MIDI-B Challenge

URL: http://arxiv.org/abs/2509.00437v1
Date: Sat, 30 Aug 2025 09:52:15 GMT
Title: A Hybrid AI-based and Rule-based Approach to DICOM De-identification: A Solution for the MIDI-B Challenge
Authors: Hamideh Haghiri, Rajesh Baidya, Stefan Dvoretskii, Klaus H. Maier-Hein, Marco Nolden,
Abstract summary: This paper presents a hybrid de-identification framework designed to process Digital Imaging and Communications in Medicine (DICOM) files.<n>Our framework adopts a modified, pre-built rule-based component, updated with The Cancer Imaging Archive (TCIA)'s best practices guidelines.<n>It incorporates PaddleOCR, a robust Optical Character Recognition (OCR) system for extracting text from images, and RoBERTa, a fine-tuned transformer-based model for identifying Personally Identifiable Information (PII) and Protected Health Information (PHI)
Score: 4.40986569501073
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Ensuring the de-identification of medical imaging data is a critical step in enabling safe data sharing. This paper presents a hybrid de-identification framework designed to process Digital Imaging and Communications in Medicine (DICOM) files. Our framework adopts a modified, pre-built rule-based component, updated with The Cancer Imaging Archive (TCIA)'s best practices guidelines, as outlined in DICOM PS 3.15, for improved performance. It incorporates PaddleOCR, a robust Optical Character Recognition (OCR) system for extracting text from images, and RoBERTa, a fine-tuned transformer-based model for identifying and removing Personally Identifiable Information (PII) and Protected Health Information (PHI). Initially, the transformer-based model and the rule-based component were integrated to process for both structured data and free text. However, this coarse-grained approach did not yield optimal results. To improve performance, we refined our approach by applying the transformer model exclusively to free text, while structured data was handled only by rule-based methods. In this framework the DICOM validator dciodvfy was leveraged to ensure the integrity of DICOM files after the deID process. Through iterative refinement, including the incorporation of custom rules and private tag handling, the framework achieved a de-identification accuracy of 99.91% on the MIDI-B test dataset. The results demonstrate the effectiveness of combining rule-based compliance with AI-enabled adaptability in addressing the complex challenges of DICOM de-identification.

Related papers

Revisiting Integration of Image and Metadata for DICOM Series Classification: Cross-Attention and Dictionary Learning [1.4430021185664905]
DICOM series classification remains challenging due to heterogeneous slice content, variable series length, and entirely missing, incomplete or inconsistent DICOM metadata.<n>We propose an end-to-end multimodal framework for DICOM series classification that jointly models image content and acquisition metadata.<n>We evaluate the proposed approach on the publicly available Duke Liver MRI dataset and a large multi-institutional in-house cohort.
arXiv Detail & Related papers (2026-02-27T09:12:24Z)
Fine-Grained Zero-Shot Composed Image Retrieval with Complementary Visual-Semantic Integration [64.12127577975696]
Zero-shot composed image retrieval (ZS-CIR) is a rapidly growing area with significant practical applications.<n>Existing ZS-CIR methods often struggle to capture fine-grained changes and integrate visual and semantic information effectively.<n>We propose a novel Fine-Grained Zero-Shot Composed Image Retrieval method with Complementary Visual-Semantic Integration.
arXiv Detail & Related papers (2026-01-20T15:17:14Z)
Adaptive Agent Selection and Interaction Network for Image-to-point cloud Registration [28.794550758318664]
We propose a novel cross-modal registration framework composed of two key modules: the Iterative Agents Selection (IAS) module and the Reliable Agents Interaction (RAI) module.<n>IAS enhances structural feature awareness with phase maps and employs reinforcement learning principles to efficiently select reliable agents.<n>RAI then leverages these selected agents to guide cross-modal interactions, effectively reducing mismatches and improving overall robustness.
arXiv Detail & Related papers (2025-11-08T10:50:43Z)
Knowledge-Informed Neural Network for Complex-Valued SAR Image Recognition [51.03674130115878]
We introduce the Knowledge-Informed Neural Network (KINN), a lightweight framework built upon a novel "compression-aggregation-compression" architecture.<n>KINN establishes a state-of-the-art in parameter-efficient recognition, offering exceptional generalization in data-scarce and out-of-distribution scenarios.
arXiv Detail & Related papers (2025-10-23T07:12:26Z)
A DICOM Image De-identification Algorithm in the MIDI-B Challenge [1.1770063763895537]
De-identification is essential for the public sharing of medical images in the widely used Digital Imaging and Communications in Medicine (DICOM) format.<n>The MIDI-B challenge was organized to evaluate rule-based DICOM image de-identification algorithms with a large dataset of clinical DICOM images.<n>We detail the de-identification methods we applied - such as pixel masking, date shifting, date hashing, text recognition, text replacement, and text removal - to process datasets during the test phase in strict compliance with these standards.
arXiv Detail & Related papers (2025-08-11T01:38:07Z)
Medical Image De-Identification Resources: Synthetic DICOM Data and Tools for Validation [0.10617782943195009]
Ensuring patient privacy remains a significant challenge for open-access data sharing.<n>Digital Imaging and Communications in Medicine (DICOM) encodes both essential clinical metadata and extensive protected health information (PHI) and personally identifiable information (PII)<n>To address this gap, we developed an openly accessible DICOM dataset infused with synthetic PHI/PII and an evaluation framework for benchmarking image de-identification.
arXiv Detail & Related papers (2025-08-03T18:48:28Z)
DICOM De-Identification via Hybrid AI and Rule-Based Framework for Scalable, Uncertainty-Aware Redaction [0.0]
This paper presents a hybrid de-identification framework that combines rule-based and AI-driven techniques.<n>Our solution addresses critical challenges in medical data de-identification and supports the secure, ethical, and trustworthy release of imaging data for research.
arXiv Detail & Related papers (2025-07-31T17:19:38Z)
Medical Image De-Identification Benchmark Challenge [1.491270549044044]
The aim of the MIDI-B Challenge was to provide a standardized platform for benchmarking of DICOM image deID tools.<n>The challenge employed a large, diverse, multi-center, and multi-modality set of real de-identified radiology images with synthetic PHI/PII inserted.<n>Ten teams successfully completed the test phase of the challenge.
arXiv Detail & Related papers (2025-07-31T14:47:20Z)
Unity in Diversity: Multi-expert Knowledge Confrontation and Collaboration for Generalizable Vehicle Re-identification [60.20318058777603]
Generalizable vehicle re-identification (ReID) seeks to develop models that can adapt to unknown target domains without the need for fine-tuning or retraining.<n>Previous works have mainly focused on extracting domain-invariant features by aligning data distributions between source domains.<n>We propose a two-stage Multi-expert Knowledge Confrontation and Collaboration (MiKeCoCo) method to solve this unique problem.
arXiv Detail & Related papers (2024-07-10T04:06:39Z)
Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation [109.1912721224697]
We present the Unified Frequency-Assisted transFormer framework, named UFAFormer, to address the DGM4 problem. By leveraging the discrete wavelet transform, we decompose images into several frequency sub-bands, capturing rich face forgery artifacts. Our proposed frequency encoder, incorporating intra-band and inter-band self-attentions, explicitly aggregates forgery features within and across diverse sub-bands.
arXiv Detail & Related papers (2023-09-18T11:06:42Z)
Zero-shot Composed Text-Image Retrieval [72.43790281036584]
We consider the problem of composed image retrieval (CIR) It aims to train a model that can fuse multi-modal information, e.g., text and images, to accurately retrieve images that match the query, extending the user's expression ability.
arXiv Detail & Related papers (2023-06-12T17:56:01Z)
Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer [60.31021888394358]
Unsupervised Domain Adaptation (UDA) can effectively address domain gap issues in real-world image Super-Resolution (SR) We propose a SOurce-free Domain Adaptation framework for image SR (SODA-SR) to address this issue, i.e., adapt a source-trained model to a target domain with only unlabeled target data.
arXiv Detail & Related papers (2023-03-31T03:14:44Z)

This list is automatically generated from the titles and abstracts of the papers in this site.