Deep classification algorithm for De-identification of DICOM medical images
- URL: http://arxiv.org/abs/2508.02177v1
- Date: Mon, 04 Aug 2025 08:21:18 GMT
- Title: Deep classification algorithm for De-identification of DICOM medical images
- Authors: Bufano Michele, Kotter Elmar,
- Abstract summary: De-identification of DICOM files is an essential component of medical image research.<n>The most sensible information, like names, history, personal data and institution were successfully recognized.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Background : De-identification of DICOM (Digital Imaging and Communi-cations in Medicine) files is an essential component of medical image research. Personal Identifiable Information (PII) and/or Personal Health Identifying Information (PHI) need to be hidden or removed due to legal reasons. According to the Health Insurance Portability and Accountability Act (HIPAA) and privacy rules, also full-face photographic images and any compa-rable images are direct identifiers and are considered protected health information that also need to be de-identified. Objective : The study aimed to implement a method that permit to de-identify the PII and PHI information present in the header and burned on the pixel data of DICOM. Methods : To execute the de-identification, we implemented an algorithm based on the safe harbor method, defined by HIPAA. Our algorithm uses input customizable parameter to classify and then possibly de-identify individual DICOM tags. Results : The most sensible information, like names, history, personal data and institution were successfully recognized. Conclusions : We developed a python algorithm that is able to classify infor-mation present in a DICOM file. The flexibility provided by the use of customi-zable input parameters, which allow the user to customize the entire process de-pending on the case (e.g., the language), makes the entire program very promis-ing for both everyday use and research purposes. Our code is available at https://github.com/rtdicomexplorer/deep_deidentification.
Related papers
- Medical Image De-Identification Resources: Synthetic DICOM Data and Tools for Validation [0.10617782943195009]
Ensuring patient privacy remains a significant challenge for open-access data sharing.<n>Digital Imaging and Communications in Medicine (DICOM) encodes both essential clinical metadata and extensive protected health information (PHI) and personally identifiable information (PII)<n>To address this gap, we developed an openly accessible DICOM dataset infused with synthetic PHI/PII and an evaluation framework for benchmarking image de-identification.
arXiv Detail & Related papers (2025-08-03T18:48:28Z) - Medical Image De-Identification Benchmark Challenge [1.491270549044044]
The aim of the MIDI-B Challenge was to provide a standardized platform for benchmarking of DICOM image deID tools.<n>The challenge employed a large, diverse, multi-center, and multi-modality set of real de-identified radiology images with synthetic PHI/PII inserted.<n>Ten teams successfully completed the test phase of the challenge.
arXiv Detail & Related papers (2025-07-31T14:47:20Z) - De-Identification of Medical Imaging Data: A Comprehensive Tool for Ensuring Patient Privacy [4.376648893167674]
Open-source tool can be used to de-identify DICOM magnetic resonance images, computer images, whole slide images and magnetic resonance twix raw data.
Proposal comprises an elaborate anonymization pipeline for multiple types of inputs, reducing the need for additional tools used for de-identification of imaging data.
arXiv Detail & Related papers (2024-10-16T09:31:24Z) - Disentangle Before Anonymize: A Two-stage Framework for Attribute-preserved and Occlusion-robust De-identification [55.741525129613535]
"Disentangle Before Anonymize" is a novel two-stage Framework(DBAF)<n>This framework includes a Contrastive Identity Disentanglement (CID) module and a Key-authorized Reversible Identity Anonymization (KRIA) module.<n>Extensive experiments demonstrate that our method outperforms state-of-the-art de-identification approaches.
arXiv Detail & Related papers (2023-11-15T08:59:02Z) - Privacy Protection in MRI Scans Using 3D Masked Autoencoders [2.463789441707266]
Data anonymization and de-identification is concerned with ensuring the privacy and confidentiality of individuals' personal information.
We propose CP-MAE, a model that de-identifies the face by remodeling it.
With our method we are able to synthesize high-fidelity scans of resolution up to $2563$ -- compared to $1283$ with previous approaches.
arXiv Detail & Related papers (2023-10-24T12:25:37Z) - DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4 [80.36535668574804]
We develop a novel GPT4-enabled de-identification framework (DeID-GPT")
Our developed DeID-GPT showed the highest accuracy and remarkable reliability in masking private information from the unstructured medical text.
This study is one of the earliest to utilize ChatGPT and GPT-4 for medical text data processing and de-identification.
arXiv Detail & Related papers (2023-03-20T11:34:37Z) - ConfounderGAN: Protecting Image Data Privacy with Causal Confounder [85.6757153033139]
We propose ConfounderGAN, a generative adversarial network (GAN) that can make personal image data unlearnable to protect the data privacy of its owners.
Experiments are conducted in six image classification datasets, consisting of three natural object datasets and three medical datasets.
arXiv Detail & Related papers (2022-12-04T08:49:14Z) - Application of Homomorphic Encryption in Medical Imaging [60.51436886110803]
We show how HE can be used to make predictions over medical images while preventing unauthorized secondary use of data.
We report some experiments using 3D chest CT-Scans for a nodule detection task.
arXiv Detail & Related papers (2021-10-12T19:57:12Z) - Soft-Label Anonymous Gastric X-ray Image Distillation [49.24576562557866]
This paper presents a soft-label anonymous gastric X-ray image distillation method based on a gradient descent approach.
Experimental results show that the proposed method can not only effectively compress the medical dataset but also anonymize medical images to protect the patient's private information.
arXiv Detail & Related papers (2021-04-07T02:04:12Z) - MASK: A flexible framework to facilitate de-identification of clinical
texts [2.3015324171336378]
We present MASK, a software package that is designed to perform the de-identification task.
The software is able to perform named entity recognition using some of the state-of-the-art techniques and then mask or redact recognized entities.
arXiv Detail & Related papers (2020-05-24T08:53:00Z) - Gradient-Induced Co-Saliency Detection [81.54194063218216]
Co-saliency detection (Co-SOD) aims to segment the common salient foreground in a group of relevant images.
In this paper, inspired by human behavior, we propose a gradient-induced co-saliency detection method.
arXiv Detail & Related papers (2020-04-28T08:40:55Z) - Towards Face Encryption by Generating Adversarial Identity Masks [53.82211571716117]
We propose a targeted identity-protection iterative method (TIP-IM) to generate adversarial identity masks.
TIP-IM provides 95%+ protection success rate against various state-of-the-art face recognition models.
arXiv Detail & Related papers (2020-03-15T12:45:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.