Is Medical Chest X-ray Data Anonymous?
- URL: http://arxiv.org/abs/2103.08562v1
- Date: Mon, 15 Mar 2021 17:26:43 GMT
- Title: Is Medical Chest X-ray Data Anonymous?
- Authors: Kai Packh\"auser, Sebastian G\"undel, Nicolas M\"unster, Christopher
Syben, Vincent Christlein, Andreas Maier
- Abstract summary: We show that a well-trained deep learning system is able to recover the patient identity from chest X-ray data.
We demonstrate this using the publicly available large-scale ChestX-ray14 dataset.
- Score: 8.29994774042507
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the rise and ever-increasing potential of deep learning techniques in
recent years, publicly available medical data sets became a key factor to
enable reproducible development of diagnostic algorithms in the medical domain.
Medical data contains sensitive patient-related information and is therefore
usually anonymized by removing patient identifiers, e.g., patient names before
publication. To the best of our knowledge, we are the first to show that a
well-trained deep learning system is able to recover the patient identity from
chest X-ray data. We demonstrate this using the publicly available large-scale
ChestX-ray14 dataset, a collection of 112,120 frontal-view chest X-ray images
from 30,805 unique patients. Our verification system is able to identify
whether two frontal chest X-ray images are from the same person with an AUC of
0.9940 and a classification accuracy of 95.55%. We further highlight that the
proposed system is able to reveal the same person even ten and more years after
the initial scan. When pursuing a retrieval approach, we observe an mAP@R of
0.9748 and a precision@1 of 0.9963. Based on this high identification rate, a
potential attacker may leak patient-related information and additionally
cross-reference images to obtain more information. Thus, there is a great risk
of sensitive content falling into unauthorized hands or being disseminated
against the will of the concerned patients. Especially during the COVID-19
pandemic, numerous chest X-ray datasets have been published to advance
research. Therefore, such data may be vulnerable to potential attacks by deep
learning-based re-identification algorithms.
Related papers
- Revisiting Computer-Aided Tuberculosis Diagnosis [56.80999479735375]
Tuberculosis (TB) is a major global health threat, causing millions of deaths annually.
Computer-aided tuberculosis diagnosis (CTD) using deep learning has shown promise, but progress is hindered by limited training data.
We establish a large-scale dataset, namely the Tuberculosis X-ray (TBX11K) dataset, which contains 11,200 chest X-ray (CXR) images with corresponding bounding box annotations for TB areas.
This dataset enables the training of sophisticated detectors for high-quality CTD.
arXiv Detail & Related papers (2023-07-06T08:27:48Z) - Deep Reinforcement Learning Framework for Thoracic Diseases
Classification via Prior Knowledge Guidance [49.87607548975686]
The scarcity of labeled data for related diseases poses a huge challenge to an accurate diagnosis.
We propose a novel deep reinforcement learning framework, which introduces prior knowledge to direct the learning of diagnostic agents.
Our approach's performance was demonstrated using the well-known NIHX-ray 14 and CheXpert datasets.
arXiv Detail & Related papers (2023-06-02T01:46:31Z) - X-ray Recognition: Patient identification from X-rays using a
contrastive objective [8.042682839888982]
We show that deep learning models are surprisingly accurate at distinguishing CXRs belonging to the same patient from those belonging to different patients.
These findings suggest potential privacy considerations with the proliferation of large public CXR databases.
arXiv Detail & Related papers (2023-04-29T01:51:54Z) - Deep Learning-based Anonymization of Chest Radiographs: A
Utility-preserving Measure for Patient Privacy [7.240611820374677]
The conventional anonymization process is carried out by obscuring personal information in the images with black boxes.
Such simple measures retain biometric information in the chest radiographs, allowing patients to be re-identified by a linkage attack.
We propose the first deep learning-based approach (PriCheXy-Net) to targetedly anonymize chest radiographs.
arXiv Detail & Related papers (2022-09-23T11:36:32Z) - Cardiomegaly Detection using Deep Convolutional Neural Network with
U-Net [0.0]
A Deep learning-based customized retrained U-Net model for detecting Cardiomegaly disease is presented.
The work used a chest x-ray image dataset to simulate and produced a diagnostic accuracy of 94%.
arXiv Detail & Related papers (2022-05-23T04:02:20Z) - Generative Residual Attention Network for Disease Detection [51.60842580044539]
We present a novel approach for disease generation in X-rays using a conditional generative adversarial learning.
We generate a corresponding radiology image in a target domain while preserving the identity of the patient.
We then use the generated X-ray image in the target domain to augment our training to improve the detection performance.
arXiv Detail & Related papers (2021-10-25T14:15:57Z) - The pitfalls of using open data to develop deep learning solutions for
COVID-19 detection in chest X-rays [64.02097860085202]
Deep learning models have been developed to identify COVID-19 from chest X-rays.
Results have been exceptional when training and testing on open-source data.
Data analysis and model evaluations show that the popular open-source dataset COVIDx is not representative of the real clinical problem.
arXiv Detail & Related papers (2021-09-14T10:59:11Z) - Learning Invariant Feature Representation to Improve Generalization
across Chest X-ray Datasets [55.06983249986729]
We show that a deep learning model performing well when tested on the same dataset as training data starts to perform poorly when it is tested on a dataset from a different source.
By employing an adversarial training strategy, we show that a network can be forced to learn a source-invariant representation.
arXiv Detail & Related papers (2020-08-04T07:41:15Z) - Searching for Pneumothorax in Half a Million Chest X-Ray Images [0.0]
Pneumothorax, a collapsed or dropped lung, is a fatal condition typically detected on a chest X-ray by an experienced radiologist.
In this study, we explored the use of image search to classify pneumothorax among chest X-ray images.
It is the first study to demonstrate that deep pretrained features can be used for CBIR of pneumothorax in half a million chest X-ray images.
arXiv Detail & Related papers (2020-07-30T13:03:52Z) - Y-Net for Chest X-Ray Preprocessing: Simultaneous Classification of
Geometry and Segmentation of Annotations [70.0118756144807]
This work introduces a general pre-processing step for chest x-ray input into machine learning algorithms.
A modified Y-Net architecture based on the VGG11 encoder is used to simultaneously learn geometric orientation and segmentation of radiographs.
Results were evaluated by expert clinicians, with acceptable geometry in 95.8% and annotation mask in 96.2%, compared to 27.0% and 34.9% respectively in control images.
arXiv Detail & Related papers (2020-05-08T02:16:17Z) - Unsupervised Anomaly Detection for X-Ray Images [4.353258086186526]
We investigate how unsupervised methods trained on images without anomalies can be used to assist doctors in evaluating X-ray images of hands.
We adopt state-of-the-art approaches for unsupervised learning to detect anomalies and show how the outputs of these methods can be explained.
We provide an extensive evaluation of different approaches and demonstrate empirically that even without labels it is possible to achieve satisfying results on a real-world dataset of X-ray images of hands.
arXiv Detail & Related papers (2020-01-29T15:14:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.