A tomographic workflow to enable deep learning for X-ray based foreign
object detection
- URL: http://arxiv.org/abs/2201.12184v1
- Date: Fri, 28 Jan 2022 15:33:20 GMT
- Title: A tomographic workflow to enable deep learning for X-ray based foreign
object detection
- Authors: Math\'e T. Zeegers, Tristan van Leeuwen, Dani\"el M. Pelt, Sophia
Bethany Coban, Robert van Liere, Kees Joost Batenburg
- Abstract summary: We propose a Computed Tomography based method for producing training data for supervised learning of foreign object detection.
High-quality ground truth locations of the foreign objects are obtained through accurate 3D reconstructions and segmentations.
We outline the benefits of objectively and reproducibly generating training data in this way compared to conventional radiograph annotation.
- Score: 0.7829352305480283
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Detection of unwanted (`foreign') objects within products is a common
procedure in many branches of industry for maintaining production quality.
X-ray imaging is a fast, non-invasive and widely applicable method for foreign
object detection. Deep learning has recently emerged as a powerful approach for
recognizing patterns in radiographs (i.e., X-ray images), enabling automated
X-ray based foreign object detection. However, these methods require a large
number of training examples and manual annotation of these examples is a
subjective and laborious task. In this work, we propose a Computed Tomography
(CT) based method for producing training data for supervised learning of
foreign object detection, with minimal labour requirements. In our approach, a
few representative objects are CT scanned and reconstructed in 3D. The
radiographs that have been acquired as part of the CT-scan data serve as input
for the machine learning method. High-quality ground truth locations of the
foreign objects are obtained through accurate 3D reconstructions and
segmentations. Using these segmented volumes, corresponding 2D segmentations
are obtained by creating virtual projections. We outline the benefits of
objectively and reproducibly generating training data in this way compared to
conventional radiograph annotation. In addition, we show how the accuracy
depends on the number of objects used for the CT reconstructions. The results
show that in this workflow generally only a relatively small number of
representative objects (i.e., fewer than 10) are needed to achieve adequate
detection performance in an industrial setting. Moreover, for real experimental
data we show that the workflow leads to higher foreign object detection
accuracies than with standard radiograph annotation.
Related papers
- A Fast, Scalable, and Robust Deep Learning-based Iterative Reconstruction Framework for Accelerated Industrial Cone-beam X-ray Computed Tomography [5.104810959579395]
Cone-beam X-ray Computed Tomography (XCT) with large detectors and corresponding large-scale 3D reconstruction plays a pivotal role in micron-scale characterization of materials and parts across various industries.
We present a novel deep neural network-based iterative algorithm that integrates an artifact reduction-trained CNN as a prior model with automated regularization parameter selection.
arXiv Detail & Related papers (2025-01-21T19:34:01Z) - Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images [0.8793721044482612]
This study introduces a novel application of denoising diffusion probabilistic models (DDPMs) to the landmark detection task.
Our key innovation lies in leveraging DDPMs for self-supervised pre-training in landmark detection.
This method enables accurate landmark detection with minimal annotated training data.
arXiv Detail & Related papers (2024-07-25T15:32:59Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - DR-WLC: Dimensionality Reduction cognition for object detection and pose
estimation by Watching, Learning and Checking [30.58114448119465]
Existing object detection and pose estimation methods mostly adopt the same-dimensional data for training.
DR-WLC, a dimensionality reduction cognitive model, can perform both object detection and pose estimation tasks at the same time.
arXiv Detail & Related papers (2023-01-17T15:08:32Z) - Fusing Local Similarities for Retrieval-based 3D Orientation Estimation
of Unseen Objects [70.49392581592089]
We tackle the task of estimating the 3D orientation of previously-unseen objects from monocular images.
We follow a retrieval-based strategy and prevent the network from learning object-specific features.
Our experiments on the LineMOD, LineMOD-Occluded, and T-LESS datasets show that our method yields a significantly better generalization to unseen objects than previous works.
arXiv Detail & Related papers (2022-03-16T08:53:00Z) - CutPaste: Self-Supervised Learning for Anomaly Detection and
Localization [59.719925639875036]
We propose a framework for building anomaly detectors using normal training data only.
We first learn self-supervised deep representations and then build a generative one-class classifier on learned representations.
Our empirical study on MVTec anomaly detection dataset demonstrates the proposed algorithm is general to be able to detect various types of real-world defects.
arXiv Detail & Related papers (2021-04-08T19:04:55Z) - Few-Cost Salient Object Detection with Adversarial-Paced Learning [95.0220555274653]
This paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only.
We name this task as the few-cost salient object detection and propose an adversarial-paced learning (APL)-based framework to facilitate the few-cost learning scenario.
arXiv Detail & Related papers (2021-04-05T14:15:49Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Tubular Shape Aware Data Generation for Semantic Segmentation in Medical
Imaging [2.6673784948574215]
We present an approach for synthetic data generation of the tube-shaped objects, with a generative adversarial network being regularized with a prior-shape constraint.
Our method eliminates the need for paired image-mask data and requires only a weakly-labeled dataset.
We report the applicability of the approach for the task of segmenting tubes and catheters in the X-ray images.
arXiv Detail & Related papers (2020-10-02T10:28:25Z) - Separation of target anatomical structure and occlusions in chest
radiographs [2.0478628221188497]
We propose a Fully Convolutional Network to suppress, for a specific task, undesired visual structure from radiographs.
The proposed algorithm creates reconstructed radiographs and ground-truth data from high resolution CT-scans.
arXiv Detail & Related papers (2020-02-03T14:01:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.