Performance, Successes and Limitations of Deep Learning Semantic
Segmentation of Multiple Defects in Transmission Electron Micrographs
- URL: http://arxiv.org/abs/2110.08244v1
- Date: Fri, 15 Oct 2021 17:57:59 GMT
- Title: Performance, Successes and Limitations of Deep Learning Semantic
Segmentation of Multiple Defects in Transmission Electron Micrographs
- Authors: Ryan Jacobs, Mingren Shen, Yuhan Liu, Wei Hao, Xiaoshan Li, Ruoyu He,
Jacob RC Greaves, Donglin Wang, Zeming Xie, Zitong Huang, Chao Wang, Kevin G.
Field, Dane Morgan
- Abstract summary: We perform semantic segmentation of defect types in electron microscopy images of irradiated FeCrAl alloys using a deep learning Mask Regional Convolutional Neural Network (Mask R-CNN) model.
We conduct an in-depth analysis of key model performance statistics, with a focus on quantities such as predicted distributions of defect shapes, defect sizes, and defect areal densities.
Overall, we find that the current model is a fast, effective tool for automatically characterizing and quantifying multiple defect types in microscopy images.
- Score: 9.237363938772479
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we perform semantic segmentation of multiple defect types in
electron microscopy images of irradiated FeCrAl alloys using a deep learning
Mask Regional Convolutional Neural Network (Mask R-CNN) model. We conduct an
in-depth analysis of key model performance statistics, with a focus on
quantities such as predicted distributions of defect shapes, defect sizes, and
defect areal densities relevant to informing modeling and understanding of
irradiated Fe-based materials properties. To better understand the performance
and present limitations of the model, we provide examples of useful evaluation
tests which include a suite of random splits, and dataset size-dependent and
domain-targeted cross validation tests. Overall, we find that the current model
is a fast, effective tool for automatically characterizing and quantifying
multiple defect types in microscopy images, with a level of accuracy on par
with human domain expert labelers. More specifically, the model can achieve
average defect identification F1 scores as high as 0.8, and, based on random
cross validation, have low overall average (+/- standard deviation) defect size
and density percentage errors of 7.3 (+/- 3.8)% and 12.7 (+/- 5.3)%,
respectively. Further, our model predicts the expected material hardening to
within 10-20 MPa (about 10% of total hardening), which is about the same error
level as experiments. Our targeted evaluation tests also suggest the best path
toward improving future models is not expanding existing databases with more
labeled images but instead data additions that target weak points of the model
domain, such as images from different microscopes, imaging conditions,
irradiation environments, and alloy types. Finally, we discuss the first phase
of an effort to provide an easy-to-use, open-source object detection tool to
the broader community for identifying defects in new images.
Related papers
- NCT-CRC-HE: Not All Histopathological Datasets Are Equally Useful [15.10324445908774]
In this paper, we analyze a popular NCT-CRC-HE-100K colorectal cancer dataset used in numerous prior works.
We show that both this dataset and the obtained results may be affected by data-specific biases.
We show that even the simplest model using only 3 features per image can demonstrate over 50% accuracy on this 9-class dataset.
arXiv Detail & Related papers (2024-09-17T20:36:03Z) - Accelerating Domain-Aware Electron Microscopy Analysis Using Deep Learning Models with Synthetic Data and Image-Wide Confidence Scoring [0.0]
We create a physics-based synthetic image and data generator, resulting in a machine learning model that achieves comparable precision (0.86), recall (0.63), F1 scores (0.71), and engineering property predictions (R2=0.82)
Our study demonstrates that synthetic data can eliminate human reliance in ML and provides a means for domain awareness in cases where many feature detections per image are needed.
arXiv Detail & Related papers (2024-08-02T20:15:15Z) - Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection.
Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels.
Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z) - Deep Learning-Based Defect Classification and Detection in SEM Images [1.9206693386750882]
In particular, we train RetinaNet models using different ResNet, VGGNet architectures as backbone.
We propose a preference-based ensemble strategy to combine the output predictions from different models in order to achieve better performance on classification and detection of defects.
arXiv Detail & Related papers (2022-06-20T16:34:11Z) - Fake It Till You Make It: Near-Distribution Novelty Detection by
Score-Based Generative Models [54.182955830194445]
existing models either fail or face a dramatic drop under the so-called near-distribution" setting.
We propose to exploit a score-based generative model to produce synthetic near-distribution anomalous data.
Our method improves the near-distribution novelty detection by 6% and passes the state-of-the-art by 1% to 5% across nine novelty detection benchmarks.
arXiv Detail & Related papers (2022-05-28T02:02:53Z) - Image-to-Image Regression with Distribution-Free Uncertainty
Quantification and Applications in Imaging [88.20869695803631]
We show how to derive uncertainty intervals around each pixel that are guaranteed to contain the true value.
We evaluate our procedure on three image-to-image regression tasks.
arXiv Detail & Related papers (2022-02-10T18:59:56Z) - MEMO: Test Time Robustness via Adaptation and Augmentation [131.28104376280197]
We study the problem of test time robustification, i.e., using the test input to improve model robustness.
Recent prior works have proposed methods for test time adaptation, however, they each introduce additional assumptions.
We propose a simple approach that can be used in any test setting where the model is probabilistic and adaptable.
arXiv Detail & Related papers (2021-10-18T17:55:11Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Same Same But DifferNet: Semi-Supervised Defect Detection with
Normalizing Flows [24.734388664558708]
We propose DifferNet: It leverages the descriptiveness of features extracted by convolutional neural networks to estimate their density.
Based on these likelihoods we develop a scoring function that indicates defects.
We demonstrate the superior performance over existing approaches on the challenging and newly proposed MVTec AD and Magnetic Tile Defects datasets.
arXiv Detail & Related papers (2020-08-28T10:49:28Z) - Learning-based Defect Recognition for Quasi-Periodic Microscope Images [0.0]
We propose a semi-supervised machine learning method that assists in the detection of lattice defects from atomic resolution microscope images.
It involves a convolutional neural network that classifies image patches as defective or non-defective, a graph-based that chooses one non-defective patch as a model, and finally an automatically generated convolutional filter bank.
The algorithm is tested on III-V/Si crystalline materials and successfully evaluated against different metrics, showing promising results even for extremely small training data sets.
arXiv Detail & Related papers (2020-07-02T18:00:02Z) - Collaborative Boundary-aware Context Encoding Networks for Error Map
Prediction [65.44752447868626]
We propose collaborative boundaryaware context encoding networks called AEP-Net for error prediction task.
Specifically, we propose a collaborative feature transformation branch for better feature fusion between images and masks, and precise localization of error regions.
The AEP-Net achieves an average DSC of 0.8358, 0.8164 for error prediction task, and shows a high Pearson correlation coefficient of 0.9873.
arXiv Detail & Related papers (2020-06-25T12:42:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.