A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer
- URL: http://arxiv.org/abs/2512.11977v1
- Date: Fri, 12 Dec 2025 19:03:31 GMT
- Title: A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer
- Authors: Sushmita Nath,
- Abstract summary: This study investigates the use of the Data-Efficient Image Transformer (DeiT) for classifying wafer map defects under data-constrained conditions.<n> Experimental results reveal that the DeiT model achieves highest classification accuracy of 90.83%, outperforming CNN models such as VGG-19(65%), SqueezeNet(82%), Xception(66%) and Hybrid(67%)
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Predictive maintenance is an important sector in modern industries which improves fault detection and cost reduction processes. By using machine learning algorithms in the whole process, the defects detection process can be implemented smoothly. Semiconductor is a sensitive maintenance field that requires predictability in work. While convolutional neural networks (CNNs) such as VGG-19, Xception and Squeeze-Net have demonstrated solid performance in image classification for semiconductor wafer industry, their effectiveness often declines in scenarios with limited and imbalanced data. This study investigates the use of the Data-Efficient Image Transformer (DeiT) for classifying wafer map defects under data-constrained conditions. Experimental results reveal that the DeiT model achieves highest classification accuracy of 90.83%, outperforming CNN models such as VGG-19(65%), SqueezeNet(82%), Xception(66%) and Hybrid(67%). DeiT also demonstrated superior F1-score (90.78%) and faster training convergence, with enhanced robustness in detecting minority defect classes. These findings highlight the potential of transformer-based models like DeiT in semiconductor wafer defect detection and support predictive maintenance strategies within semiconductor fabrication processes.
Related papers
- PatchFlow: Leveraging a Flow-Based Model with Patch Features [3.3188882066534355]
Die casting plays a crucial role across various industries due to its ability to craft intricate shapes with high precision and smooth surfaces.<n>Computer vision techniques have been explored to automate and improve defect detection.
arXiv Detail & Related papers (2026-02-05T02:57:28Z) - Validating Vision Transformers for Otoscopy: Performance and Data-Leakage Effects [42.465094107111646]
This study evaluates the efficacy of vision transformer models, specifically Swin transformers, in enhancing the diagnostic accuracy of ear diseases.<n>The research utilised a real-world dataset from the Department of Otolaryngology at the Clinical Hospital of the Universidad de Chile.
arXiv Detail & Related papers (2025-11-06T23:20:37Z) - Boosted Training of Lightweight Early Exits for Optimizing CNN Image Classification Inference [47.027290803102666]
We introduce a sequential training approach that aligns branch training with inference-time data distributions.<n>Experiments on the CINIC-10 dataset with a ResNet18 backbone demonstrate that BTS-EE consistently outperforms non-boosted training.<n>These results offer practical efficiency gains for applications such as industrial inspection, embedded vision, and UAV-based monitoring.
arXiv Detail & Related papers (2025-09-10T06:47:49Z) - Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning [42.558423984270135]
Federated Learning (FL) enables the training of machine learning models across decentralized clients while preserving data privacy.<n>The presence of anomalous or corrupted clients can significantly degrade model performance.<n>We propose WAFFLE a detection algorithm that labels malicious clients it before training<n>A lightweight detector, trained on a distillated public dataset, performs the labeling with minimal communication and computational overhead.
arXiv Detail & Related papers (2025-06-11T12:48:00Z) - Semiconductor SEM Image Defect Classification Using Supervised and Semi-Supervised Learning with Vision Transformers [31.64631761575222]
This work proposes application of vision transformer (ViT) neural networks for automatic defect classification (ADC) of scanning electron microscope (SEM) images of wafer defects.<n>We studied 11 defect types from over 7400 total images and investigated the potential of transfer learning of DinoV2 and semi-supervised learning for improved classification accuracy and efficient computation.
arXiv Detail & Related papers (2025-06-03T19:34:54Z) - Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control [0.0]
This paper presents a novel approach using Denoising Diffusion Probabilistic Models (DDPMs) to generate synthetic defective glass product images for data augmentation.<n>The methodology significantly enhances image classification performance of standard CNN architectures.<n>The most dramatic improvement was observed in ResNet50V2's overall classification accuracy, which increased from 78% to 93% when trained with the augmented data.
arXiv Detail & Related papers (2025-05-06T03:16:56Z) - Advanced technology in railway track monitoring using the GPR Technique: A Review [41.94295877935867]
Ground Penetrating Radar (GPR) is an electromagnetic survey technique that can be used to monitor railway tracks.<n>It can detect defects such as ballast pockets, fouled ballast, poor drainage, and subgrade settlement.<n>This paper demonstrates the current techniques for using synthetic modeling to calibrate real-world GPR data.<n>Deep learning techniques, particularly Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) are also highlighted for their effectiveness in recognizing patterns associated with defects in GPR images.
arXiv Detail & Related papers (2025-01-19T18:01:39Z) - Wafer Map Defect Classification Using Autoencoder-Based Data Augmentation and Convolutional Neural Network [4.8748194765816955]
This study proposes a novel method combining a self-encoder-based data augmentation technique with a convolutional neural network (CNN)
The proposed method achieves a classification accuracy of 98.56%, surpassing Random Forest, SVM, and Logistic Regression by 19%, 21%, and 27%, respectively.
arXiv Detail & Related papers (2024-11-17T10:19:54Z) - Utilizing Generative Adversarial Networks for Image Data Augmentation and Classification of Semiconductor Wafer Dicing Induced Defects [0.21990652930491852]
In semiconductor manufacturing, the wafer dicing process is central yet vulnerable to defects that significantly impair yield.
Deep neural networks are the current state of the art in (semi-)automated visual inspection.
We explore the application of generative adversarial networks (GAN) for image data augmentation and classification of semiconductor wafer dicing induced defects.
arXiv Detail & Related papers (2024-07-24T20:44:16Z) - Wafer Map Defect Patterns Semi-Supervised Classification Using Latent
Vector Representation [8.400553138721044]
The demand for defect detection during integrated circuit fabrication stages is becoming increasingly critical.
Traditional wafer map defect pattern detection methods involve manual inspection using electron microscopes.
We propose a model capable of automatically detecting defects as an alternative to manual operations.
arXiv Detail & Related papers (2023-10-06T08:23:36Z) - SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor
Defect Classification and Segmentation [0.11999555634662631]
This work is the first demonstration to accurately detect and precisely segment semiconductor defect patterns by using a diffusion model.
Our proposed approach outperforms previous work on overall mAP and performs comparatively better or as per for almost all defect classes.
arXiv Detail & Related papers (2023-07-17T17:53:36Z) - Boosting Facial Expression Recognition by A Semi-Supervised Progressive
Teacher [54.50747989860957]
We propose a semi-supervised learning algorithm named Progressive Teacher (PT) to utilize reliable FER datasets as well as large-scale unlabeled expression images for effective training.
Experiments on widely-used databases RAF-DB and FERPlus validate the effectiveness of our method, which achieves state-of-the-art performance with accuracy of 89.57% on RAF-DB.
arXiv Detail & Related papers (2022-05-28T07:47:53Z) - Truncated tensor Schatten p-norm based approach for spatiotemporal
traffic data imputation with complicated missing patterns [77.34726150561087]
We introduce four complicated missing patterns, including missing and three fiber-like missing cases according to the mode-drivenn fibers.
Despite nonity of the objective function in our model, we derive the optimal solutions by integrating alternating data-mputation method of multipliers.
arXiv Detail & Related papers (2022-05-19T08:37:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.