Related papers: Unsupervised Multi-Attention Meta Transformer for Rotating Machinery Fault Diagnosis

Unsupervised Multi-Attention Meta Transformer for Rotating Machinery Fault Diagnosis

URL: http://arxiv.org/abs/2509.09251v1
Date: Thu, 11 Sep 2025 08:35:43 GMT
Title: Unsupervised Multi-Attention Meta Transformer for Rotating Machinery Fault Diagnosis
Authors: Hanyang Wang, Yuxuan Yang, Hongjun Wang, Lihui Wang,
Abstract summary: We propose a Multi-Attention Meta Transformer method for few-shot unsupervised rotating machinery fault diagnosis (MMT-FD)<n>This framework extracts potential fault representations from unlabeled data and demonstrates strong generalization capabilities.<n>The model is iteratively optimized using a small number of contrastive learning iterations, resulting in high efficiency.
Score: 4.9825074884178955
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The intelligent fault diagnosis of rotating mechanical equipment usually requires a large amount of labeled sample data. However, in practical industrial applications, acquiring enough data is both challenging and expensive in terms of time and cost. Moreover, different types of rotating mechanical equipment with different unique mechanical properties, require separate training of diagnostic models for each case. To address the challenges of limited fault samples and the lack of generalizability in prediction models for practical engineering applications, we propose a Multi-Attention Meta Transformer method for few-shot unsupervised rotating machinery fault diagnosis (MMT-FD). This framework extracts potential fault representations from unlabeled data and demonstrates strong generalization capabilities, making it suitable for diagnosing faults across various types of mechanical equipment. The MMT-FD framework integrates a time-frequency domain encoder and a meta-learning generalization model. The time-frequency domain encoder predicts status representations generated through random augmentations in the time-frequency domain. These enhanced data are then fed into a meta-learning network for classification and generalization training, followed by fine-tuning using a limited amount of labeled data. The model is iteratively optimized using a small number of contrastive learning iterations, resulting in high efficiency. To validate the framework, we conducted experiments on a bearing fault dataset and rotor test bench data. The results demonstrate that the MMT-FD model achieves 99\% fault diagnosis accuracy with only 1\% of labeled sample data, exhibiting robust generalization capabilities.

Related papers

FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis [92.23551599659186]
Time series analysis plays a vital role in fields such as finance, healthcare, industry, and meteorology.<n>FusAD is a unified analysis framework designed for diverse time series tasks.
arXiv Detail & Related papers (2025-12-16T04:34:27Z)
A Multimodal Lightweight Approach to Fault Diagnosis of Induction Motors in High-Dimensional Dataset [1.148237645450678]
An accurate AI-based diagnostic system for induction motors (IMs) holds the potential to enhance proactive maintenance, mitigating unplanned downtime and curbing overall maintenance costs within an industrial environment.<n>Researchers have proposed various fault diagnosis approaches using signal processing (SP), machine learning (ML), deep learning (DL) and hybrid architectures for BRB faults.<n>This paper implements large-scale data of BRB faults by using a transfer-learning-based lightweight DL model named ShuffleNetV2 for diagnosing one, two, three, and four BRB faults using current and vibration signal data.
arXiv Detail & Related papers (2025-01-07T12:40:11Z)
RmGPT: A Foundation Model with Generative Pre-trained Transformer for Fault Diagnosis and Prognosis in Rotating Machinery [20.52039868199533]
Current methods of Prognostics and Health Management (PHM) often rely on task-specific models.<n>Inspired by advancements in generative pretrained models, we propose RmGPT, a unified model for diagnosis and prognosis tasks.<n>RmGPT significantly outperforms state-of-the-art algorithms, achieving near-perfect accuracy in diagnosis tasks and exceptionally low errors in prognosis tasks.
arXiv Detail & Related papers (2024-09-26T07:40:47Z)
Causal Disentanglement Hidden Markov Model for Fault Diagnosis [55.90917958154425]
We propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism. Specifically, we make full use of the time-series data and progressively disentangle the vibration signal into fault-relevant and fault-irrelevant factors. To expand the scope of the application, we adopt unsupervised domain adaptation to transfer the learned disentangled representations to other working environments.
arXiv Detail & Related papers (2023-08-06T05:58:45Z)
Fault Detection and Diagnosis with Imbalanced and Noisy Data: A Hybrid Framework for Rotating Machinery [2.580765958706854]
Fault diagnosis plays an essential role in reducing the maintenance costs of rotating machinery manufacturing systems. Traditional Fault Detection and Diagnosis (FDD) frameworks get poor performances when dealing with real-world circumstances. This paper proposes a hybrid framework which uses the three aforementioned components to achieve an effective signal-based FDD system.
arXiv Detail & Related papers (2022-02-09T01:09:59Z)
DAE : Discriminatory Auto-Encoder for multivariate time-series anomaly detection in air transportation [68.8204255655161]
We propose a novel anomaly detection model called Discriminatory Auto-Encoder (DAE) It uses the baseline of a regular LSTM-based auto-encoder but with several decoders, each getting data of a specific flight phase. Results show that the DAE achieves better results in both accuracy and speed of detection.
arXiv Detail & Related papers (2021-09-08T14:07:55Z)
Detecting Faults during Automatic Screwdriving: A Dataset and Use Case of Anomaly Detection for Automatic Screwdriving [80.6725125503521]
Data-driven approaches, using Machine Learning (ML) for detecting faults have recently gained increasing interest. We present a use case of using ML models for detecting faults during automated screwdriving operations.
arXiv Detail & Related papers (2021-07-05T11:46:00Z)
Quick Learning Mechanism with Cross-Domain Adaptation for Intelligent Fault Diagnosis [11.427019313283997]
This paper presents a quick learning mechanism for intelligent fault diagnosis of rotating machines operating under changeable working conditions. We propose a quick learning method with Net2Net transformation followed by a fine-tuning method to cancel/minimize the maximum mean discrepancy of the new data to the previous one. The effectiveness of the proposed fault diagnosis method has been demonstrated on the CWRU dataset, IMS bearing dataset, and Paderborn university dataset.
arXiv Detail & Related papers (2021-03-16T07:24:37Z)
TELESTO: A Graph Neural Network Model for Anomaly Classification in Cloud Services [77.454688257702]
Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance. One direction aims at the recognition of re-occurring anomaly types to enable remediation automation. We propose a method that is invariant to dimensionality changes of given data.
arXiv Detail & Related papers (2021-02-25T14:24:49Z)
Data Anomaly Detection for Structural Health Monitoring of Bridges using Shapelet Transform [0.0]
A number of Structural Health Monitoring (SHM) systems are deployed to monitor civil infrastructure. The data measured by the SHM systems tend to be affected by multiple anomalies caused by faulty or broken sensors. This paper proposes the use of a relatively new time series representation named Shapelet Transform to autonomously identify anomalies in SHM data.
arXiv Detail & Related papers (2020-08-31T01:11:04Z)
Unsupervised Anomaly Detection with Adversarial Mirrored AutoEncoders [51.691585766702744]
We propose a variant of Adversarial Autoencoder which uses a mirrored Wasserstein loss in the discriminator to enforce better semantic-level reconstruction. We put forward an alternative measure of anomaly score to replace the reconstruction-based metric. Our method outperforms the current state-of-the-art methods for anomaly detection on several OOD detection benchmarks.
arXiv Detail & Related papers (2020-03-24T08:26:58Z)
SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples. We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)

This list is automatically generated from the titles and abstracts of the papers in this site.