Physics-Guided Tiny-Mamba Transformer for Reliability-Aware Early Fault Warning
- URL: http://arxiv.org/abs/2601.21293v1
- Date: Thu, 29 Jan 2026 05:46:12 GMT
- Title: Physics-Guided Tiny-Mamba Transformer for Reliability-Aware Early Fault Warning
- Authors: Changyu Li, Dingcheng Huang, Kexuan Yao, Xiaoya Ni, Lijuan Shen, Fei Luo,
- Abstract summary: We propose the Physics-Guided Tiny-Mamba Transformer (PG-TMT), a compact tri-branch encoder tailored for online condition monitoring.<n>A depthwise-separable convolutional stem captures micro-transients, a Tiny-Mamba state-space branch models near-linear long-range dynamics, and a lightweight local Transformer encodes cross-channel resonances.<n>PG-TMT delivers, interpretable, and deployment-ready early warnings for reliability-centric prognostics and health management.
- Score: 2.491980790083337
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Reliability-centered prognostics for rotating machinery requires early warning signals that remain accurate under nonstationary operating conditions, domain shifts across speed/load/sensors, and severe class imbalance, while keeping the false-alarm rate small and predictable. We propose the Physics-Guided Tiny-Mamba Transformer (PG-TMT), a compact tri-branch encoder tailored for online condition monitoring. A depthwise-separable convolutional stem captures micro-transients, a Tiny-Mamba state-space branch models near-linear long-range dynamics, and a lightweight local Transformer encodes cross-channel resonances. We derive an analytic temporal-to-spectral mapping that ties the model's attention spectrum to classical bearing fault-order bands, yielding a band-alignment score that quantifies physical plausibility and provides physics-grounded explanations. To ensure decision reliability, healthy-score exceedances are modeled with extreme-value theory (EVT), which yields an on-threshold achieving a target false-alarm intensity (events/hour); a dual-threshold hysteresis with a minimum hold time further suppresses chatter. Under a leakage-free streaming protocol with right-censoring of missed detections on CWRU, Paderborn, XJTU-SY, and an industrial pilot, PG-TMT attains higher precision-recall AUC (primary under imbalance), competitive or better ROC AUC, and shorter mean time-to-detect at matched false-alarm intensity, together with strong cross-domain transfer. By coupling physics-aligned representations with EVT-calibrated decision rules, PG-TMT delivers calibrated, interpretable, and deployment-ready early warnings for reliability-centric prognostics and health management.
Related papers
- Learning to Separate RF Signals Under Uncertainty: Detect-Then-Separate vs. Unified Joint Models [53.79667447811139]
We show that a single deep neural architecture learns to jointly detect and separate when applied directly to the received signal.<n>These findings highlight UJM as a scalable and practical alternative to DTS, while opening new directions for unified separation under broader estimation.
arXiv Detail & Related papers (2026-02-04T15:25:02Z) - RED-F: Reconstruction-Elimination based Dual-stream Contrastive Forecasting for Multivariate Time Series Anomaly Prediction [19.04414742117033]
We propose a reconstruction- Elimination based Dual-stream Contrastive Forecasting frame- work.<n>The framework transforms the difficult task of absolute signal detection into a simpler, more robust task of relative trajectory comparison.<n>Experiments on six real-world datasets demonstrate the superior capability of RED-F in anomaly prediction tasks.
arXiv Detail & Related papers (2025-11-25T08:11:41Z) - SARNet: A Spike-Aware consecutive validation Framework for Accurate Remaining Useful Life Prediction [7.0741499054562995]
We introduce SARNet (Spike-Aware Consecutive Validation Framework), which builds on a Modern Temporal Convolutional Network (ModernTCN)<n>ModernTCN forecasts degradation-sensitive indicators; an adaptive consecutive threshold validates true spikes while suppressing noise.<n>Across benchmark-ported datasets under an event-triggered protocol, SARNet consistently lowers error compared to recent baselines.
arXiv Detail & Related papers (2025-10-27T03:23:11Z) - Unsupervised Detection of Spatiotemporal Anomalies in PMU Data Using Transformer-Based BiGAN [0.0]
We introduce T-BiGAN, a framework that window-attention Transformers within a bidirectional Generative Adversarial Network (BiGAN)<n>Its encoderdecoder captures architecture while discriminator enforces cycle consistency to align latent space with the true data distribution.<n>Anomalies are flagged in real-time using an adaptive score that combines reconstruction error, latent space drift, and discriminator confidence.
arXiv Detail & Related papers (2025-09-30T00:16:35Z) - Adaptive Learning for IRS-Assisted Wireless Networks: Securing Opportunistic Communications Against Byzantine Eavesdroppers [7.256056777973974]
We propose a joint learning framework for Byzantine-resilient spectrum sensing and secure intelligent reflecting surface (IRS)<n>We develop an augmented-Lagrangian alternating algorithm with projected updates and provide provable sublinear convergence, with accelerated rates under mild local curvature.<n> Simulations across diverse network conditions show higher detection probability at fixed false-alarm rate under adversarial attacks, large reductions in sum MSE for honest users, strong suppression of eavesdropper signal power, and fast convergence.
arXiv Detail & Related papers (2025-08-11T17:28:25Z) - A Hybrid Artificial Intelligence Method for Estimating Flicker in Power Systems [42.76841620787673]
This paper introduces a novel hybrid AI method combining H filtering and an adaptive linear neuron network for flicker component estimation in power distribution systems.<n>The proposed method leverages the robustness of the H filter to extract the voltage envelope under uncertain and noisy conditions followed by the use of ADALINE to accurately identify flicker frequencies embedded in the envelope.
arXiv Detail & Related papers (2025-06-16T15:38:39Z) - PCA-Featured Transformer for Jamming Detection in 5G UAV Networks [0.5999777817331317]
Unmanned Aerial Vehicles (UAVs) face significant security risks from jamming attacks, which can compromise network functionality.<n>Traditional detection methods often fall short when confronting AI-powered jamming that dynamically modifies its behavior.<n>We introduce a novel U-shaped transformer architecture to refine feature representations for improved wireless security.
arXiv Detail & Related papers (2024-12-19T16:13:04Z) - Calibrating Wireless Ray Tracing for Digital Twinning using Local Phase Error Estimates [40.04476706955071]
ray tracing (RT) is widely seen as an enabling technology for DTs of the radio access network (RAN) segment of next-generation wireless systems.
The effectiveness of RT hinges on the adaptation of the electromagnetic properties assumed by the RT to actual channel conditions.
This paper proposes a novel channel response-based scheme that estimates and compensates for the phase errors in the channel responses.
arXiv Detail & Related papers (2023-12-19T22:01:59Z) - Forecasting Particle Accelerator Interruptions Using Logistic LASSO
Regression [62.997667081978825]
Unforeseen particle accelerator interruptions, also known as interlocks, lead to abrupt operational changes despite being necessary safety measures.
We propose a simple yet powerful binary classification model aiming to forecast such interruptions.
The model is formulated as logistic regression penalized by at least absolute shrinkage and selection operator.
arXiv Detail & Related papers (2023-03-15T23:11:30Z) - Time-to-Green predictions for fully-actuated signal control systems with
supervised learning [56.66331540599836]
This paper proposes a time series prediction framework using aggregated traffic signal and loop detector data.
We utilize state-of-the-art machine learning models to predict future signal phases' duration.
Results based on an empirical data set from a fully-actuated signal control system in Zurich, Switzerland, show that machine learning models outperform conventional prediction methods.
arXiv Detail & Related papers (2022-08-24T07:50:43Z) - Model-based Deep Learning Receiver Design for Rate-Splitting Multiple
Access [65.21117658030235]
This work proposes a novel design for a practical RSMA receiver based on model-based deep learning (MBDL) methods.
The MBDL receiver is evaluated in terms of uncoded Symbol Error Rate (SER), throughput performance through Link-Level Simulations (LLS) and average training overhead.
Results reveal that the MBDL outperforms by a significant margin the SIC receiver with imperfect CSIR.
arXiv Detail & Related papers (2022-05-02T12:23:55Z) - StableEmit: Selection Probability Discount for Reducing Emission Latency
of Streaming Monotonic Attention ASR [46.69852287267763]
We propose a simple alignment-free regularization method, StableEmit, to encourage MoChA to emit tokens earlier.
We show that StableEmit significantly reduces the recognition errors and the emission latency simultaneously.
arXiv Detail & Related papers (2021-07-01T17:49:31Z) - Interference Distribution Prediction for Link Adaptation in
Ultra-Reliable Low-Latency Communications [71.0558149440701]
Link adaptation (LA) is considered to be one of the bottlenecks to realize URLLC.
In this paper, we focus on predicting the signal to interference plus noise ratio at the user to enhance the LA.
We show that exploiting time correlation of the interference is an important enabler of URLLC.
arXiv Detail & Related papers (2020-07-01T07:59:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.