Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis:
An Open Source Benchmark Study
- URL: http://arxiv.org/abs/2003.03315v3
- Date: Wed, 19 Aug 2020 13:31:16 GMT
- Title: Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis:
An Open Source Benchmark Study
- Authors: Zhibin Zhao, Tianfu Li, Jingyao Wu, Chuang Sun, Shibin Wang, Ruqiang
Yan, Xuefeng Chen
- Abstract summary: This paper provides a benchmark study of deep learning algorithms for rotating machinery intelligent diagnosis.
We integrate the whole evaluation codes into a code library and release this code library to the public for better development of this field.
By these works, we release a unified code framework for comparing and testing models fairly and quickly, emphasize the importance of open source codes, provide the baseline accuracy (a lower bound) to avoid useless improvement, and discuss potential future directions in this field.
- Score: 0.8497188292342053
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the development of deep learning (DL) techniques, rotating machinery
intelligent diagnosis has gone through tremendous progress with verified
success and the classification accuracies of many DL-based intelligent
diagnosis algorithms are tending to 100\%. However, different datasets,
configurations, and hyper-parameters are often recommended to be used in
performance verification for different types of models, and few open source
codes are made public for evaluation and comparisons. Therefore, unfair
comparisons and ineffective improvement may exist in rotating machinery
intelligent diagnosis, which limits the advancement of this field. To address
these issues, we perform an extensive evaluation of four kinds of models,
including multi-layer perception (MLP), auto-encoder (AE), convolutional neural
network (CNN), and recurrent neural network (RNN), with various datasets to
provide a benchmark study within the same framework. We first gather most of
the publicly available datasets and give the complete benchmark study of
DL-based intelligent algorithms under two data split strategies, five input
formats, three normalization methods, and four augmentation methods. Second, we
integrate the whole evaluation codes into a code library and release this code
library to the public for better development of this field. Third, we use
specific-designed cases to point out the existing issues, including class
imbalance, generalization ability, interpretability, few-shot learning, and
model selection. By these works, we release a unified code framework for
comparing and testing models fairly and quickly, emphasize the importance of
open source codes, provide the baseline accuracy (a lower bound) to avoid
useless improvement, and discuss potential future directions in this field. The
code library is available at
https://github.com/ZhaoZhibin/DL-based-Intelligent-Diagnosis-Benchmark.
Related papers
- Binary Code Similarity Detection via Graph Contrastive Learning on Intermediate Representations [52.34030226129628]
Binary Code Similarity Detection (BCSD) plays a crucial role in numerous fields, including vulnerability detection, malware analysis, and code reuse identification.
In this paper, we propose IRBinDiff, which mitigates compilation differences by leveraging LLVM-IR with higher-level semantic abstraction.
Our extensive experiments, conducted under varied compilation settings, demonstrate that IRBinDiff outperforms other leading BCSD methods in both One-to-one comparison and One-to-many search scenarios.
arXiv Detail & Related papers (2024-10-24T09:09:20Z) - RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation [54.707460684650584]
Large Language Models (LLMs) demonstrate human-level capabilities in dialogue, reasoning, and knowledge retention.
Current research addresses this bottleneck by equipping LLMs with external knowledge, a technique known as Retrieval Augmented Generation (RAG)
RAGLAB is a modular and research-oriented open-source library that reproduces 6 existing algorithms and provides a comprehensive ecosystem for investigating RAG algorithms.
arXiv Detail & Related papers (2024-08-21T07:20:48Z) - Towards Improved Imbalance Robustness in Continual Multi-Label Learning
with Dual Output Spiking Architecture (DOSA) [3.7039357017353214]
This work proposes a dual output spiking architecture (DOSA) for continual multi-label learning.
A novel imbalance-aware loss function is also proposed, improving the multi-label classification performance of the model.
arXiv Detail & Related papers (2024-02-07T05:38:53Z) - Improving and Benchmarking Offline Reinforcement Learning Algorithms [87.67996706673674]
This work aims to bridge the gaps caused by low-level choices and datasets.
We empirically investigate 20 implementation choices using three representative algorithms.
We find two variants CRR+ and CQL+ achieving new state-of-the-art on D4RL.
arXiv Detail & Related papers (2023-06-01T17:58:46Z) - The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder
Models for More Efficient Code Classification [7.205265729540538]
Training deep NLP models requires significant computational resources.
We propose a generic approach, EarlyBIRD, to build composite representations of code from the early layers of a pre-trained transformer model.
arXiv Detail & Related papers (2023-05-08T16:47:28Z) - Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning
Approach for Dynamic Treatment Regimes [37.06048335758881]
We present a novel approach, a distributed Q-learning algorithm, for generating dynamic treatment regimes.
We show that our algorithm outperforms both traditional linear Q-learning and commonly used deep Q-learning in terms of both prediction accuracy and computation cost.
arXiv Detail & Related papers (2023-02-21T04:15:34Z) - Towards Better Out-of-Distribution Generalization of Neural Algorithmic
Reasoning Tasks [51.8723187709964]
We study the OOD generalization of neural algorithmic reasoning tasks.
The goal is to learn an algorithm from input-output pairs using deep neural networks.
arXiv Detail & Related papers (2022-11-01T18:33:20Z) - A Meta-embedding-based Ensemble Approach for ICD Coding Prediction [64.42386426730695]
International Classification of Diseases (ICD) are the de facto codes used globally for clinical coding.
These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic information.
Our proposed approach enhances the performance of neural models by effectively training word vectors using routine medical data as well as external knowledge from scientific articles.
arXiv Detail & Related papers (2021-02-26T17:49:58Z) - Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.