MSW-Transformer: Multi-Scale Shifted Windows Transformer Networks for
12-Lead ECG Classification
- URL: http://arxiv.org/abs/2306.12098v1
- Date: Wed, 21 Jun 2023 08:27:26 GMT
- Title: MSW-Transformer: Multi-Scale Shifted Windows Transformer Networks for
12-Lead ECG Classification
- Authors: Renjie Cheng, Zhemin Zhuang, Shuxin Zhuang, Lei Xie and Jingfeng Guo
- Abstract summary: We propose a single-layer Transformer network that uses a multi-window sliding attention mechanism at different scales to capture features in different dimensions.
A learnable feature fusion method is then proposed to integrate features from different windows to further enhance model performance.
The proposed model achieves state-of-the-art performance on five classification tasks of the PTBXL-2020 12-lead ECG dataset.
- Score: 6.353064734475176
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic classification of electrocardiogram (ECG) signals plays a crucial
role in the early prevention and diagnosis of cardiovascular diseases. While
ECG signals can be used for the diagnosis of various diseases, their
pathological characteristics exhibit minimal variations, posing a challenge to
automatic classification models. Existing methods primarily utilize
convolutional neural networks to extract ECG signal features for
classification, which may not fully capture the pathological feature
differences of different diseases. Transformer networks have advantages in
feature extraction for sequence data, but the complete network is complex and
relies on large-scale datasets. To address these challenges, we propose a
single-layer Transformer network called Multi-Scale Shifted Windows Transformer
Networks (MSW-Transformer), which uses a multi-window sliding attention
mechanism at different scales to capture features in different dimensions. The
self-attention is restricted to non-overlapping local windows via shifted
windows, and different window scales have different receptive fields. A
learnable feature fusion method is then proposed to integrate features from
different windows to further enhance model performance. Furthermore, we
visualize the attention mechanism of the multi-window shifted mechanism to
achieve better clinical interpretation in the ECG classification task. The
proposed model achieves state-of-the-art performance on five classification
tasks of the PTBXL-2020 12-lead ECG dataset, which includes 5 diagnostic
superclasses, 23 diagnostic subclasses, 12 rhythm classes, 17 morphology
classes, and 44 diagnosis classes, with average macro-F1 scores of 77.85%,
47.57%, 66.13%, 34.60%, and 34.29%, and average sample-F1 scores of 81.26%,
68.27%, 91.32%, 50.07%, and 63.19%, respectively.
Related papers
- Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI [58.809276442508256]
We propose a hybrid network via the combination of convolution neural network (CNN) and transformer layers.
The experimental results on private and public DCE-MRI datasets demonstrate that the proposed hybrid network superior performance than the state-of-the-art methods.
arXiv Detail & Related papers (2024-08-11T15:46:00Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - Breast Ultrasound Tumor Classification Using a Hybrid Multitask
CNN-Transformer Network [63.845552349914186]
Capturing global contextual information plays a critical role in breast ultrasound (BUS) image classification.
Vision Transformers have an improved capability of capturing global contextual information but may distort the local image patterns due to the tokenization operations.
In this study, we proposed a hybrid multitask deep neural network called Hybrid-MT-ESTAN, designed to perform BUS tumor classification and segmentation.
arXiv Detail & Related papers (2023-08-04T01:19:32Z) - HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical
Image Classification [16.455887856811465]
This paper proposes a three-branch hierarchical multi-scale feature fusion network structure termed as HiFuse for medical image classification.
The accuracy of our proposed model on the ISIC dataset is 7.6% higher than baseline, 21.5% on the Covid-19 dataset, and 10.4% on the Kvasir dataset.
arXiv Detail & Related papers (2022-09-21T09:30:20Z) - Multiple Time Series Fusion Based on LSTM An Application to CAP A Phase
Classification Using EEG [56.155331323304]
Deep learning based electroencephalogram channels' feature level fusion is carried out in this work.
Channel selection, fusion, and classification procedures were optimized by two optimization algorithms.
arXiv Detail & Related papers (2021-12-18T14:17:49Z) - SE-ECGNet: A Multi-scale Deep Residual Network with
Squeeze-and-Excitation Module for ECG Signal Classification [6.124438924401066]
We develop a multi-scale deep residual network for the ECG signal classification task.
We are the first to propose to treat the multi-lead signal as a 2-dimensional matrix.
Our proposed model achieves 99.2% F1-score in the MIT-BIH dataset and 89.4% F1-score in Alibaba dataset.
arXiv Detail & Related papers (2020-12-10T08:37:44Z) - Multilabel 12-Lead Electrocardiogram Classification Using Gradient
Boosting Tree Ensemble [64.29529357862955]
We build an algorithm using gradient boosted tree ensembles fitted on morphology and signal processing features to classify ECG diagnosis.
For each lead, we derive features from heart rate variability, PQRST template shape, and the full signal waveform.
We join the features of all 12 leads to fit an ensemble of gradient boosting decision trees to predict probabilities of ECG instances belonging to each class.
arXiv Detail & Related papers (2020-10-21T18:11:36Z) - Combining Scatter Transform and Deep Neural Networks for Multilabel
Electrocardiogram Signal Classification [0.6117371161379209]
We incorporate a variant of the complex wavelet transform, called a scatter transform, in a deep residual neural network (ResNet)
Our approach achieved a challenge validation score of 0.640, and full test score of 0.485, placing us 4th out of 41 in the official ranking.
arXiv Detail & Related papers (2020-10-15T10:13:31Z) - ECG Classification with a Convolutional Recurrent Neural Network [0.13903116275861838]
We developed a convolutional recurrent network to classify 12-lead ECG signals for the challenge of PhysioNet/ Computing in Cardiology 2020 as team Pink Irish Hat.
The model combines convolutional and recurrent layers, takes sliding windows of ECG signals as input and yields the probability of each class as output.
Our network achieved a challenge score of 0.511 on the hidden validation set and 0.167 on the full hidden test set, ranking us 23rd out of 41 in the official ranking.
arXiv Detail & Related papers (2020-09-28T13:41:59Z) - Multi-Task Neural Networks with Spatial Activation for Retinal Vessel
Segmentation and Artery/Vein Classification [49.64863177155927]
We propose a multi-task deep neural network with spatial activation mechanism to segment full retinal vessel, artery and vein simultaneously.
The proposed network achieves pixel-wise accuracy of 95.70% for vessel segmentation, and A/V classification accuracy of 94.50%, which is the state-of-the-art performance for both tasks.
arXiv Detail & Related papers (2020-07-18T05:46:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.