MMA-RNN: A Multi-level Multi-task Attention-based Recurrent Neural
Network for Discrimination and Localization of Atrial Fibrillation
- URL: http://arxiv.org/abs/2302.03731v2
- Date: Thu, 9 Feb 2023 01:29:04 GMT
- Title: MMA-RNN: A Multi-level Multi-task Attention-based Recurrent Neural
Network for Discrimination and Localization of Atrial Fibrillation
- Authors: Yifan Sun, Jingyan Shen, Yunfan Jiang, Zhaohui Huang, Minsheng Hao,
Xuegong Zhang
- Abstract summary: This paper proposes the Multi-level Multi-task Attention-based Recurrent Neural Network for three-class discrimination on patients and localization of the exact timing of AF episodes.
The model is designed as an end-to-end framework to enhance information interaction and reduce error accumulation.
- Score: 1.8037893225125925
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The automatic detection of atrial fibrillation based on electrocardiograph
(ECG) signals has received wide attention both clinically and practically. It
is challenging to process ECG signals with cyclical pattern, varying length and
unstable quality due to noise and distortion. Besides, there has been
insufficient research on separating persistent atrial fibrillation from
paroxysmal atrial fibrillation, and little discussion on locating the onsets
and end points of AF episodes. It is even more arduous to perform well on these
two distinct but interrelated tasks, while avoiding the mistakes inherent from
stage-by-stage approaches. This paper proposes the Multi-level Multi-task
Attention-based Recurrent Neural Network for three-class discrimination on
patients and localization of the exact timing of AF episodes. Our model
captures three-level sequential features based on a hierarchical architecture
utilizing Bidirectional Long and Short-Term Memory Network (Bi-LSTM) and
attention layers, and accomplishes the two tasks simultaneously with a
multi-head classifier. The model is designed as an end-to-end framework to
enhance information interaction and reduce error accumulation. Finally, we
conduct experiments on CPSC 2021 dataset and the result demonstrates the
superior performance of our method, indicating the potential application of
MMA-RNN to wearable mobile devices for routine AF monitoring and early
diagnosis.
Related papers
- FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRI [44.4791295950757]
We develop an unsupervised anomaly detection (UAD) approach for brain MRI.<n>We conduct the first systematic frequency-domain analysis of pathological signatures.<n>We show that Frequency-Decomposition Preprocessing (FDP) framework can leverage frequency-domain reconstruction for simultaneous pathology suppression and anatomical preservation.
arXiv Detail & Related papers (2025-11-17T02:40:14Z) - Motion-Robust Multimodal Fusion of PPG and Accelerometer Signals for Three-Class Heart Rhythm Classification [5.573626931805866]
Atrial fibrillation (AF) is a leading cause of stroke and mortality, particularly in elderly patients.<n>We introduce RhythmiNet, a residual neural network enhanced with temporal and channel attention modules.<n>The model performs three-class rhythm classification: AF, sinus rhythm (SR) and Other.
arXiv Detail & Related papers (2025-11-02T14:21:55Z) - QualityFM: a Multimodal Physiological Signal Foundation Model with Self-Distillation for Signal Quality Challenges in Critically Ill Patients [3.744688073010934]
Photoplethysmogram ( PPG) and electrocardiogram (ECG) are commonly recorded intesive care unit (ICU) and operating room (OR)<n>The methods explored so far suffer from limited generalizability, reliance on extensive labeled data, and poor cross-task transferability.<n>We introduce QualityFM, a novel multimodal foundation model for these physiological signals, designed to acquire a general-purpose understanding of signal quality.
arXiv Detail & Related papers (2025-09-08T10:20:56Z) - Masked Autoencoders that Feel the Heart: Unveiling Simplicity Bias for ECG Analyses [24.039917512972977]
We show that ECG models tend to overfit dominant and repetitive patterns, overlooking fine-grained but clinically critical cues, known as Simplicity Bias (SB)<n>We propose a novel method comprising two key components: 1) Temporal-frequency aware Filters to capture temporal-frequency features reflecting the dynamic characteristics of ECG signals, and 2) building on this, Multi-Grained Prototype Reconstruction for coarse and fine representation learning across dual domains, further mitigating SB.
arXiv Detail & Related papers (2025-06-25T03:25:49Z) - Segmenting Bi-Atrial Structures Using ResNext Based Framework [3.0838948803252904]
We propose TASSNet, a novel two-stage deep learning framework for fully automated segmentation of both left atrium (LA) and right atrium (RA)<n> TASSNet introduces two main innovations: (i) a ResNeXt-based encoder to enhance feature extraction from limited medical datasets, and (ii) a cyclical learning rate schedule to address convergence instability in highly imbalanced, small-batch 3D segmentation tasks.
arXiv Detail & Related papers (2025-02-28T10:23:12Z) - CTPD: Cross-Modal Temporal Pattern Discovery for Enhanced Multimodal Electronic Health Records Analysis [50.56875995511431]
We introduce a Cross-Modal Temporal Pattern Discovery (CTPD) framework, designed to efficiently extract meaningful cross-modal temporal patterns from multimodal EHR data.<n>Our approach introduces shared initial temporal pattern representations which are refined using slot attention to generate temporal semantic embeddings.
arXiv Detail & Related papers (2024-11-01T15:54:07Z) - Classification of Heart Sounds Using Multi-Branch Deep Convolutional Network and LSTM-CNN [2.7699831151653305]
This study develops and evaluates novel deep learning architectures that offer fast, accurate, and cost-effective methods for automatic diagnosis of cardiac diseases.<n>We propose two innovative methodologies: first, a Multi-Branch Deep Convolutional Neural Network (MBDCN) that emulates human auditory processing by utilizing diverse convolutional filter sizes and power spectrum input for enhanced feature extraction.<n>Second, a Long Short-Term Memory-Convolutional Neural (LSCN) model that integrates LSTM blocks with MBDCN to improve time-domain feature extraction.
arXiv Detail & Related papers (2024-07-15T13:02:54Z) - Temporal Cross-Attention for Dynamic Embedding and Tokenization of Multimodal Electronic Health Records [1.6609516435725236]
We introduce a dynamic embedding and tokenization framework for precise representation of multimodal clinical time series.
Our framework outperformed baseline approaches on the task of predicting the occurrence of nine postoperative complications.
arXiv Detail & Related papers (2024-03-06T19:46:44Z) - SiamAF: Learning Shared Information from ECG and PPG Signals for Robust
Atrial Fibrillation Detection [18.014439380551824]
Atrial fibrillation (AF) is the most common type of cardiac arrhythmia.
It is associated with an increased risk of stroke, heart failure, and other cardiovascular complications, but can be clinically silent.
Current deep learning models learn from a single modality, either electrocardiogram (ECG) or photoplethysmography ( PPG) signals.
We propose a new approach, SiamAF, leveraging a novel Siamese network architecture and joint learning loss function to learn from both ECG and PPG signals.
arXiv Detail & Related papers (2023-10-13T15:48:24Z) - DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial
Attention Detection [49.196182908826565]
Auditory Attention Detection (AAD) aims to detect target speaker from brain signals in a multi-speaker environment.
Current approaches primarily rely on traditional convolutional neural network designed for processing Euclidean data like images.
This paper proposes a dynamical graph self-distillation (DGSD) approach for AAD, which does not require speech stimuli as input.
arXiv Detail & Related papers (2023-09-07T13:43:46Z) - TC-LIF: A Two-Compartment Spiking Neuron Model for Long-Term Sequential
Modelling [54.97005925277638]
The identification of sensory cues associated with potential opportunities and dangers is frequently complicated by unrelated events that separate useful cues by long delays.
It remains a challenging task for state-of-the-art spiking neural networks (SNNs) to establish long-term temporal dependency between distant cues.
We propose a novel biologically inspired Two-Compartment Leaky Integrate-and-Fire spiking neuron model, dubbed TC-LIF.
arXiv Detail & Related papers (2023-08-25T08:54:41Z) - MVMTnet: A Multi-variate Multi-modal Transformer for Multi-class
Classification of Cardiac Irregularities Using ECG Waveforms and Clinical
Notes [4.648677931378919]
Deep learning can be used to optimize diagnosis and patient monitoring for clinical-based applications.
For cardiovascular disease, one such condition where the rising number of patients increasingly outweighs the availability of medical resources in different parts of the world, a core challenge is the automated classification of various cardiac abnormalities.
The proposed novel multi-modal Transformer architecture would be able to accurately perform this task while demonstrating the cross-domain effectiveness of Transformers.
arXiv Detail & Related papers (2023-02-21T21:38:41Z) - Hybrid Reinforced Medical Report Generation with M-Linear Attention and
Repetition Penalty [45.92216112110279]
We propose a hybrid reinforced medical report generation method with m-linear attention and repetition penalty mechanism.
Specifically, a hybrid reward with different weights is employed to remedy the limitations of single-metric-based rewards.
We also propose a search algorithm with linear complexity to approximate the best weight combination.
arXiv Detail & Related papers (2022-10-14T15:27:34Z) - Fuzzy Attention Neural Network to Tackle Discontinuity in Airway
Segmentation [67.19443246236048]
Airway segmentation is crucial for the examination, diagnosis, and prognosis of lung diseases.
Some small-sized airway branches (e.g., bronchus and terminaloles) significantly aggravate the difficulty of automatic segmentation.
This paper presents an efficient method for airway segmentation, comprising a novel fuzzy attention neural network and a comprehensive loss function.
arXiv Detail & Related papers (2022-09-05T16:38:13Z) - Superresolution and Segmentation of OCT scans using Multi-Stage
adversarial Guided Attention Training [18.056525121226862]
We propose the multi-stage & multi-discriminatory generative adversarial network (MultiSDGAN) to translate OCT scans in high-resolution segmentation labels.
We evaluate and compare various combinations of channel and spatial attention to the MultiSDGAN architecture to extract more powerful feature maps.
Our results demonstrate relative improvements of 21.44% and 19.45% on the Dice coefficient and SSIM, respectively.
arXiv Detail & Related papers (2022-06-10T00:26:55Z) - Real-time landmark detection for precise endoscopic submucosal
dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery.
We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks.
We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z) - Multiple Sclerosis Lesion Activity Segmentation with Attention-Guided
Two-Path CNNs [49.32653090178743]
convolutional neural networks (CNNs) are studied for lesion activity segmentation from two time points.
CNNs are designed and evaluated that combine the information from two points in different ways.
It is demonstrated that deep learning-based methods outperform classic approaches.
arXiv Detail & Related papers (2020-08-05T08:49:20Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z) - TeCNO: Surgical Phase Recognition with Multi-Stage Temporal
Convolutional Networks [43.95869213955351]
We propose a Multi-Stage Temporal Convolutional Network (MS-TCN) that performs hierarchical prediction refinement for surgical phase recognition.
Our method is thoroughly evaluated on two datasets of laparoscopic cholecystectomy videos with and without the use of additional surgical tool information.
arXiv Detail & Related papers (2020-03-24T10:12:30Z) - ADRN: Attention-based Deep Residual Network for Hyperspectral Image
Denoising [52.01041506447195]
We propose an attention-based deep residual network to learn a mapping from noisy HSI to the clean one.
Experimental results demonstrate that our proposed ADRN scheme outperforms the state-of-the-art methods both in quantitative and visual evaluations.
arXiv Detail & Related papers (2020-03-04T08:36:27Z) - HAN-ECG: An Interpretable Atrial Fibrillation Detection Model Using
Hierarchical Attention Networks [15.96723122978118]
Atrial fibrillation (AF) is one of the most prevalent cardiac arrhythmias that affects the lives of more than 3 million people in the U.S. and over 33 million people around the world.
We propose HAN-ECG, an interpretable bidirectional-recurrent-neural-network-based approach for the AF detection task.
arXiv Detail & Related papers (2020-02-12T22:23:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.