Position and Orientation-Aware One-Shot Learning for Medical Action
Recognition from Signal Data
- URL: http://arxiv.org/abs/2309.15635v1
- Date: Wed, 27 Sep 2023 13:08:15 GMT
- Title: Position and Orientation-Aware One-Shot Learning for Medical Action
Recognition from Signal Data
- Authors: Leiyu Xie, Yuxing Yang, Zeyu Fu, Syed Mohsen Naqvi
- Abstract summary: We propose a position and orientation-aware one-shot learning framework for medical action recognition from signal data.
The proposed framework comprises two stages and each stage includes signal-level image generation ( SIG), cross-attention (CsA), dynamic time warping (DTW) modules.
- Score: 9.757753196253532
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this work, we propose a position and orientation-aware one-shot learning
framework for medical action recognition from signal data. The proposed
framework comprises two stages and each stage includes signal-level image
generation (SIG), cross-attention (CsA), dynamic time warping (DTW) modules and
the information fusion between the proposed privacy-preserved position and
orientation features. The proposed SIG method aims to transform the raw
skeleton data into privacy-preserved features for training. The CsA module is
developed to guide the network in reducing medical action recognition bias and
more focusing on important human body parts for each specific action, aimed at
addressing similar medical action related issues. Moreover, the DTW module is
employed to minimize temporal mismatching between instances and further improve
model performance. Furthermore, the proposed privacy-preserved
orientation-level features are utilized to assist the position-level features
in both of the two stages for enhancing medical action recognition performance.
Extensive experimental results on the widely-used and well-known NTU RGB+D 60,
NTU RGB+D 120, and PKU-MMD datasets all demonstrate the effectiveness of the
proposed method, which outperforms the other state-of-the-art methods with
general dataset partitioning by 2.7%, 6.2% and 4.1%, respectively.
Related papers
- Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration [2.6089354079273512]
We propose a multi-task learning framework based on dual attention frequency fusion (DAFF-Net)
DAFF-Net simultaneously achieves the segmentation masks and dense deformation fields in a single-step estimation.
Experiments on three public 3D brain magnetic resonance imaging (MRI) datasets demonstrate that the proposed DAFF-Net and its unsupervised variant outperform state-of-the-art registration methods.
arXiv Detail & Related papers (2024-09-29T11:11:04Z) - PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation [51.509573838103854]
We propose a semi-supervised learning framework, termed Progressive Mean Teachers (PMT), for medical image segmentation.
Our PMT generates high-fidelity pseudo labels by learning robust and diverse features in the training process.
Experimental results on two datasets with different modalities, i.e., CT and MRI, demonstrate that our method outperforms the state-of-the-art medical image segmentation approaches.
arXiv Detail & Related papers (2024-09-08T15:02:25Z) - SMART: Scene-motion-aware human action recognition framework for mental disorder group [16.60713558596286]
We propose to build a vision-based Human Action Recognition dataset including abnormal actions often occurring in the mental disorder group.
We then introduce a novel Scene-Motion-aware Action Recognition framework, named SMART, consisting of two technical modules.
The effectiveness of our proposed method has been validated on our self-collected HAR dataset (HAD), achieving 94.9% and 93.1% accuracy in un-seen subjects and scenes, and outperforming state-of-the-art approaches by 6.5% and 13.2%, respectively.
arXiv Detail & Related papers (2024-06-07T05:29:42Z) - A Mutual Inclusion Mechanism for Precise Boundary Segmentation in Medical Images [2.9137615132901704]
We present a novel deep learning-based approach, MIPC-Net, for precise boundary segmentation in medical images.
We introduce the MIPC module, which enhances the focus on channel information when extracting position features.
We also propose the GL-MIPC-Residue, a global residual connection that enhances the integration of the encoder and decoder.
arXiv Detail & Related papers (2024-04-12T02:14:35Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and
Contrastive Meta-Learning [51.03781020616402]
Fine-grained action recognition is attracting increasing attention due to the emerging demand of specific action understanding in real-world applications.
We propose a few-shot fine-grained action recognition problem, aiming to recognize novel fine-grained actions with only few samples given for each class.
Although progress has been made in coarse-grained actions, existing few-shot recognition methods encounter two issues handling fine-grained actions.
arXiv Detail & Related papers (2021-08-15T02:21:01Z) - Dual-Attention Enhanced BDense-UNet for Liver Lesion Segmentation [3.1667381240856987]
We propose a new segmentation network by integrating DenseUNet and bidirectional LSTM together with attention mechanism, termed as DA-BDense-UNet.
DenseUNet allows learning enough diverse features and enhancing the representative power of networks by regulating the information flow.
arXiv Detail & Related papers (2021-07-24T16:28:00Z) - Differentially private federated deep learning for multi-site medical
image segmentation [56.30543374146002]
Collaborative machine learning techniques such as federated learning (FL) enable the training of models on effectively larger datasets without data transfer.
Recent initiatives have demonstrated that segmentation models trained with FL can achieve performance similar to locally trained models.
However, FL is not a fully privacy-preserving technique and privacy-centred attacks can disclose confidential patient data.
arXiv Detail & Related papers (2021-07-06T12:57:32Z) - Cross-Modality Brain Tumor Segmentation via Bidirectional
Global-to-Local Unsupervised Domain Adaptation [61.01704175938995]
In this paper, we propose a novel Bidirectional Global-to-Local (BiGL) adaptation framework under a UDA scheme.
Specifically, a bidirectional image synthesis and segmentation module is proposed to segment the brain tumor.
The proposed method outperforms several state-of-the-art unsupervised domain adaptation methods by a large margin.
arXiv Detail & Related papers (2021-05-17T10:11:45Z) - Towards Cross-modality Medical Image Segmentation with Online Mutual
Knowledge Distillation [71.89867233426597]
In this paper, we aim to exploit the prior knowledge learned from one modality to improve the segmentation performance on another modality.
We propose a novel Mutual Knowledge Distillation scheme to thoroughly exploit the modality-shared knowledge.
Experimental results on the public multi-class cardiac segmentation data, i.e., MMWHS 2017, show that our method achieves large improvements on CT segmentation.
arXiv Detail & Related papers (2020-10-04T10:25:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.