Transformer with Selective Shuffled Position Embedding and Key-Patch
Exchange Strategy for Early Detection of Knee Osteoarthritis
- URL: http://arxiv.org/abs/2304.08364v2
- Date: Fri, 30 Jun 2023 21:12:04 GMT
- Title: Transformer with Selective Shuffled Position Embedding and Key-Patch
Exchange Strategy for Early Detection of Knee Osteoarthritis
- Authors: Zhe Wang and Aladine Chetouani and Mohamed Jarraya and Didier Hans and
Rachid Jennane
- Abstract summary: Knee OsteoArthritis (KOA) is a widespread musculoskeletal disorder that can severely impact the mobility of older individuals.
Insufficient medical data presents a significant obstacle for effectively training models due to the high cost associated with data labelling.
We propose a novel approach based on the Vision Transformer (ViT) model with original Selective Shuffled Position Embedding (SSPE) and key-patch exchange strategies.
- Score: 7.656764569447645
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Knee OsteoArthritis (KOA) is a widespread musculoskeletal disorder that can
severely impact the mobility of older individuals. Insufficient medical data
presents a significant obstacle for effectively training models due to the high
cost associated with data labelling. Currently, deep learning-based models
extensively utilize data augmentation techniques to improve their
generalization ability and alleviate overfitting. However, conventional data
augmentation techniques are primarily based on the original data and fail to
introduce substantial diversity to the dataset. In this paper, we propose a
novel approach based on the Vision Transformer (ViT) model with original
Selective Shuffled Position Embedding (SSPE) and key-patch exchange strategies
to obtain different input sequences as a method of data augmentation for early
detection of KOA (KL-0 vs KL-2). More specifically, we fix and shuffle the
position embedding of key and non-key patches, respectively. Then, for the
target image, we randomly select other candidate images from the training set
to exchange their key patches and thus obtain different input sequences.
Finally, a hybrid loss function is developed by incorporating multiple loss
functions for different types of the sequences. According to the experimental
results, the generated data are considered valid as they lead to a notable
improvement in the model's classification performance.
Related papers
- Depression detection in social media posts using transformer-based models and auxiliary features [6.390468088226495]
Detection of depression in social media posts is crucial due to the increasing prevalence of mental health issues.
Traditional machine learning algorithms often fail to capture intricate textual patterns, limiting their effectiveness in identifying depression.
This research proposes a neural network architecture leveraging transformer-based models combined with metadata and linguistic markers.
arXiv Detail & Related papers (2024-09-30T07:53:39Z) - Model Inversion Attacks Through Target-Specific Conditional Diffusion Models [54.69008212790426]
Model inversion attacks (MIAs) aim to reconstruct private images from a target classifier's training set, thereby raising privacy concerns in AI applications.
Previous GAN-based MIAs tend to suffer from inferior generative fidelity due to GAN's inherent flaws and biased optimization within latent space.
We propose Diffusion-based Model Inversion (Diff-MI) attacks to alleviate these issues.
arXiv Detail & Related papers (2024-07-16T06:38:49Z) - Exploring the Efficacy of Base Data Augmentation Methods in Deep
Learning-Based Radiograph Classification of Knee Joint Osteoarthritis [0.12289361708127876]
Diagnosing knee joint osteoarthritis (KOA) is challenging due to subtle radiographic indicators and the varied progression of the disease.
This study explored various data augmentation methods, including adversarial augmentations, and their impact on KOA classification model performance.
arXiv Detail & Related papers (2023-11-10T15:35:00Z) - Adaptive Variance Thresholding: A Novel Approach to Improve Existing
Deep Transfer Vision Models and Advance Automatic Knee-Joint Osteoarthritis
Classification [0.11249583407496219]
Knee-Joint Osteoarthritis (KOA) is a prevalent cause of global disability and inherently complex to diagnose.
One promising classification avenue involves applying deep learning methods.
This study proposes a novel paradigm for improving post-training specialized classifiers.
arXiv Detail & Related papers (2023-11-10T00:17:07Z) - 1D-Convolutional transformer for Parkinson disease diagnosis from gait [7.213855322671065]
This paper presents an efficient deep neural network model for diagnosing Parkinson's disease from gait.
We introduce a hybrid ConvNetTransform-er architecture to accurately diagnose the disease by detecting the severity stage.
Our experimental results show that our approach is effective for detecting the different stages of Parkinson's disease from gait data.
arXiv Detail & Related papers (2023-11-06T15:17:17Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - Remote Sensing Change Detection With Transformers Trained from Scratch [62.96911491252686]
transformer-based change detection (CD) approaches either employ a pre-trained model trained on large-scale image classification ImageNet dataset or rely on first pre-training on another CD dataset and then fine-tuning on the target benchmark.
We develop an end-to-end CD approach with transformers that is trained from scratch and yet achieves state-of-the-art performance on four public benchmarks.
arXiv Detail & Related papers (2023-04-13T17:57:54Z) - TWINS: A Fine-Tuning Framework for Improved Transferability of
Adversarial Robustness and Generalization [89.54947228958494]
This paper focuses on the fine-tuning of an adversarially pre-trained model in various classification tasks.
We propose a novel statistics-based approach, Two-WIng NormliSation (TWINS) fine-tuning framework.
TWINS is shown to be effective on a wide range of image classification datasets in terms of both generalization and robustness.
arXiv Detail & Related papers (2023-03-20T14:12:55Z) - Key-Exchange Convolutional Auto-Encoder for Data Augmentation in Early
Knee OsteoArthritis Classification [9.400820679110147]
Knee OsteoArthritis (KOA) is a prevalent musculoskeletal condition that impairs the mobility of senior citizens.
We propose a learning model based on the convolutional Auto-Encoder and a hybrid loss strategy to generate new data for early KOA diagnosis.
arXiv Detail & Related papers (2023-02-26T15:45:19Z) - Cross-Site Severity Assessment of COVID-19 from CT Images via Domain
Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event.
To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites.
This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.