Introducing Vision Transformer for Alzheimer's Disease classification
task with 3D input
- URL: http://arxiv.org/abs/2210.01177v1
- Date: Mon, 3 Oct 2022 18:48:22 GMT
- Title: Introducing Vision Transformer for Alzheimer's Disease classification
task with 3D input
- Authors: Zilun Zhang, Farzad Khalvati
- Abstract summary: Do Vision Transformer-based models perform better than CNN-based models?
Is it possible to use a shallow 3D CNN-based model to obtain satisfying results?
Our results indicate that the shallow 3D CNN-based models are sufficient to achieve good classification results for Alzheimer's Disease using MRI scans.
- Score: 1.0152838128195467
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many high-performance classification models utilize complex CNN-based
architectures for Alzheimer's Disease classification. We aim to investigate two
relevant questions regarding classification of Alzheimer's Disease using MRI:
"Do Vision Transformer-based models perform better than CNN-based models?" and
"Is it possible to use a shallow 3D CNN-based model to obtain satisfying
results?" To achieve these goals, we propose two models that can take in and
process 3D MRI scans: Convolutional Voxel Vision Transformer (CVVT)
architecture, and ConvNet3D-4, a shallow 4-block 3D CNN-based model. Our
results indicate that the shallow 3D CNN-based models are sufficient to achieve
good classification results for Alzheimer's Disease using MRI scans.
Related papers
- Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans [0.6827423171182153]
Classifying 3D MRI images for early detection of Alzheimer's disease is a critical task in medical imaging.
Traditional approaches using Convolutional Neural Networks (CNNs) and Transformers face significant challenges in this domain.
We propose the use of Vision Mamba, an advanced model based on State Space Models (SSMs) for the classification of 3D MRI images to detect Alzheimer's disease.
arXiv Detail & Related papers (2024-06-09T12:23:22Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Video4MRI: An Empirical Study on Brain Magnetic Resonance Image
Analytics with CNN-based Video Classification Frameworks [60.42012344842292]
3D CNN-based models dominate the field of magnetic resonance image (MRI) analytics.
In this paper, four datasets of Alzheimer's and Parkinson's disease recognition are utilized in experiments.
In terms of efficiency, the video framework performs better than 3D-CNN models by 5% - 11% with 50% - 66% less trainable parameters.
arXiv Detail & Related papers (2023-02-24T15:26:31Z) - Highly Accurate FMRI ADHD Classification using time distributed multi
modal 3D CNNs [0.0]
This work proposes an algorithm for fMRI data analysis for the classification of ADHD disorders.
By leveraging a 3D-GAN it would be possible to use deepfake data to enhance the accuracy of 3D CNN classification of brain disorders.
arXiv Detail & Related papers (2022-05-24T11:39:11Z) - Combining 3D Image and Tabular Data via the Dynamic Affine Feature Map
Transform [3.5235974685889397]
We introduce the Dynamic Affine Feature Map Transform (DAFT), a general-purpose module for CNNs that dynamically rescales and shifts the feature maps of a convolutional layer, conditional on a patient's clinical information.
We show that DAFT is highly effective in combining 3D image and tabular information for diagnosis and time-to-dementia prediction, where it outperforms competing CNNs with a mean balanced accuracy of 0.622 and mean c-index of 0.748, respectively.
arXiv Detail & Related papers (2021-07-13T11:18:22Z) - 3D Convolutional Neural Networks for Stalled Brain Capillary Detection [72.21315180830733]
Brain vasculature dysfunctions such as stalled blood flow in cerebral capillaries are associated with cognitive decline and pathogenesis in Alzheimer's disease.
Here, we describe a deep learning-based approach for automatic detection of stalled capillaries in brain images based on 3D convolutional neural networks.
In this setting, our approach outperformed other methods and demonstrated state-of-the-art results, achieving 0.85 Matthews correlation coefficient, 85% sensitivity, and 99.3% specificity.
arXiv Detail & Related papers (2021-04-04T20:30:14Z) - Automated Model Design and Benchmarking of 3D Deep Learning Models for
COVID-19 Detection with Chest CT Scans [72.04652116817238]
We propose a differentiable neural architecture search (DNAS) framework to automatically search for the 3D DL models for 3D chest CT scans classification.
We also exploit the Class Activation Mapping (CAM) technique on our models to provide the interpretability of the results.
arXiv Detail & Related papers (2021-01-14T03:45:01Z) - Dynamic Image for 3D MRI Image Alzheimer's Disease Classification [26.296422774282156]
Training a 3D convolutional neural network (CNN) is time-consuming and computationally expensive.
We make use of approximate rank pooling to transform the 3D MRI image volume into a 2D image to use as input to a 2D CNN.
Our proposed CNN model achieves $9.5%$ better Alzheimer's disease classification accuracy than the baseline 3D models.
arXiv Detail & Related papers (2020-11-30T21:39:32Z) - Modelling the Distribution of 3D Brain MRI using a 2D Slice VAE [66.63629641650572]
We propose a method to model 3D MR brain volumes distribution by combining a 2D slice VAE with a Gaussian model that captures the relationships between slices.
We also introduce a novel evaluation method for generated volumes that quantifies how well their segmentations match those of true brain anatomy.
arXiv Detail & Related papers (2020-07-09T13:23:15Z) - 2.75D: Boosting learning by representing 3D Medical imaging to 2D
features for small data [54.223614679807994]
3D convolutional neural networks (CNNs) have started to show superior performance to 2D CNNs in numerous deep learning tasks.
Applying transfer learning on 3D CNN is challenging due to a lack of publicly available pre-trained 3D models.
In this work, we proposed a novel 2D strategical representation of volumetric data, namely 2.75D.
As a result, 2D CNN networks can also be used to learn volumetric information.
arXiv Detail & Related papers (2020-02-11T08:24:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.