Efficient and Accurate Pneumonia Detection Using a Novel Multi-Scale Transformer Approach
- URL: http://arxiv.org/abs/2408.04290v2
- Date: Sun, 3 Nov 2024 11:51:50 GMT
- Title: Efficient and Accurate Pneumonia Detection Using a Novel Multi-Scale Transformer Approach
- Authors: Alireza Saber, Pouria Parhami, Alimohammad Siahkarzadeh, Mansoor Fateh, Amirreza Fateh,
- Abstract summary: We propose a novel approach combining deep learning and transformer-based attention mechanisms to enhance pneumonia detection from chest X-rays.
Our method begins with lung segmentation using a TransUNet model that integrates our specialized transformer module.
Our approach achieves high accuracy rates of 92.79% on the Kermany dataset and 95.11% on the Cohen dataset.
- Score: 1.2233362977312945
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pneumonia, a severe respiratory disease, poses significant diagnostic challenges, especially in underdeveloped regions. Traditional diagnostic methods, such as chest X-rays, suffer from variability in interpretation among radiologists, necessitating reliable automated tools. In this study, we propose a novel approach combining deep learning and transformer-based attention mechanisms to enhance pneumonia detection from chest X-rays. Our method begins with lung segmentation using a TransUNet model that integrates our specialized transformer module, which has fewer parameters compared to common transformers while maintaining performance. This model is trained on the "Chest Xray Masks and Labels" dataset and then applied to the Kermany and Cohen datasets to isolate lung regions, enhancing subsequent classification tasks. For classification, we employ pre-trained ResNet models (ResNet-50 and ResNet-101) to extract multi-scale feature maps, processed through our modified transformer module. By employing our specialized transformer, we attain superior results with significantly fewer parameters compared to common transformer models. Our approach achieves high accuracy rates of 92.79% on the Kermany dataset and 95.11% on the Cohen dataset, ensuring robust and efficient performance suitable for resource-constrained environments. "https://github.com/amirrezafateh/Multi-Scale-Transformer-Pneumonia"
Related papers
- A novel method to enhance pneumonia detection via a model-level
ensembling of CNN and vision transformer [0.7499722271664147]
Pneumonia remains a leading cause of morbidity and mortality worldwide.
Deep learning has shown immense potential for pneumonia detection from Chest X-ray (CXR) imaging.
We developed a novel model fusing Convolution Neural networks (CNN) and Vision Transformer networks via model-level ensembling.
arXiv Detail & Related papers (2024-01-04T16:58:31Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Vision Transformer-based Model for Severity Quantification of Lung
Pneumonia Using Chest X-ray Images [11.12596879975844]
We present a Vision Transformer-based neural network model that relies on a small number of trainable parameters to quantify the severity of COVID-19 and other lung diseases.
Our model can provide peak performance in quantifying severity with high generalizability at a relatively low computational cost.
arXiv Detail & Related papers (2023-03-18T12:38:23Z) - Improved lung segmentation based on U-Net architecture and morphological
operations [0.0]
This paper presents a reliable model for the segmentation of lungs in chest radiographs.
Our model overcomes the challenges by learning to ignore unimportant areas in the source Chest Radiograph.
The proposed model has a DICE coefficient of 98.1 percent which demonstrates the reliability of our model.
arXiv Detail & Related papers (2022-10-19T13:32:00Z) - Preservation of High Frequency Content for Deep Learning-Based Medical
Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists.
We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z) - Towards Data-Efficient Detection Transformers [77.43470797296906]
We show most detection transformers suffer from significant performance drops on small-size datasets.
We empirically analyze the factors that affect data efficiency, through a step-by-step transition from a data-efficient RCNN variant to the representative DETR.
We introduce a simple yet effective label augmentation method to provide richer supervision and improve data efficiency.
arXiv Detail & Related papers (2022-03-17T17:56:34Z) - Efficient Vision Transformers via Fine-Grained Manifold Distillation [96.50513363752836]
Vision transformer architectures have shown extraordinary performance on many computer vision tasks.
Although the network performance is boosted, transformers are often required more computational resources.
We propose to excavate useful information from the teacher transformer through the relationship between images and the divided patches.
arXiv Detail & Related papers (2021-07-03T08:28:34Z) - CoRSAI: A System for Robust Interpretation of CT Scans of COVID-19
Patients Using Deep Learning [133.87426554801252]
We adopted an approach based on using an ensemble of deep convolutionalneural networks for segmentation of lung CT scans.
Using our models we are able to segment the lesions, evaluatepatients dynamics, estimate relative volume of lungs affected by lesions and evaluate the lung damage stage.
arXiv Detail & Related papers (2021-05-25T12:06:55Z) - Development of a Multi-Task Learning V-Net for Pulmonary Lobar
Segmentation on Computed Tomography and Application to Diseased Lungs [0.19573380763700707]
Diseased lung regions often produce high-density zones on CT images, limiting an algorithm's execution to specify damaged lobes.
This impact motivated developing an improved machine learning method to segment lung lobes.
The approach can be readily adopted in the clinical setting as a robust tool for radiologists.
arXiv Detail & Related papers (2021-05-11T17:10:25Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z) - Detection of Coronavirus (COVID-19) Associated Pneumonia based on
Generative Adversarial Networks and a Fine-Tuned Deep Transfer Learning Model
using Chest X-ray Dataset [4.664495510551646]
This paper presents a pneumonia chest x-ray detection based on generative adversarial networks (GAN) with a fine-tuned deep transfer learning for a limited dataset.
The dataset used in this research consists of 5863 X-ray images with two categories: Normal and Pneumonia.
arXiv Detail & Related papers (2020-04-02T08:14:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.