A novel method to enhance pneumonia detection via a model-level
ensembling of CNN and vision transformer
- URL: http://arxiv.org/abs/2401.02358v1
- Date: Thu, 4 Jan 2024 16:58:31 GMT
- Title: A novel method to enhance pneumonia detection via a model-level
ensembling of CNN and vision transformer
- Authors: Sandeep Angara, Nishith Reddy Mannuru, Aashrith Mannuru, Sharath
Thirunagaru
- Abstract summary: Pneumonia remains a leading cause of morbidity and mortality worldwide.
Deep learning has shown immense potential for pneumonia detection from Chest X-ray (CXR) imaging.
We developed a novel model fusing Convolution Neural networks (CNN) and Vision Transformer networks via model-level ensembling.
- Score: 0.7499722271664147
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pneumonia remains a leading cause of morbidity and mortality worldwide. Chest
X-ray (CXR) imaging is a fundamental diagnostic tool, but traditional analysis
relies on time-intensive expert evaluation. Recently, deep learning has shown
immense potential for automating pneumonia detection from CXRs. This paper
explores applying neural networks to improve CXR-based pneumonia diagnosis. We
developed a novel model fusing Convolution Neural networks (CNN) and Vision
Transformer networks via model-level ensembling. Our fusion architecture
combines a ResNet34 variant and a Multi-Axis Vision Transformer small model.
Both base models are initialized with ImageNet pre-trained weights. The output
layers are removed, and features are combined using a flattening layer before
final classification. Experiments used the Kaggle pediatric pneumonia dataset
containing 1,341 normal and 3,875 pneumonia CXR images. We compared our model
against standalone ResNet34, Vision Transformer, and Swin Transformer Tiny
baseline models using identical training procedures. Extensive data
augmentation, Adam optimization, learning rate warmup, and decay were employed.
The fusion model achieved a state-of-the-art accuracy of 94.87%, surpassing the
baselines. We also attained excellent sensitivity, specificity, kappa score,
and positive predictive value. Confusion matrix analysis confirms fewer
misclassifications. The ResNet34 and Vision Transformer combination enables
jointly learning robust features from CNNs and Transformer paradigms. This
model-level ensemble technique effectively integrates their complementary
strengths for enhanced pneumonia classification.
Related papers
- Brain Tumor Classification on MRI in Light of Molecular Markers [61.77272414423481]
Co-deletion of the 1p/19q gene is associated with clinical outcomes in low-grade gliomas.
This study aims to utilize a specially MRI-based convolutional neural network for brain cancer detection.
arXiv Detail & Related papers (2024-09-29T07:04:26Z) - Efficient and Accurate Pneumonia Detection Using a Novel Multi-Scale Transformer Approach [1.2233362977312945]
We propose a novel approach combining deep learning and transformer-based attention mechanisms to enhance pneumonia detection from chest X-rays.
Our method begins with lung segmentation using a TransUNet model that integrates our specialized transformer module.
Our approach achieves high accuracy rates of 92.79% on the Kermany dataset and 95.11% on the Cohen dataset.
arXiv Detail & Related papers (2024-08-08T08:06:42Z) - Pneumonia Detection on chest X-ray images Using Ensemble of Deep
Convolutional Neural Networks [7.232767871756102]
This paper presents a computer-aided classification of pneumonia, coined as Ensemble Learning (EL), to simplify the diagnosis process on chest X-ray images.
Our proposal is based on Convolutional Neural Network (CNN) models, which are pre-trained CNN models that have been recently employed to enhance the performance of many medical tasks instead of training CNN models from scratch.
The proposed EL approach outperforms other existing state-of-the-art methods, and it obtains an accuracy of 93.91% and a F1-Score of 93.88% on the testing phase.
arXiv Detail & Related papers (2023-12-13T08:28:21Z) - Performance of GAN-based augmentation for deep learning COVID-19 image
classification [57.1795052451257]
The biggest challenge in the application of deep learning to the medical domain is the availability of training data.
Data augmentation is a typical methodology used in machine learning when confronted with a limited data set.
In this work, a StyleGAN2-ADA model of Generative Adversarial Networks is trained on the limited COVID-19 chest X-ray image set.
arXiv Detail & Related papers (2023-04-18T15:39:58Z) - Vision Transformer-based Model for Severity Quantification of Lung
Pneumonia Using Chest X-ray Images [11.12596879975844]
We present a Vision Transformer-based neural network model that relies on a small number of trainable parameters to quantify the severity of COVID-19 and other lung diseases.
Our model can provide peak performance in quantifying severity with high generalizability at a relatively low computational cost.
arXiv Detail & Related papers (2023-03-18T12:38:23Z) - Unsupervised Contrastive Learning based Transformer for Lung Nodule
Detection [6.693379403133435]
Early detection of lung nodules with computed tomography (CT) is critical for the longer survival of lung cancer patients and better quality of life.
Computer-aided detection/diagnosis (CAD) is proven valuable as a second or concurrent reader in this context.
accurate detection of lung nodules remains a challenge for such CAD systems and even radiologists due to variability in size, location, and appearance of lung nodules.
Motivated by recent computer vision techniques, here we present a self-supervised region-based 3D transformer model to identify lung nodules.
arXiv Detail & Related papers (2022-04-30T01:19:00Z) - Wide & Deep neural network model for patch aggregation in CNN-based
prostate cancer detection systems [51.19354417900591]
Prostate cancer (PCa) is one of the leading causes of death among men, with almost 1.41 million new cases and around 375,000 deaths in 2020.
To perform an automatic diagnosis, prostate tissue samples are first digitized into gigapixel-resolution whole-slide images.
Small subimages called patches are extracted and predicted, obtaining a patch-level classification.
arXiv Detail & Related papers (2021-05-20T18:13:58Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - A Multi-Stage Attentive Transfer Learning Framework for Improving
COVID-19 Diagnosis [49.3704402041314]
We propose a multi-stage attentive transfer learning framework for improving COVID-19 diagnosis.
Our proposed framework consists of three stages to train accurate diagnosis models through learning knowledge from multiple source tasks and data of different domains.
Importantly, we propose a novel self-supervised learning method to learn multi-scale representations for lung CT images.
arXiv Detail & Related papers (2021-01-14T01:39:19Z) - Classification of COVID-19 in CT Scans using Multi-Source Transfer
Learning [91.3755431537592]
We propose the use of Multi-Source Transfer Learning to improve upon traditional Transfer Learning for the classification of COVID-19 from CT scans.
With our multi-source fine-tuning approach, our models outperformed baseline models fine-tuned with ImageNet.
Our best performing model was able to achieve an accuracy of 0.893 and a Recall score of 0.897, outperforming its baseline Recall score by 9.3%.
arXiv Detail & Related papers (2020-09-22T11:53:06Z) - Detection of Coronavirus (COVID-19) Associated Pneumonia based on
Generative Adversarial Networks and a Fine-Tuned Deep Transfer Learning Model
using Chest X-ray Dataset [4.664495510551646]
This paper presents a pneumonia chest x-ray detection based on generative adversarial networks (GAN) with a fine-tuned deep transfer learning for a limited dataset.
The dataset used in this research consists of 5863 X-ray images with two categories: Normal and Pneumonia.
arXiv Detail & Related papers (2020-04-02T08:14:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.