Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and
  DINOv2 in Medical Imaging Classification
        - URL: http://arxiv.org/abs/2402.07595v2
- Date: Tue, 13 Feb 2024 15:39:11 GMT
- Title: Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and
  DINOv2 in Medical Imaging Classification
- Authors: Yuning Huang, Jingchen Zou, Lanxi Meng, Xin Yue, Qing Zhao, Jianqiang
  Li, Changwei Song, Gabriel Jimenez, Shaowu Li, Guanghui Fu
- Abstract summary: In this paper, we performed a glioma grading task using three clinical modalities of brain MRI data.
We compared the performance of various pre-trained deep learning models, including those based on ImageNet and DINOv2.
Our findings indicate that in our clinical dataset, DINOv2's performance was not as strong as ImageNet-based pre-trained models.
- Score: 7.205610366609243
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Medical image analysis frequently encounters data scarcity challenges.
Transfer learning has been effective in addressing this issue while conserving
computational resources. The recent advent of foundational models like the
DINOv2, which uses the vision transformer architecture, has opened new
opportunities in the field and gathered significant interest. However, DINOv2's
performance on clinical data still needs to be verified. In this paper, we
performed a glioma grading task using three clinical modalities of brain MRI
data. We compared the performance of various pre-trained deep learning models,
including those based on ImageNet and DINOv2, in a transfer learning context.
Our focus was on understanding the impact of the freezing mechanism on
performance. We also validated our findings on three other types of public
datasets: chest radiography, fundus radiography, and dermoscopy. Our findings
indicate that in our clinical dataset, DINOv2's performance was not as strong
as ImageNet-based pre-trained models, whereas in public datasets, DINOv2
generally outperformed other models, especially when using the frozen
mechanism. Similar performance was observed with various sizes of DINOv2 models
across different tasks. In summary, DINOv2 is viable for medical image
classification tasks, particularly with data resembling natural images.
However, its effectiveness may vary with data that significantly differs from
natural images such as MRI. In addition, employing smaller versions of the
model can be adequate for medical task, offering resource-saving benefits. Our
codes are available at https://github.com/GuanghuiFU/medical_DINOv2_eval.
 
      
        Related papers
        - Evaluating Pre-trained Convolutional Neural Networks and Foundation   Models as Feature Extractors for Content-based Medical Image Retrieval [0.37478492878307323]
 Content-based medical image retrieval (CBMIR) relies on the characteristic features of the images, such as color, texture, shape, and spatial features.
We investigated the CBMIR performance on a subset of the MedMNIST V2 dataset, including eight types of 2D and 3D medical images.
Our results show that, overall, for the 2D datasets, foundation models deliver superior performance by a large margin compared to CNNs.
Our findings confirm that while using larger image sizes (especially for 2D datasets) yields slightly better performance, competitive CBMIR performance can still be achieved even with smaller
 arXiv  Detail & Related papers  (2024-09-14T13:07:30Z)
- Disease Classification and Impact of Pretrained Deep Convolution Neural   Networks on Diverse Medical Imaging Datasets across Imaging Modalities [0.0]
 This paper investigates the intricacies of using pretrained deep convolutional neural networks with transfer learning across diverse medical imaging datasets.
It shows that the use of pretrained models as fixed feature extractors yields poor performance irrespective of the datasets.
It is also found that deeper and more complex architectures did not necessarily result in the best performance.
 arXiv  Detail & Related papers  (2024-08-30T04:51:19Z)
- Evaluating General Purpose Vision Foundation Models for Medical Image   Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks [5.8941124219471055]
 DINOv2 is an open-source foundation model pre-trained with self-supervised learning on 142 million curated natural images.
This study comprehensively evaluates the performance DINOv2 for radiology.
 arXiv  Detail & Related papers  (2023-12-04T21:47:10Z)
- LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical
  Imaging via Second-order Graph Matching [59.01894976615714]
 We introduce LVM-Med, the first family of deep networks trained on large-scale medical datasets.
We have collected approximately 1.3 million medical images from 55 publicly available datasets.
LVM-Med empirically outperforms a number of state-of-the-art supervised, self-supervised, and foundation models.
 arXiv  Detail & Related papers  (2023-06-20T22:21:34Z)
- Performance of GAN-based augmentation for deep learning COVID-19 image
  classification [57.1795052451257]
 The biggest challenge in the application of deep learning to the medical domain is the availability of training data.
Data augmentation is a typical methodology used in machine learning when confronted with a limited data set.
In this work, a StyleGAN2-ADA model of Generative Adversarial Networks is trained on the limited COVID-19 chest X-ray image set.
 arXiv  Detail & Related papers  (2023-04-18T15:39:58Z)
- Vision-Language Modelling For Radiological Imaging and Reports In The
  Low Data Regime [70.04389979779195]
 This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
 arXiv  Detail & Related papers  (2023-03-30T18:20:00Z)
- Mine yOur owN Anatomy: Revisiting Medical Image Segmentation with   Extremely Limited Labels [54.58539616385138]
 We introduce a novel semi-supervised 2D medical image segmentation framework termed Mine yOur owN Anatomy (MONA)
First, prior work argues that every pixel equally matters to the model training; we observe empirically that this alone is unlikely to define meaningful anatomical features.
Second, we construct a set of objectives that encourage the model to be capable of decomposing medical images into a collection of anatomical features.
 arXiv  Detail & Related papers  (2022-09-27T15:50:31Z)
- Learning from few examples: Classifying sex from retinal images via deep
  learning [3.9146761527401424]
 We showcase results for the performance of DL on small datasets to classify patient sex from fundus images.
Our models, developed using approximately 2500 fundus images, achieved test AUC scores of up to 0.72.
This corresponds to a mere 25% decrease in performance despite a nearly 1000-fold decrease in the dataset size.
 arXiv  Detail & Related papers  (2022-07-20T02:47:29Z)
- Interpretation of 3D CNNs for Brain MRI Data Classification [56.895060189929055]
 We extend the previous findings in gender differences from diffusion-tensor imaging on T1 brain MRI scans.
We provide the voxel-wise 3D CNN interpretation comparing the results of three interpretation methods.
 arXiv  Detail & Related papers  (2020-06-20T17:56:46Z)
- Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
 Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
 arXiv  Detail & Related papers  (2020-06-11T17:29:53Z)
- Improving Calibration and Out-of-Distribution Detection in Medical Image
  Segmentation with Convolutional Neural Networks [8.219843232619551]
 Convolutional Neural Networks (CNNs) have shown to be powerful medical image segmentation models.
We advocate for multi-task learning, i.e., training a single model on several different datasets.
We show that not only a single CNN learns to automatically recognize the context and accurately segment the organ of interest in each context, but also that such a joint model often has more accurate and better-calibrated predictions.
 arXiv  Detail & Related papers  (2020-04-12T23:42:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.