Endoscopy Classification Model Using Swin Transformer and Saliency Map
- URL: http://arxiv.org/abs/2303.06736v1
- Date: Sun, 12 Mar 2023 19:36:31 GMT
- Title: Endoscopy Classification Model Using Swin Transformer and Saliency Map
- Authors: Zahra Sobhaninia, Nasrin Abharian, Nader Karimi, Shahram Shirani,
Shadrokh Samavi
- Abstract summary: We propose a new multi-label classification method, which considers two aspects of learning approaches (local and global views) for endoscopic image classification.
The results demonstrate that this method performed well for endoscopic medical images by utilizing local and global features of the images.
- Score: 11.031841470875571
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Endoscopy is a valuable tool for the early diagnosis of colon cancer.
However, it requires the expertise of endoscopists and is a time-consuming
process. In this work, we propose a new multi-label classification method,
which considers two aspects of learning approaches (local and global views) for
endoscopic image classification. The model consists of a Swin transformer
branch and a modified VGG16 model as a CNN branch. To help the learning process
of the CNN branch, the model employs saliency maps and endoscopy images and
concatenates them. The results demonstrate that this method performed well for
endoscopic medical images by utilizing local and global features of the images.
Furthermore, quantitative evaluations prove the proposed method's superiority
over state-of-the-art works.
Related papers
- Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification [1.091626241764448]
Diabetic Retinopathy (DR) is a primary cause of blindness, necessitating early detection and diagnosis.
We develop an advanced cross-learning DR classification method leveraging transfer learning and cross-attention mechanisms.
Our experiments, utilizing two public datasets, demonstrate a superior accuracy of 94.6%, surpassing current state-of-the-art methods by 4.4%.
arXiv Detail & Related papers (2024-11-06T02:23:38Z) - A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERT [0.62914438169038]
This Paper presents an advanced approach for fine-tuning BiomedCLIP PubMedBERT, a multimodal model, to classify abnormalities in Video Capsule Endoscopy frames.
Our method categorizes images into ten specific classes: angioectasia, bleeding, erosion, erythema, foreign body, lymphangiectasia, polyp, ulcer, worms, and normal.
Performance metrics, including classification, accuracy, recall, and F1 score, indicate the models strong ability to accurately identify abnormalities in endoscopic frames.
arXiv Detail & Related papers (2024-10-25T19:42:57Z) - A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images [0.261201916989931]
The purpose of this study is to explore the feasibility of building a workflow to efficiently trained segmentation models.
By implementing a deep learning model to automatically classify the images and route them appropriate segmentation models, we hope our workflow can segment the images with different pathology accurately.
arXiv Detail & Related papers (2024-05-02T18:05:37Z) - Cross-modulated Few-shot Image Generation for Colorectal Tissue
Classification [58.147396879490124]
Our few-shot generation method, named XM-GAN, takes one base and a pair of reference tissue images as input and generates high-quality yet diverse images.
To the best of our knowledge, we are the first to investigate few-shot generation in colorectal tissue images.
arXiv Detail & Related papers (2023-04-04T17:50:30Z) - MedSegDiff-V2: Diffusion based Medical Image Segmentation with
Transformer [53.575573940055335]
We propose a novel Transformer-based Diffusion framework, called MedSegDiff-V2.
We verify its effectiveness on 20 medical image segmentation tasks with different image modalities.
arXiv Detail & Related papers (2023-01-19T03:42:36Z) - Semi-supervised GAN for Bladder Tissue Classification in Multi-Domain
Endoscopic Images [10.48945682277992]
We propose a semi-surprised Generative Adrial Network (GAN)-based method composed of three main components.
The overall average classification accuracy, precision, and recall obtained with the proposed method for tissue classification are 0.90, 0.88, and 0.89 respectively.
arXiv Detail & Related papers (2022-12-21T21:32:36Z) - Stain based contrastive co-training for histopathological image analysis [61.87751502143719]
We propose a novel semi-supervised learning approach for classification of histovolution images.
We employ strong supervision with patch-level annotations combined with a novel co-training loss to create a semi-supervised learning framework.
We evaluate our approach in clear cell renal cell and prostate carcinomas, and demonstrate improvement over state-of-the-art semi-supervised learning methods.
arXiv Detail & Related papers (2022-06-24T22:25:31Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - Malignancy Prediction and Lesion Identification from Clinical
Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images.
We first identify all lesions present in the image regardless of sub-type or likelihood of malignancy, then it estimates their likelihood of malignancy, and through aggregation, it also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z) - Joint Learning of Vessel Segmentation and Artery/Vein Classification
with Post-processing [27.825969553813092]
Vessel segmentation and artery/vein classification provide various information on potential disorders.
We adopt a UNet-based model, SeqNet, to accurately segment vessels from the background and make prediction on the vessel type.
Our experiments show that our method improves AUC to 0.98 for segmentation and the accuracy to 0.92 in classification over DRIVE dataset.
arXiv Detail & Related papers (2020-05-27T13:06:16Z) - Weakly supervised multiple instance learning histopathological tumor
segmentation [51.085268272912415]
We propose a weakly supervised framework for whole slide imaging segmentation.
We exploit a multiple instance learning scheme for training models.
The proposed framework has been evaluated on multi-locations and multi-centric public data from The Cancer Genome Atlas and the PatchCamelyon dataset.
arXiv Detail & Related papers (2020-04-10T13:12:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.