Related papers: Endoscopy Classification Model Using Swin Transformer and Saliency Map

URL: http://arxiv.org/abs/2303.06736v1
Date: Sun, 12 Mar 2023 19:36:31 GMT
Title: Endoscopy Classification Model Using Swin Transformer and Saliency Map
Authors: Zahra Sobhaninia, Nasrin Abharian, Nader Karimi, Shahram Shirani, Shadrokh Samavi
Abstract summary: We propose a new multi-label classification method, which considers two aspects of learning approaches (local and global views) for endoscopic image classification. The results demonstrate that this method performed well for endoscopic medical images by utilizing local and global features of the images.
Score: 11.031841470875571
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Endoscopy is a valuable tool for the early diagnosis of colon cancer. However, it requires the expertise of endoscopists and is a time-consuming process. In this work, we propose a new multi-label classification method, which considers two aspects of learning approaches (local and global views) for endoscopic image classification. The model consists of a Swin transformer branch and a modified VGG16 model as a CNN branch. To help the learning process of the CNN branch, the model employs saliency maps and endoscopy images and concatenates them. The results demonstrate that this method performed well for endoscopic medical images by utilizing local and global features of the images. Furthermore, quantitative evaluations prove the proposed method's superiority over state-of-the-art works.

Related papers

Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification [1.091626241764448]
Diabetic Retinopathy (DR) is a primary cause of blindness, necessitating early detection and diagnosis. We develop an advanced cross-learning DR classification method leveraging transfer learning and cross-attention mechanisms. Our experiments, utilizing two public datasets, demonstrate a superior accuracy of 94.6%, surpassing current state-of-the-art methods by 4.4%.
arXiv Detail & Related papers (2024-11-06T02:23:38Z)
A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERT [0.62914438169038]
This paper presents an advanced approach for fine-tuning BiomedCLIP PubMedBERT, a multimodal model, to classify abnormalities in Video Capsule Endoscopy frames. Our method categorizes images into ten specific classes: angioectasia, bleeding, erosion, erythema, foreign body, lymphangiectasia, polyp, ulcer, worms, and normal. Performance metrics, including classification accuracy, recall, and F1 score, indicate the model's strong ability to accurately identify abnormalities in endoscopic frames.
arXiv Detail & Related papers (2024-10-25T19:42:57Z)
A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images [0.261201916989931]
The purpose of this study is to explore the feasibility of building a workflow that efficiently routes images to trained segmentation models. By implementing a deep learning model to automatically classify the images and route them to the appropriate segmentation models, we hope our workflow can accurately segment images with different pathologies.
arXiv Detail & Related papers (2024-05-02T18:05:37Z)
Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification [58.147396879490124]
Our few-shot generation method, named XM-GAN, takes one base and a pair of reference tissue images as input and generates high-quality yet diverse images. To the best of our knowledge, we are the first to investigate few-shot generation in colorectal tissue images.
arXiv Detail & Related papers (2023-04-04T17:50:30Z)
MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer [53.575573940055335]
We propose a novel Transformer-based Diffusion framework, called MedSegDiff-V2. We verify its effectiveness on 20 medical image segmentation tasks with different image modalities.
arXiv Detail & Related papers (2023-01-19T03:42:36Z)
Semi-supervised GAN for Bladder Tissue Classification in Multi-Domain Endoscopic Images [10.48945682277992]
We propose a semi-supervised Generative Adversarial Network (GAN)-based method composed of three main components. The overall average classification accuracy, precision, and recall obtained with the proposed method for tissue classification are 0.90, 0.88, and 0.89, respectively.
arXiv Detail & Related papers (2022-12-21T21:32:36Z)
Stain based contrastive co-training for histopathological image analysis [61.87751502143719]
We propose a novel semi-supervised learning approach for classification of histopathology images. We employ strong supervision with patch-level annotations combined with a novel co-training loss to create a semi-supervised learning framework. We evaluate our approach on clear cell renal cell and prostate carcinomas, and demonstrate improvement over state-of-the-art semi-supervised learning methods.
arXiv Detail & Related papers (2022-06-24T22:25:31Z)
Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images. We apply the generated images to medical image enhancement and use the enhanced results to cope with the low-contrast problem. Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z)
Malignancy Prediction and Lesion Identification from Clinical Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images. Our approach first identifies all lesions present in the image regardless of sub-type or likelihood of malignancy, then estimates their likelihood of malignancy, and through aggregation also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z)
Joint Learning of Vessel Segmentation and Artery/Vein Classification with Post-processing [27.825969553813092]
Vessel segmentation and artery/vein classification provide various information on potential disorders. We adopt a UNet-based model, SeqNet, to accurately segment vessels from the background and predict the vessel type. Our experiments show that our method improves AUC to 0.98 for segmentation and accuracy to 0.92 for classification on the DRIVE dataset.
arXiv Detail & Related papers (2020-05-27T13:06:16Z)
Weakly supervised multiple instance learning histopathological tumor segmentation [51.085268272912415]
We propose a weakly supervised framework for whole slide imaging segmentation. We exploit a multiple instance learning scheme for training models. The proposed framework has been evaluated on multi-locations and multi-centric public data from The Cancer Genome Atlas and the PatchCamelyon dataset.
arXiv Detail & Related papers (2020-04-10T13:12:47Z)

This list is automatically generated from the titles and abstracts of the papers on this site.