HierAttn: Effectively Learn Representations from Stage Attention and
Branch Attention for Skin Lesions Diagnosis
- URL: http://arxiv.org/abs/2205.04326v2
- Date: Tue, 10 May 2022 06:49:58 GMT
- Title: HierAttn: Effectively Learn Representations from Stage Attention and
Branch Attention for Skin Lesions Diagnosis
- Authors: Wei Dai, Rui Liu, Tianyi Wu, Min Wang, Jianqin Yin, Jun Liu
- Abstract summary: An accurate and unbiased examination of skin lesions is critical for the early diagnosis and treatment of skin cancers.
Recent studies have developed ensembled convolutional neural networks (CNNs) to classify the images for early diagnosis.
We introduce HierAttn, a lite and effective neural network with hierarchical and self attention.
- Score: 18.026088450803258
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: An accurate and unbiased examination of skin lesions is critical for the
early diagnosis and treatment of skin cancers. The visual feature of the skin
lesions varies significantly because skin images are collected from patients
with different skin colours by using various devices. Recent studies have
developed ensembled convolutional neural networks (CNNs) to classify the images
for early diagnosis. However, the practical use of CNNs is limited because
their network structures are heavyweight and neglect contextual information.
Vision transformers (ViTs) learn the global features by self-attention
mechanisms, but they also have comparatively large model sizes (more than
100M). To address these limitations, we introduce HierAttn, a lite and
effective neural network with hierarchical and self attention. HierAttn applies
a novel strategy based on learning local and global features by a multi-stage
and hierarchical network. The efficacy of HierAttn was evaluated by using the
dermoscopy images dataset ISIC2019 and smartphone photos dataset PAD-UFES-20.
The experimental results show that HierAttn achieves the best top-1 accuracy
and AUC among state-of-the-art mobile networks, including MobileNetV3 and
MobileViT. The code is available at https://github.com/anthonyweidai/HierAttn.
Related papers
- Brain Tumor Classification on MRI in Light of Molecular Markers [61.77272414423481]
Co-deletion of the 1p/19q gene is associated with clinical outcomes in low-grade gliomas.
This study aims to utilize a specially MRI-based convolutional neural network for brain cancer detection.
arXiv Detail & Related papers (2024-09-29T07:04:26Z) - Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary
Task Integration [54.76511683427566]
This research introduces a novel multimodal method for classifying skin lesions, integrating smartphone-captured images with essential clinical and demographic information.
A distinctive aspect of this method is the integration of an auxiliary task focused on super-resolution image prediction.
The experimental evaluations have been conducted using the PAD-UFES20 dataset, applying various deep-learning architectures.
arXiv Detail & Related papers (2024-02-16T05:16:20Z) - LesionAid: Vision Transformers-based Skin Lesion Generation and
Classification [0.0]
This research proposes a novel multi-class prediction framework that classifies skin lesions based on ViT and ViTGAN.
The framework consists of four main phases: ViTGANs, Image processing, and explainable AI.
arXiv Detail & Related papers (2023-02-02T13:52:54Z) - Siamese Neural Networks for Skin Cancer Classification and New Class
Detection using Clinical and Dermoscopic Image Datasets [4.83420384410068]
We evaluate Siamese Neural Networks (SNNs) on both dermoscopic and clinical images of skin lesions.
SNNs allow us to classify images of skin lesions, but also allow us to identify those images which are different from the trained classes.
We obtain top-1 classification accuracy levels of 74.33% and 85.61% on clinical and dermoscopic datasets.
arXiv Detail & Related papers (2022-12-12T18:58:43Z) - Data-Efficient Vision Transformers for Multi-Label Disease
Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images.
ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present.
Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z) - Medulloblastoma Tumor Classification using Deep Transfer Learning with
Multi-Scale EfficientNets [63.62764375279861]
We propose an end-to-end MB tumor classification and explore transfer learning with various input sizes and matching network dimensions.
Using a data set with 161 cases, we demonstrate that pre-trained EfficientNets with larger input resolutions lead to significant performance improvements.
arXiv Detail & Related papers (2021-09-10T13:07:11Z) - Benchmarking convolutional neural networks for diagnosing Lyme disease
from images [0.0]
Lyme disease is one of the most common infectious vector-borne diseases in the world.
Recent studies show that convolutional neural networks (CNNs) perform very well to identify skin lesions from the image.
There is no publicly available EM image dataset for Lyme disease prediction mainly because of privacy concerns.
arXiv Detail & Related papers (2021-06-28T08:28:21Z) - Analysis of skin lesion images with deep learning [0.0]
We evaluate the current state of the art in the classification of dermoscopic images.
Various deep neural network architectures pre-trained on the ImageNet data set are adapted to a combined training data set.
The performance and applicability of these models for the detection of eight classes of skin lesions are examined.
arXiv Detail & Related papers (2021-01-11T10:58:36Z) - Comparisons among different stochastic selection of activation layers
for convolutional neural networks for healthcare [77.99636165307996]
We classify biomedical images using ensembles of neural networks.
We select our activations among the following ones: ReLU, leaky ReLU, Parametric ReLU, ELU, Adaptive Piecewice Linear Unit, S-Shaped ReLU, Swish, Mish, Mexican Linear Unit, Parametric Deformable Linear Unit, Soft Root Sign.
arXiv Detail & Related papers (2020-11-24T01:53:39Z) - A Deep Learning Study on Osteosarcoma Detection from Histological Images [6.341765152919201]
The most common type of primary malignant bone tumor is osteosarcoma.
CNNs can significantly decrease surgeon's workload and make a better prognosis of patient conditions.
CNNs need to be trained on a large amount of data in order to achieve a more trustworthy performance.
arXiv Detail & Related papers (2020-11-02T18:16:17Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.