Patch Network for medical image Segmentation
- URL: http://arxiv.org/abs/2302.11802v1
- Date: Thu, 23 Feb 2023 06:29:31 GMT
- Title: Patch Network for medical image Segmentation
- Authors: Weihu Song and Heng Yu and Jianhua Wu
- Abstract summary: We present a Patch Network (PNet) that incorporates the Swin Transformer notion into a convolutional neural network.
Our PNet achieves SOTA performance in both speed and accuracy.
- Score: 10.893993462772409
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate and fast segmentation of medical images is clinically essential, yet
current research methods include convolutional neural networks with fast
inference speed but difficulty in learning image contextual features, and
transformer with good performance but high hardware requirements. In this
paper, we present a Patch Network (PNet) that incorporates the Swin Transformer
notion into a convolutional neural network, allowing it to gather richer
contextual information while achieving the balance of speed and accuracy. We
test our PNet on Polyp(CVC-ClinicDB and ETIS- LaribPolypDB), Skin(ISIC-2018
Skin lesion segmentation challenge dataset) segmentation datasets. Our PNet
achieves SOTA performance in both speed and accuracy.
Related papers
- Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation [29.37619692272332]
We propose a novel network architecture named CTO, which combines Convolutional Neural Networks (CNNs), Vision Transformer (ViT) models, and explicit edge detection operators.<n>CTO surpasses existing methods in terms of segmentation accuracy and strikes a better balance between accuracy and efficiency.<n>We validate the performance of CTO through extensive experiments conducted on seven challenging medical image segmentation datasets.
arXiv Detail & Related papers (2025-05-06T19:42:56Z) - Toward Relative Positional Encoding in Spiking Transformers [52.62008099390541]
Spiking neural networks (SNNs) are bio-inspired networks that model how neurons in the brain communicate through discrete spikes.
In this paper, we introduce an approximate method for relative positional encoding (RPE) in Spiking Transformers.
arXiv Detail & Related papers (2025-01-28T06:42:37Z) - TCCT-Net: Two-Stream Network Architecture for Fast and Efficient Engagement Estimation via Behavioral Feature Signals [58.865901821451295]
We present a novel two-stream feature fusion "Tensor-Convolution and Convolution-Transformer Network" (TCCT-Net) architecture.
To better learn the meaningful patterns in the temporal-spatial domain, we design a "CT" stream that integrates a hybrid convolutional-transformer.
In parallel, to efficiently extract rich patterns from the temporal-frequency domain, we introduce a "TC" stream that uses Continuous Wavelet Transform (CWT) to represent information in a 2D tensor form.
arXiv Detail & Related papers (2024-04-15T06:01:48Z) - SLP-Net:An efficient lightweight network for segmentation of skin
lesions [9.812172372998358]
SLP-Net is an ultra-lightweight skin lesion segmentation network based on the spiking neural P(SNP) systems type mechanism.
We design a lightweight multi-scale feature extractor without the usual encoder-decoder structure.
Experiments at the ISIC2018 challenge demonstrate that the proposed model has the highest Acc and DSC among the state-of-the-art methods.
arXiv Detail & Related papers (2023-12-20T06:22:21Z) - PHG-Net: Persistent Homology Guided Medical Image Classification [14.450329809640422]
We propose a persistent homology guided approach (PHG-Net) that explores topological features of objects for medical image classification.
For an input image, we first compute its cubical persistence diagram and extract topological features into a vector representation.
The extracted topological features are then incorporated into the feature map generated by CNN or Transformer for feature fusion.
arXiv Detail & Related papers (2023-11-28T21:34:06Z) - CMUNeXt: An Efficient Medical Image Segmentation Network based on Large
Kernel and Skip Fusion [11.434576556863934]
CMUNeXt is an efficient fully convolutional lightweight medical image segmentation network.
It enables fast and accurate auxiliary diagnosis in real scene scenarios.
arXiv Detail & Related papers (2023-08-02T15:54:00Z) - Deep Multi-Threshold Spiking-UNet for Image Processing [51.88730892920031]
This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture.
To achieve an efficient Spiking-UNet, we face two primary challenges: ensuring high-fidelity information propagation through the network via spikes and formulating an effective training strategy.
Experimental results show that, on image segmentation and denoising, our Spiking-UNet achieves comparable performance to its non-spiking counterpart.
arXiv Detail & Related papers (2023-07-20T16:00:19Z) - Data-Efficient Vision Transformers for Multi-Label Disease
Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images.
ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present.
Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z) - TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation [4.768284586442054]
We propose TFCNs (Transformers for Fully Convolutional denseNets) to tackle the problem of medical image segmentation.
We show that TFCNs can achieve state-of-the-art performance with dice scores of 83.72% on the Synapse dataset.
arXiv Detail & Related papers (2022-07-07T17:28:18Z) - CCAT-NET: A Novel Transformer Based Semi-supervised Framework for
Covid-19 Lung Lesion Segmentation [8.90602077660994]
We propose a novel network structure that combines CNN and Transformer for the segmentation of COVID-19 lesions.
We also propose an efficient semi-supervised learning framework to address the shortage of labeled data.
arXiv Detail & Related papers (2022-04-06T14:05:48Z) - Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation [63.46694853953092]
Swin-Unet is an Unet-like pure Transformer for medical image segmentation.
tokenized image patches are fed into the Transformer-based U-shaped decoder-Decoder architecture.
arXiv Detail & Related papers (2021-05-12T09:30:26Z) - Weakly-supervised Learning For Catheter Segmentation in 3D Frustum
Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method.
The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z) - Collaborative Boundary-aware Context Encoding Networks for Error Map
Prediction [65.44752447868626]
We propose collaborative boundaryaware context encoding networks called AEP-Net for error prediction task.
Specifically, we propose a collaborative feature transformation branch for better feature fusion between images and masks, and precise localization of error regions.
The AEP-Net achieves an average DSC of 0.8358, 0.8164 for error prediction task, and shows a high Pearson correlation coefficient of 0.9873.
arXiv Detail & Related papers (2020-06-25T12:42:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.