PMFSNet: Polarized Multi-scale Feature Self-attention Network For
Lightweight Medical Image Segmentation
- URL: http://arxiv.org/abs/2401.07579v1
- Date: Mon, 15 Jan 2024 10:26:47 GMT
- Title: PMFSNet: Polarized Multi-scale Feature Self-attention Network For
Lightweight Medical Image Segmentation
- Authors: Jiahui Zhong, Wenhong Tian, Yuanlun Xie, Zhijia Liu, Jie Ou, Taoran
Tian and Lei Zhang
- Abstract summary: Current state-of-the-art medical image segmentation methods prioritize accuracy but often at the expense of increased computational demands and larger model sizes.
We propose PMFSNet, a novel medical imaging segmentation model that balances global local feature processing while avoiding computational redundancy.
It incorporates a plug-and-play PMFS block, a multi-scale feature enhancement module based on attention mechanisms, to capture long-term dependencies.
- Score: 6.134314911212846
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Current state-of-the-art medical image segmentation methods prioritize
accuracy but often at the expense of increased computational demands and larger
model sizes. Applying these large-scale models to the relatively limited scale
of medical image datasets tends to induce redundant computation, complicating
the process without the necessary benefits. This approach not only adds
complexity but also presents challenges for the integration and deployment of
lightweight models on edge devices. For instance, recent transformer-based
models have excelled in 2D and 3D medical image segmentation due to their
extensive receptive fields and high parameter count. However, their
effectiveness comes with a risk of overfitting when applied to small datasets
and often neglects the vital inductive biases of Convolutional Neural Networks
(CNNs), essential for local feature representation. In this work, we propose
PMFSNet, a novel medical imaging segmentation model that effectively balances
global and local feature processing while avoiding the computational redundancy
typical in larger models. PMFSNet streamlines the UNet-based hierarchical
structure and simplifies the self-attention mechanism's computational
complexity, making it suitable for lightweight applications. It incorporates a
plug-and-play PMFS block, a multi-scale feature enhancement module based on
attention mechanisms, to capture long-term dependencies. Extensive
comprehensive results demonstrate that even with a model (less than 1 million
parameters), our method achieves superior performance in various segmentation
tasks across different data scales. It achieves (IoU) metrics of 84.68%,
82.02%, and 78.82% on public datasets of teeth CT (CBCT), ovarian tumors
ultrasound(MMOTU), and skin lesions dermoscopy images (ISIC 2018),
respectively. The source code is available at
https://github.com/yykzjh/PMFSNet.
Related papers
- Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation [49.5901368256326]
We propose a novel Domain-Adaptive Prompt framework for fine-tuning the Segment Anything Model (termed as DAPSAM) in segmenting medical images.
Our DAPSAM achieves state-of-the-art performance on two medical image segmentation tasks with different modalities.
arXiv Detail & Related papers (2024-09-19T07:28:33Z) - MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation [3.64388407705261]
We propose a Multi-Scale Vision Mamba UNet model for medical image segmentation, termed MSVM-UNet.
Specifically, by introducing multi-scale convolutions in the VSS blocks, we can more effectively capture and aggregate multi-scale feature representations from the hierarchical features of the VMamba encoder.
arXiv Detail & Related papers (2024-08-25T06:20:28Z) - GCtx-UNet: Efficient Network for Medical Image Segmentation [0.2353157426758003]
GCtx-UNet is a lightweight segmentation architecture that can capture global and local image features with accuracy better than state-of-the-art approaches.
GCtx-UNet is evaluated on the Synapse multi-organ abdominal CT dataset, the ACDC cardiac MRI dataset, and several polyp segmentation datasets.
arXiv Detail & Related papers (2024-06-09T19:17:14Z) - Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention [1.1155836879100416]
We propose a Modality-agnostic Domain Generalizable Network (MADGNet) for medical image segmentation.
MFMSA block refines the process of spatial feature extraction, particularly in capturing boundary features.
E-SDM mitigates information loss in multi-task learning with deep supervision.
arXiv Detail & Related papers (2024-05-10T07:34:36Z) - LiteNeXt: A Novel Lightweight ConvMixer-based Model with Self-embedding Representation Parallel for Medical Image Segmentation [2.0901574458380403]
We propose a new lightweight but efficient model, namely LiteNeXt, for medical image segmentation.
LiteNeXt is trained from scratch with small amount of parameters (0.71M) and Giga Floating Point Operations Per Second (0.42).
arXiv Detail & Related papers (2024-04-04T01:59:19Z) - Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain [48.440691680864745]
We introduce a new neural network architecture, termed LoGoNet, with a tailored self-supervised learning (SSL) method.
LoGoNet integrates a novel feature extractor within a U-shaped architecture, leveraging Large Kernel Attention (LKA) and a dual encoding strategy.
We propose a novel SSL method tailored for 3D images to compensate for the lack of large labeled datasets.
arXiv Detail & Related papers (2024-02-09T05:06:58Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - Heterogenous Memory Augmented Neural Networks [84.29338268789684]
We introduce a novel heterogeneous memory augmentation approach for neural networks.
By introducing learnable memory tokens with attention mechanism, we can effectively boost performance without huge computational overhead.
We show our approach on various image and graph-based tasks under both in-distribution (ID) and out-of-distribution (OOD) conditions.
arXiv Detail & Related papers (2023-10-17T01:05:28Z) - Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
Learning [53.00683059396803]
Mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.
We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for optimal image masking ratio and masking strategy.
Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z) - 3D Medical Image Segmentation based on multi-scale MPU-Net [5.393743755706745]
This paper proposes a tumor segmentation model MPU-Net for patient volume CT images.
It is inspired by Transformer with a global attention mechanism.
Compared with the benchmark model U-Net, MPU-Net shows excellent segmentation results.
arXiv Detail & Related papers (2023-07-11T20:46:19Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.