Pubic Symphysis-Fetal Head Segmentation Using Full Transformer with
Bi-level Routing Attention
- URL: http://arxiv.org/abs/2310.00289v1
- Date: Sat, 30 Sep 2023 07:45:50 GMT
- Title: Pubic Symphysis-Fetal Head Segmentation Using Full Transformer with
Bi-level Routing Attention
- Authors: Pengzhou Cai
- Abstract summary: We propose a method, named BRAU-Net, to solve the pubic symphysis-fetal head segmentation task.
The method adopts a U-Net-like pure Transformer architecture with bi-level routing attention and skip connections, which effectively learns local-global semantic information.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a method, named BRAU-Net, to solve the pubic
symphysis-fetal head segmentation task. The method adopts a U-Net-like pure
Transformer architecture with bi-level routing attention and skip connections,
which effectively learns local-global semantic information. The proposed
BRAU-Net was evaluated on transperineal Ultrasound images dataset from the
pubic symphysis-fetal head segmentation and angle of progression (FH-PS-AOP)
challenge. The results demonstrate that the proposed BRAU-Net achieves
comparable a final score. The codes will be available at
https://github.com/Caipengzhou/BRAU-Net.
Related papers
- Pubic Symphysis-Fetal Head Segmentation Network Using BiFormer Attention Mechanism and Multipath Dilated Convolution [6.673262517388075]
Pubic symphysis-fetal head segmentation in transperineal ultrasound images plays a critical role for the assessment of fetal head descent and progression.
We introduce a dynamic, query-aware sparse attention mechanism for ultrasound image segmentation.
We propose a novel method, named BRAU-Net, to solve the pubic symphysis-fetal head segmentation task.
arXiv Detail & Related papers (2024-10-14T10:14:04Z) - MiTU-Net: A fine-tuned U-Net with SegFormer backbone for segmenting
pubic symphysis-fetal head [0.0]
We propose the Mix Transformer U-Net (MiTU-Net) network for automatic fetal head-pubic symphysis segmentation and AoP measurement.
MiTU-Net is based on an encoder-decoder framework, utilizing a pre-trained efficient transformer to enhance feature representation.
Our model achieves competitive performance, ranking 5th compared to existing approaches.
arXiv Detail & Related papers (2024-01-27T21:53:05Z) - BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation [11.986549780782724]
We propose a hybrid yet effective CNN-Transformer network, named BRAU-Net++, for an accurate medical image segmentation task.
Specifically, BRAU-Net++ uses bi-level routing attention as the core building block to design our u-shaped encoder-decoder structure.
Our proposed approach surpasses other state-of-the-art methods including its baseline: BRAU-Net.
arXiv Detail & Related papers (2024-01-01T10:49:09Z) - Interactive Segmentation as Gaussian Process Classification [58.44673380545409]
Click-based interactive segmentation (IS) aims to extract the target objects under user interaction.
Most of the current deep learning (DL)-based methods mainly follow the general pipelines of semantic segmentation.
We propose to formulate the IS task as a Gaussian process (GP)-based pixel-wise binary classification model on each image.
arXiv Detail & Related papers (2023-02-28T14:01:01Z) - GDB: Gated convolutions-based Document Binarization [0.0]
We formulate text extraction as the learning of gating values and propose an end-to-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction.
Our proposed framework consists of two stages. Firstly, a coarse sub-network with an extra edge branch is trained to get more precise feature maps by feeding a priori mask and edge.
Secondly, a refinement sub-network is cascaded to refine the output of the first stage by gated convolutions based on the sharp edge.
arXiv Detail & Related papers (2023-02-04T02:56:40Z) - A Trio-Method for Retinal Vessel Segmentation using Image Processing [0.0]
This paper primarily focuses on the segmentation of retinal vessels using a triple preprocessing approach.
Two proposed U-Net architectures were compared in terms of all the standard performance metrics.
This real-time deployment can help in the efficient pre-processing of images with better segmentation and detection.
arXiv Detail & Related papers (2022-09-19T22:07:34Z) - On the Effective Number of Linear Regions in Shallow Univariate ReLU
Networks: Convergence Guarantees and Implicit Bias [50.84569563188485]
We show that gradient flow converges in direction when labels are determined by the sign of a target network with $r$ neurons.
Our result may already hold for mild over- parameterization, where the width is $tildemathcalO(r)$ and independent of the sample size.
arXiv Detail & Related papers (2022-05-18T16:57:10Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - Batch Coherence-Driven Network for Part-aware Person Re-Identification [79.33809815035127]
Existing part-aware person re-identification methods typically employ two separate steps: namely, body part detection and part-level feature extraction.
We propose NetworkBCDNet that bypasses body part during both the training and testing phases while still semantically aligned features.
arXiv Detail & Related papers (2020-09-21T09:04:13Z) - Boundary-assisted Region Proposal Networks for Nucleus Segmentation [89.69059532088129]
Machine learning models cannot perform well because of large amount of crowded nuclei.
We devise a Boundary-assisted Region Proposal Network (BRP-Net) that achieves robust instance-level nucleus segmentation.
arXiv Detail & Related papers (2020-06-04T08:26:38Z) - Holistically-Attracted Wireframe Parsing [123.58263152571952]
This paper presents a fast and parsimonious parsing method to detect a vectorized wireframe in an input image with a single forward pass.
The proposed method is end-to-end trainable, consisting of three components: (i) line segment and junction proposal generation, (ii) line segment and junction matching, and (iii) line segment and junction verification.
arXiv Detail & Related papers (2020-03-03T17:43:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.