Pubic Symphysis-Fetal Head Segmentation Using Pure Transformer with Bi-level Routing Attention
- URL: http://arxiv.org/abs/2310.00289v3
- Date: Thu, 14 Nov 2024 02:08:40 GMT
- Title: Pubic Symphysis-Fetal Head Segmentation Using Pure Transformer with Bi-level Routing Attention
- Authors: Pengzhou Cai, Lu Jiang, Yanxin Li, Libin Lan,
- Abstract summary: We propose a method, named BRAU-Net, to solve the pubic symphysis-fetal head segmentation task.
The method adopts a U-Net-like pure Transformer architecture with bi-level routing attention and skip connections, which effectively learns local-global semantic information.
- Score: 6.709399356217316
- License:
- Abstract: In this paper, we propose a method, named BRAU-Net, to solve the pubic symphysis-fetal head segmentation task. The method adopts a U-Net-like pure Transformer architecture with bi-level routing attention and skip connections, which effectively learns local-global semantic information. The proposed BRAU-Net was evaluated on transperineal Ultrasound images dataset from the pubic symphysis-fetal head segmentation and angle of progression (FH-PS-AOP) challenge. The results demonstrate that the proposed BRAU-Net achieves comparable a final score. The codes will be available at https://github.com/Caipengzhou/BRAU-Net.
Related papers
- Pubic Symphysis-Fetal Head Segmentation Network Using BiFormer Attention Mechanism and Multipath Dilated Convolution [6.673262517388075]
Pubic symphysis-fetal head segmentation in transperineal ultrasound images plays a critical role for the assessment of fetal head descent and progression.
We introduce a dynamic, query-aware sparse attention mechanism for ultrasound image segmentation.
We propose a novel method, named BRAU-Net, to solve the pubic symphysis-fetal head segmentation task.
arXiv Detail & Related papers (2024-10-14T10:14:04Z) - MiTU-Net: A fine-tuned U-Net with SegFormer backbone for segmenting
pubic symphysis-fetal head [0.0]
We propose the Mix Transformer U-Net (MiTU-Net) network for automatic fetal head-pubic symphysis segmentation and AoP measurement.
MiTU-Net is based on an encoder-decoder framework, utilizing a pre-trained efficient transformer to enhance feature representation.
Our model achieves competitive performance, ranking 5th compared to existing approaches.
arXiv Detail & Related papers (2024-01-27T21:53:05Z) - GDB: Gated convolutions-based Document Binarization [0.0]
We formulate text extraction as the learning of gating values and propose an end-to-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction.
Our proposed framework consists of two stages. Firstly, a coarse sub-network with an extra edge branch is trained to get more precise feature maps by feeding a priori mask and edge.
Secondly, a refinement sub-network is cascaded to refine the output of the first stage by gated convolutions based on the sharp edge.
arXiv Detail & Related papers (2023-02-04T02:56:40Z) - A Trio-Method for Retinal Vessel Segmentation using Image Processing [0.0]
This paper primarily focuses on the segmentation of retinal vessels using a triple preprocessing approach.
Two proposed U-Net architectures were compared in terms of all the standard performance metrics.
This real-time deployment can help in the efficient pre-processing of images with better segmentation and detection.
arXiv Detail & Related papers (2022-09-19T22:07:34Z) - On the Effective Number of Linear Regions in Shallow Univariate ReLU
Networks: Convergence Guarantees and Implicit Bias [50.84569563188485]
We show that gradient flow converges in direction when labels are determined by the sign of a target network with $r$ neurons.
Our result may already hold for mild over- parameterization, where the width is $tildemathcalO(r)$ and independent of the sample size.
arXiv Detail & Related papers (2022-05-18T16:57:10Z) - GT U-Net: A U-Net Like Group Transformer Network for Tooth Root
Segmentation [11.177365038111438]
We propose a novel end-to-end U-Net like Group Transformer Network (GT U-Net) for the tooth root segmentation.
The proposed network retains the essential structure of U-Net but each of the encoders and decoders is replaced by a group Transformer.
Our proposed network achieves the state-of-the-art performance on our collected tooth root segmentation dataset and the public retina dataset.
arXiv Detail & Related papers (2021-09-30T02:39:07Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - Batch Coherence-Driven Network for Part-aware Person Re-Identification [79.33809815035127]
Existing part-aware person re-identification methods typically employ two separate steps: namely, body part detection and part-level feature extraction.
We propose NetworkBCDNet that bypasses body part during both the training and testing phases while still semantically aligned features.
arXiv Detail & Related papers (2020-09-21T09:04:13Z) - Boundary-assisted Region Proposal Networks for Nucleus Segmentation [89.69059532088129]
Machine learning models cannot perform well because of large amount of crowded nuclei.
We devise a Boundary-assisted Region Proposal Network (BRP-Net) that achieves robust instance-level nucleus segmentation.
arXiv Detail & Related papers (2020-06-04T08:26:38Z) - MetricUNet: Synergistic Image- and Voxel-Level Learning for Precise CT
Prostate Segmentation via Online Sampling [66.01558025094333]
We propose a two-stage framework, with the first stage to quickly localize the prostate region and the second stage to precisely segment the prostate.
We introduce a novel online metric learning module through voxel-wise sampling in the multi-task network.
Our method can effectively learn more representative voxel-level features compared with the conventional learning methods with cross-entropy or Dice loss.
arXiv Detail & Related papers (2020-05-15T10:37:02Z) - Holistically-Attracted Wireframe Parsing [123.58263152571952]
This paper presents a fast and parsimonious parsing method to detect a vectorized wireframe in an input image with a single forward pass.
The proposed method is end-to-end trainable, consisting of three components: (i) line segment and junction proposal generation, (ii) line segment and junction matching, and (iii) line segment and junction verification.
arXiv Detail & Related papers (2020-03-03T17:43:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.