BARNet: Bilinear Attention Network with Adaptive Receptive Fields for
Surgical Instrument Segmentation
- URL: http://arxiv.org/abs/2001.07093v4
- Date: Fri, 22 May 2020 03:12:14 GMT
- Title: BARNet: Bilinear Attention Network with Adaptive Receptive Fields for
Surgical Instrument Segmentation
- Authors: Zhen-Liang Ni, Gui-Bin Bian, Guan-An Wang, Xiao-Hu Zhou, Zeng-Guang
Hou, Xiao-Liang Xie, Zhen Li and Yu-Han Wang
- Abstract summary: We propose a novel bilinear attention network with adaptive receptive field to solve these two challenges.
The proposed network achieves the best performance 97.47% mean IOU on Cata7 and comes first place on EndoVis 2017 by 10.10% IOU overtaking second-ranking method.
- Score: 26.44585036105453
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Surgical instrument segmentation is extremely important for computer-assisted
surgery. Different from common object segmentation, it is more challenging due
to the large illumination and scale variation caused by the special surgical
scenes. In this paper, we propose a novel bilinear attention network with
adaptive receptive field to solve these two challenges. For the illumination
variation, the bilinear attention module can capture second-order statistics to
encode global contexts and semantic dependencies between local pixels. With
them, semantic features in challenging areas can be inferred from their
neighbors and the distinction of various semantics can be boosted. For the
scale variation, our adaptive receptive field module aggregates multi-scale
features and automatically fuses them with different weights. Specifically, it
encodes the semantic relationship between channels to emphasize feature maps
with appropriate scales, changing the receptive field of subsequent
convolutions. The proposed network achieves the best performance 97.47% mean
IOU on Cata7 and comes first place on EndoVis 2017 by 10.10% IOU overtaking
second-ranking method.
Related papers
- MSA$^2$Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation [8.404273502720136]
We introduce MSA$2$Net, a new deep segmentation framework featuring an expedient design of skip-connections.
We propose a Multi-Scale Adaptive Spatial Attention Gate (MASAG) to ensure that spatially relevant features are selectively highlighted.
Our MSA$2$Net outperforms state-of-the-art (SOTA) works or matches their performance.
arXiv Detail & Related papers (2024-07-31T14:41:10Z) - Adaptive Feature Fusion Neural Network for Glaucoma Segmentation on Unseen Fundus Images [13.03504366061946]
We propose a method named Adaptive Feature-fusion Neural Network (AFNN) for glaucoma segmentation on unseen domains.
The domain adaptor helps the pretrained-model fast adapt from other image domains to the medical fundus image domain.
Our proposed method achieves a competitive performance over existing fundus segmentation methods on four public glaucoma datasets.
arXiv Detail & Related papers (2024-04-02T16:30:12Z) - Self-Supervised Correction Learning for Semi-Supervised Biomedical Image
Segmentation [84.58210297703714]
We propose a self-supervised correction learning paradigm for semi-supervised biomedical image segmentation.
We design a dual-task network, including a shared encoder and two independent decoders for segmentation and lesion region inpainting.
Experiments on three medical image segmentation datasets for different tasks demonstrate the outstanding performance of our method.
arXiv Detail & Related papers (2023-01-12T08:19:46Z) - UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer
via Hierarchical Mask Calibration [49.16591283724376]
We design UniDAformer, a unified domain adaptive panoptic segmentation transformer that is simple but can achieve domain adaptive instance segmentation and semantic segmentation simultaneously within a single network.
UniDAformer introduces Hierarchical Mask (HMC) that rectifies inaccurate predictions at the level of regions, superpixels and annotated pixels via online self-training on the fly.
It has three unique features: 1) it enables unified domain adaptive panoptic adaptation; 2) it mitigates false predictions and improves domain adaptive panoptic segmentation effectively; 3) it is end-to-end trainable with a much simpler training and inference pipeline.
arXiv Detail & Related papers (2022-06-30T07:32:23Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - DONet: Dual Objective Networks for Skin Lesion Segmentation [77.9806410198298]
We propose a simple yet effective framework, named Dual Objective Networks (DONet), to improve the skin lesion segmentation.
Our DONet adopts two symmetric decoders to produce different predictions for approaching different objectives.
To address the challenge of large variety of lesion scales and shapes in dermoscopic images, we additionally propose a recurrent context encoding module (RCEM)
arXiv Detail & Related papers (2020-08-19T06:02:46Z) - Phase Consistent Ecological Domain Adaptation [76.75730500201536]
We focus on the task of semantic segmentation, where annotated synthetic data are aplenty, but annotating real data is laborious.
The first criterion, inspired by visual psychophysics, is that the map between the two image domains be phase-preserving.
The second criterion aims to leverage ecological statistics, or regularities in the scene which are manifest in any image of it, regardless of the characteristics of the illuminant or the imaging sensor.
arXiv Detail & Related papers (2020-04-10T06:58:03Z) - Gated Path Selection Network for Semantic Segmentation [72.44994579325822]
We develop a novel network named Gated Path Selection Network (GPSNet), which aims to learn adaptive receptive fields.
In GPSNet, we first design a two-dimensional multi-scale network - SuperNet, which densely incorporates features from growing receptive fields.
To dynamically select desirable semantic context, a gate prediction module is further introduced.
arXiv Detail & Related papers (2020-01-19T12:32:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.