Interactive segmentation using U-Net with weight map and dynamic user
interactions
- URL: http://arxiv.org/abs/2111.09740v1
- Date: Thu, 18 Nov 2021 15:08:11 GMT
- Title: Interactive segmentation using U-Net with weight map and dynamic user
interactions
- Authors: Ragavie Pirabaharan and Naimul Khan
- Abstract summary: We propose a novel interactive segmentation framework, where user clicks are dynamically adapted in size based on the current segmentation mask.
The clicked regions form a weight map and are fed to a deep neural network as a novel weighted loss function.
Applying dynamic user click sizes increases the overall accuracy by 5.60% and 10.39% respectively by utilizing only a single user interaction.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Interactive segmentation has recently attracted attention for specialized
tasks where expert input is required to further enhance the segmentation
performance. In this work, we propose a novel interactive segmentation
framework, where user clicks are dynamically adapted in size based on the
current segmentation mask. The clicked regions form a weight map and are fed to
a deep neural network as a novel weighted loss function. To evaluate our loss
function, an interactive U-Net (IU-Net) model which applies both foreground and
background user clicks as the main method of interaction is employed. We train
and validate on the BCV dataset, while testing on spleen and colon cancer CT
images from the MSD dataset to improve the overall segmentation accuracy in
comparison to the standard U-Net using our weighted loss function. Applying
dynamic user click sizes increases the overall accuracy by 5.60% and 10.39%
respectively by utilizing only a single user interaction.
Related papers
- LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion [79.22197702626542]
This paper introduces a framework that explores amodal segmentation for robotic grasping in cluttered scenes.
We propose a Linear-fusion Attention-guided Convolutional Network (LAC-Net)
The results on different datasets show that our method achieves state-of-the-art performance.
arXiv Detail & Related papers (2024-08-06T14:50:48Z) - TETRIS: Towards Exploring the Robustness of Interactive Segmentation [39.1981941213761]
We propose a methodology for finding extreme user inputs by a direct optimization in a white-box adversarial attack on the interactive segmentation model.
We report the results of an extensive evaluation of dozens of models.
arXiv Detail & Related papers (2024-02-09T01:36:21Z) - Contour-based Interactive Segmentation [4.164728134421114]
We consider a natural form of user interaction as a loose contour, and introduce a contour-based interactive segmentation method.
We demonstrate that a single contour provides the same accuracy as multiple clicks, thus reducing the required amount of user interactions.
arXiv Detail & Related papers (2023-02-13T13:35:26Z) - Learning Dynamic Compact Memory Embedding for Deformable Visual Object
Tracking [82.34356879078955]
We propose a compact memory embedding to enhance the discrimination of the segmentation-based deformable visual tracking method.
Our method outperforms the excellent segmentation-based trackers, i.e., D3S and SiamMask on DAVIS 2017 benchmark.
arXiv Detail & Related papers (2021-11-23T03:07:12Z) - Clicking Matters:Towards Interactive Human Parsing [60.35351491254932]
This work is the first attempt to tackle the human parsing task under the interactive setting.
Our IHP solution achieves 85% mIoU on the benchmark LIP, 80% mIoU on PASCAL-Person-Part and CIHP, 75% mIoU on Helen with only 1.95, 3.02, 2.84 and 1.09 clicks per class respectively.
arXiv Detail & Related papers (2021-11-11T11:47:53Z) - Interactive Object Segmentation with Dynamic Click Transform [27.709779682559883]
We propose a Dynamic Click Transform Network(DCT-Net), consisting of Spatial-DCT and Feature-DCT, to better represent user interactions.
We demonstrate the effectiveness of our proposed method and achieve favorable performance compared to the state-of-the-art on three standard benchmark datasets.
arXiv Detail & Related papers (2021-06-19T10:13:37Z) - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation.
We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths.
In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z) - Joint Object Contour Points and Semantics for Instance Segmentation [1.2117737635879038]
We propose Mask Point R-CNN aiming at promoting the neural network's attention to the object boundary.
Specifically, we innovatively extend the original human keypoint detection task to the contour point detection of any object.
As a consequence, the model will be more sensitive to the edges of the object and can capture more geometric features.
arXiv Detail & Related papers (2020-08-02T11:11:28Z) - FAIRS -- Soft Focus Generator and Attention for Robust Object
Segmentation from Extreme Points [70.65563691392987]
We present a new approach to generate object segmentation from user inputs in the form of extreme points and corrective clicks.
We demonstrate our method's ability to generate high-quality training data as well as its scalability in incorporating extreme points, guiding clicks, and corrective clicks in a principled manner.
arXiv Detail & Related papers (2020-04-04T22:25:47Z) - Mining Implicit Entity Preference from User-Item Interaction Data for
Knowledge Graph Completion via Adversarial Learning [82.46332224556257]
We propose a novel adversarial learning approach by leveraging user interaction data for the Knowledge Graph Completion task.
Our generator is isolated from user interaction data, and serves to improve the performance of the discriminator.
To discover implicit entity preference of users, we design an elaborate collaborative learning algorithms based on graph neural networks.
arXiv Detail & Related papers (2020-03-28T05:47:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.