Learning RGB-D Feature Embeddings for Unseen Object Instance
Segmentation
- URL: http://arxiv.org/abs/2007.15157v3
- Date: Wed, 3 Mar 2021 10:45:42 GMT
- Title: Learning RGB-D Feature Embeddings for Unseen Object Instance
Segmentation
- Authors: Yu Xiang, Christopher Xie, Arsalan Mousavian, Dieter Fox
- Abstract summary: We propose a new method for unseen object instance segmentation by learning RGB-D feature embeddings from synthetic data.
A metric learning loss function is utilized to learn to produce pixel-wise feature embeddings.
We further improve the segmentation accuracy with a new two-stage clustering algorithm.
- Score: 67.88276573341734
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Segmenting unseen objects in cluttered scenes is an important skill that
robots need to acquire in order to perform tasks in new environments. In this
work, we propose a new method for unseen object instance segmentation by
learning RGB-D feature embeddings from synthetic data. A metric learning loss
function is utilized to learn to produce pixel-wise feature embeddings such
that pixels from the same object are close to each other and pixels from
different objects are separated in the embedding space. With the learned
feature embeddings, a mean shift clustering algorithm can be applied to
discover and segment unseen objects. We further improve the segmentation
accuracy with a new two-stage clustering algorithm. Our method demonstrates
that non-photorealistic synthetic RGB and depth images can be used to learn
feature embeddings that transfer well to real-world images for unseen object
instance segmentation.
Related papers
- Learning-based Relational Object Matching Across Views [63.63338392484501]
We propose a learning-based approach which combines local keypoints with novel object-level features for matching object detections between RGB images.
We train our object-level matching features based on appearance and inter-frame and cross-frame spatial relations between objects in an associative graph neural network.
arXiv Detail & Related papers (2023-05-03T19:36:51Z) - Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D
Videos [11.40098981859033]
This work proposes a self-supervised learning system for segmenting rigid objects in RGB images.
The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot.
arXiv Detail & Related papers (2023-04-09T23:13:39Z) - SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor
Environments [67.34330257205525]
In this work, we explore zero-shot instance segmentation (ZSIS) from RGB-D data to identify unseen objects in a semantic category-agnostic manner.
We present a method that uses annotated objects to learn the objectness'' of pixels and generalize to unseen object categories in cluttered indoor environments.
arXiv Detail & Related papers (2022-12-22T17:59:48Z) - CASAPose: Class-Adaptive and Semantic-Aware Multi-Object Pose Estimation [2.861848675707602]
We present a new single-stage architecture called CASAPose.
It determines 2D-3D correspondences for pose estimation of multiple different objects in RGB images in one pass.
It is fast and memory efficient, and achieves high accuracy for multiple objects.
arXiv Detail & Related papers (2022-10-11T10:20:01Z) - Self-Supervised Visual Representation Learning with Semantic Grouping [50.14703605659837]
We tackle the problem of learning visual representations from unlabeled scene-centric data.
We propose contrastive learning from data-driven semantic slots, namely SlotCon, for joint semantic grouping and representation learning.
arXiv Detail & Related papers (2022-05-30T17:50:59Z) - Exploring Cross-Image Pixel Contrast for Semantic Segmentation [130.22216825377618]
We propose a pixel-wise contrastive framework for semantic segmentation in the fully supervised setting.
The core idea is to enforce pixel embeddings belonging to a same semantic class to be more similar than embeddings from different classes.
Our method can be effortlessly incorporated into existing segmentation frameworks without extra overhead during testing.
arXiv Detail & Related papers (2021-01-28T11:35:32Z) - Unseen Object Instance Segmentation for Robotic Environments [67.88276573341734]
We propose a method to segment unseen object instances in tabletop environments.
UOIS-Net is comprised of two stages: first, it operates only on depth to produce object instance center votes in 2D or 3D.
Surprisingly, our framework is able to learn from synthetic RGB-D data where the RGB is non-photorealistic.
arXiv Detail & Related papers (2020-07-16T01:59:13Z) - Reinforced Coloring for End-to-End Instance Segmentation [10.73460247817528]
We propose a novel iterative deep reinforcement learning agent that learns how to differentiate multiple objects in parallel.
Our reward function for the trainable agent is designed to favor grouping pixels belonging to the same object using a graph coloring algorithm.
arXiv Detail & Related papers (2020-05-14T15:15:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.