Robust 3D Scene Segmentation through Hierarchical and Learnable
Part-Fusion
- URL: http://arxiv.org/abs/2111.08434v1
- Date: Tue, 16 Nov 2021 13:14:47 GMT
- Title: Robust 3D Scene Segmentation through Hierarchical and Learnable
Part-Fusion
- Authors: Anirud Thyagharajan, Benjamin Ummenhofer, Prashant Laddha, Om J Omer,
Sreenivas Subramoney
- Abstract summary: 3D semantic segmentation is a fundamental building block for several scene understanding applications such as autonomous driving, robotics and AR/VR.
Previous methods have utilized hierarchical, iterative methods to fuse semantic and instance information, but they lack learnability in context fusion.
This paper presents Segment-Fusion, a novel attention-based method for hierarchical fusion of semantic and instance information.
- Score: 9.275156524109438
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: 3D semantic segmentation is a fundamental building block for several scene
understanding applications such as autonomous driving, robotics and AR/VR.
Several state-of-the-art semantic segmentation models suffer from the part
misclassification problem, wherein parts of the same object are labelled
incorrectly. Previous methods have utilized hierarchical, iterative methods to
fuse semantic and instance information, but they lack learnability in context
fusion, and are computationally complex and heuristic driven. This paper
presents Segment-Fusion, a novel attention-based method for hierarchical fusion
of semantic and instance information to address the part misclassifications.
The presented method includes a graph segmentation algorithm for grouping
points into segments that pools point-wise features into segment-wise features,
a learnable attention-based network to fuse these segments based on their
semantic and instance features, and followed by a simple yet effective
connected component labelling algorithm to convert segment features to instance
labels. Segment-Fusion can be flexibly employed with any network architecture
for semantic/instance segmentation. It improves the qualitative and
quantitative performance of several semantic segmentation backbones by upto 5%
when evaluated on the ScanNet and S3DIS datasets.
Related papers
- Instance-aware 3D Semantic Segmentation powered by Shape Generators and
Classifiers [28.817905887080293]
We propose a novel instance-aware approach for 3D semantic segmentation.
Our method combines several geometry processing tasks supervised at instance-level to promote the consistency of the learned feature representation.
arXiv Detail & Related papers (2023-11-21T02:14:16Z) - Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level
3D Part Instance Segmentation [17.929866369256555]
We present a new method for 3D part instance segmentation.
Our method exploits semantic segmentation to fuse nonlocal instance features, such as center prediction.
Our method outperforms existing methods with a large-margin improvement in the PartNet benchmark.
arXiv Detail & Related papers (2022-08-09T13:22:55Z) - Open-world Semantic Segmentation via Contrasting and Clustering
Vision-Language Embedding [95.78002228538841]
We propose a new open-world semantic segmentation pipeline that makes the first attempt to learn to segment semantic objects of various open-world categories without any efforts on dense annotations.
Our method can directly segment objects of arbitrary categories, outperforming zero-shot segmentation methods that require data labeling on three benchmark datasets.
arXiv Detail & Related papers (2022-07-18T09:20:04Z) - SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation [94.11915008006483]
We propose SemAffiNet for point cloud semantic segmentation.
We conduct extensive experiments on the ScanNetV2 and NYUv2 datasets.
arXiv Detail & Related papers (2022-05-26T17:00:23Z) - Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings [81.09026586111811]
We propose an approach to semantic segmentation that achieves state-of-the-art supervised performance when applied in a zero-shot setting.
This is achieved by replacing each class label with a vector-valued embedding of a short paragraph that describes the class.
The resulting merged semantic segmentation dataset of over 2 Million images enables training a model that achieves performance equal to that of state-of-the-art supervised methods on 7 benchmark datasets.
arXiv Detail & Related papers (2022-02-04T07:19:09Z) - TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic
Segmentation [44.75300205362518]
Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations.
We propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios.
Our results show that our top-down unsupervised segmentation is robust to both object-centric and scene-centric datasets.
arXiv Detail & Related papers (2021-12-02T18:59:03Z) - 3D Compositional Zero-shot Learning with DeCompositional Consensus [102.7571947144639]
We argue that part knowledge should be composable beyond the observed object classes.
We present 3D Compositional Zero-shot Learning as a problem of part generalization from seen to unseen object classes.
arXiv Detail & Related papers (2021-11-29T16:34:53Z) - PointFlow: Flowing Semantics Through Points for Aerial Image
Segmentation [96.76882806139251]
We propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow.
Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features.
Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods.
arXiv Detail & Related papers (2021-03-11T09:42:32Z) - Learning Panoptic Segmentation from Instance Contours [9.347742071428918]
Panopticpixel aims to provide an understanding of background (stuff) and instances of objects (things) at a pixel level.
It combines the separate tasks of semantic segmentation (level classification) and instance segmentation to build a single unified scene understanding task.
We present a fully convolution neural network that learns instance segmentation from semantic segmentation and instance contours.
arXiv Detail & Related papers (2020-10-16T03:05:48Z) - Self-Prediction for Joint Instance and Semantic Segmentation of Point
Clouds [41.75579185647845]
We develop a novel learning scheme named Self-Prediction for 3D instance and semantic segmentation of point clouds.
Our method achieves state-of-the-art instance segmentation results on S3DIS and comparable semantic segmentation results on S3DIS and ShapeNet.
arXiv Detail & Related papers (2020-07-27T07:58:00Z) - Few-shot 3D Point Cloud Semantic Segmentation [138.80825169240302]
We propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method.
Our proposed method shows significant and consistent improvements compared to baselines in different few-shot point cloud semantic segmentation settings.
arXiv Detail & Related papers (2020-06-22T08:05:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.