Unsupervised Multi-View Object Segmentation Using Radiance Field
Propagation
- URL: http://arxiv.org/abs/2210.00489v1
- Date: Sun, 2 Oct 2022 11:14:23 GMT
- Title: Unsupervised Multi-View Object Segmentation Using Radiance Field
Propagation
- Authors: Xinhang Liu, Jiaben Chen, Huai Yu, Yu-Wing Tai, Chi-Keung Tang
- Abstract summary: We present a novel approach to segmenting objects in 3D during reconstruction given only unlabeled multi-view images of a scene.
The core of our method is a novel propagation strategy for individual objects' radiance fields with a bidirectional photometric loss.
To the best of our knowledge, RFP is the first unsupervised approach for tackling 3D scene object segmentation for neural radiance field (NeRF)
- Score: 55.9577535403381
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present radiance field propagation (RFP), a novel approach to segmenting
objects in 3D during reconstruction given only unlabeled multi-view images of a
scene. RFP is derived from emerging neural radiance field-based techniques,
which jointly encodes semantics with appearance and geometry. The core of our
method is a novel propagation strategy for individual objects' radiance fields
with a bidirectional photometric loss, enabling an unsupervised partitioning of
a scene into salient or meaningful regions corresponding to different object
instances. To better handle complex scenes with multiple objects and
occlusions, we further propose an iterative expectation-maximization algorithm
to refine object masks. To the best of our knowledge, RFP is the first
unsupervised approach for tackling 3D scene object segmentation for neural
radiance field (NeRF) without any supervision, annotations, or other cues such
as 3D bounding boxes and prior knowledge of object class. Experiments
demonstrate that RFP achieves feasible segmentation results that are more
accurate than previous unsupervised image/scene segmentation approaches, and
are comparable to existing supervised NeRF-based methods. The segmented object
representations enable individual 3D object editing operations.
Related papers
- Geometry Aware Field-to-field Transformations for 3D Semantic
Segmentation [48.307734886370014]
We present a novel approach to perform 3D semantic segmentation solely from 2D supervision by leveraging Neural Radiance Fields (NeRFs)
By extracting features along a surface point cloud, we achieve a compact representation of the scene which is sample-efficient and conducive to 3D reasoning.
arXiv Detail & Related papers (2023-10-08T11:48:19Z) - Implicit Ray-Transformers for Multi-view Remote Sensing Image
Segmentation [26.726658200149544]
We propose ''Implicit Ray-Transformer (IRT)'' based on Implicit Neural Representation (INR) for RS scene semantic segmentation with sparse labels.
The proposed method includes a two-stage learning process. In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene.
In the second stage, we design a Ray Transformer to leverage the relations between the neural field 3D features and 2D texture features for learning better semantic representations.
arXiv Detail & Related papers (2023-03-15T07:05:07Z) - DETR4D: Direct Multi-View 3D Object Detection with Sparse Attention [50.11672196146829]
3D object detection with surround-view images is an essential task for autonomous driving.
We propose DETR4D, a Transformer-based framework that explores sparse attention and direct feature query for 3D object detection in multi-view images.
arXiv Detail & Related papers (2022-12-15T14:18:47Z) - ONeRF: Unsupervised 3D Object Segmentation from Multiple Views [59.445957699136564]
ONeRF is a method that automatically segments and reconstructs object instances in 3D from multi-view RGB images without any additional manual annotations.
The segmented 3D objects are represented using separate Neural Radiance Fields (NeRFs) which allow for various 3D scene editing and novel view rendering.
arXiv Detail & Related papers (2022-11-22T06:19:37Z) - SegNeRF: 3D Part Segmentation with Neural Radiance Fields [63.12841224024818]
SegNeRF is a neural field representation that integrates a semantic field along with the usual radiance field.
SegNeRF is capable of simultaneously predicting geometry, appearance, and semantic information from posed images, even for unseen objects.
SegNeRF is able to generate an explicit 3D model from a single image of an object taken in the wild, with its corresponding part segmentation.
arXiv Detail & Related papers (2022-11-21T07:16:03Z) - Neural Volumetric Object Selection [126.04480613166194]
We introduce an approach for selecting objects in neural volumetric 3D representations, such as multi-plane images (MPI) and neural radiance fields (NeRF)
Our approach takes a set of foreground and background 2D user scribbles in one view and automatically estimates a 3D segmentation of the desired object, which can be rendered into novel views.
arXiv Detail & Related papers (2022-05-30T08:55:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.