360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan
Estimation
- URL: http://arxiv.org/abs/2112.06180v1
- Date: Sun, 12 Dec 2021 08:36:41 GMT
- Title: 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan
Estimation
- Authors: Bolivar Solarte, Yueh-Cheng Liu, Chin-Hsuan Wu, Yi-Hsuan Tsai, Min Sun
- Abstract summary: We present 360-DFPE, a sequential floor plan estimation method that directly takes 360-images as input without relying on active sensors or 3D information.
Our results show that our monocular solution achieves favorable performance against the current state-of-the-art algorithms.
- Score: 43.56963653723287
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present 360-DFPE, a sequential floor plan estimation method that directly
takes 360-images as input without relying on active sensors or 3D information.
Our approach leverages a loosely coupled integration between a monocular visual
SLAM solution and a monocular 360-room layout approach, which estimate camera
poses and layout geometries, respectively. Since our task is to sequentially
capture the floor plan using monocular images, the entire scene structure, room
instances, and room shapes are unknown. To tackle these challenges, we first
handle the scale difference between visual odometry and layout geometry via
formulating an entropy minimization process, which enables us to directly align
360-layouts without knowing the entire scene in advance. Second, to
sequentially identify individual rooms, we propose a novel room identification
algorithm that tracks every room along the camera exploration using geometry
information. Lastly, to estimate the final shape of the room, we propose a
shortest path algorithm with an iterative coarse-to-fine strategy, which
improves prior formulations with higher accuracy and faster run-time. Moreover,
we collect a new floor plan dataset with challenging large-scale scenes,
providing both point clouds and sequential 360-image information. Experimental
results show that our monocular solution achieves favorable performance against
the current state-of-the-art algorithms that rely on active sensors and require
the entire scene reconstruction data in advance. Our code and dataset will be
released soon.
Related papers
- Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning [61.14132533712537]
We propose MAL-SPC, a framework that effectively leverages both object-level and category-specific geometric similarities to complete missing structures.
Our MAL-SPC does not require any 3D complete supervision and only necessitates a single partial point cloud for each object.
arXiv Detail & Related papers (2024-07-13T06:53:39Z) - Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors [51.36238367193988]
We tackle sparse-view reconstruction of a 360 3D scene using priors from latent diffusion models (LDM)
We present SparseSplat360, a method that employs a cascade of in-painting and artifact removal models to fill in missing details and clean novel views.
Our method generates entire 360 scenes from as few as 9 input views, with a high degree of foreground and background detail.
arXiv Detail & Related papers (2024-05-26T11:01:39Z) - 360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception [56.84921040837699]
Existing panoramic layout estimation solutions tend to recover room boundaries from a vertically compressed sequence, yielding imprecise results.
We propose an orthogonal plane disentanglement network (termed DOPNet) to distinguish ambiguous semantics.
We also present an unsupervised adaptation technique tailored for horizon-depth and ratio representations.
Our solution outperforms other SoTA models on both monocular layout estimation and multi-view layout estimation tasks.
arXiv Detail & Related papers (2023-12-26T12:16:03Z) - Monocular Spherical Depth Estimation with Explicitly Connected Weak
Layout Cues [27.15511982413305]
We generate a geometric vision (360V) dataset that includes multiple modalities, multi-view stereo data and automatically generated weak layout cues.
We rely on depth-based layout reconstruction and layout-based depth attention, demonstrating increased performance across both tasks.
By using single 360 cameras to scan rooms, the opportunity for facile and quick building-scale 3D scanning arises.
arXiv Detail & Related papers (2022-06-22T20:10:45Z) - Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo [71.59494156155309]
Existing approaches for multi-view 3D pose estimation explicitly establish cross-view correspondences to group 2D pose detections from multiple camera views.
We present our multi-view 3D pose estimation approach based on plane sweep stereo to jointly address the cross-view fusion and 3D pose reconstruction in a single shot.
arXiv Detail & Related papers (2021-04-06T03:49:35Z) - LED2-Net: Monocular 360 Layout Estimation via Differentiable Depth
Rendering [59.63979143021241]
We formulate the task of 360 layout estimation as a problem of predicting depth on the horizon line of a panorama.
We propose the Differentiable Depth Rendering procedure to make the conversion from layout to depth prediction differentiable.
Our method achieves state-of-the-art performance on numerous 360 layout benchmark datasets.
arXiv Detail & Related papers (2021-04-01T15:48:41Z) - Scan2Plan: Efficient Floorplan Generation from 3D Scans of Indoor Scenes [9.71137838903781]
Scan2Plan is a novel approach for accurate estimation of a floorplan from a 3D scan of the structural elements of indoor environments.
The proposed method incorporates a two-stage approach where the initial stage clusters an unordered point cloud representation of the scene.
The subsequent stage estimates a closed perimeter, parameterized by a simple polygon, for each individual room.
The final floorplan is simply an assembly of all such room perimeters in the global co-ordinate system.
arXiv Detail & Related papers (2020-03-16T17:59:41Z) - A Fixation-based 360{\deg} Benchmark Dataset for Salient Object
Detection [21.314578493964333]
Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications.
salient object detection (SOD) has been seldom explored in 360deg images due to the lack of datasets representative of real scenes.
arXiv Detail & Related papers (2020-01-22T11:16:39Z) - General 3D Room Layout from a Single View by Render-and-Compare [36.94817376590415]
We present a novel method to reconstruct the 3D layout of a room from a single perspective view.
Our dataset consists of 293 images from ScanNet, which we annotated with precise 3D layouts.
arXiv Detail & Related papers (2020-01-07T16:14:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.