Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications
- URL: http://arxiv.org/abs/2501.11097v1
- Date: Sun, 19 Jan 2025 16:17:20 GMT
- Title: Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications
- Authors: Huichao Zhang, Pengyu Wang, Manyi Li, Zuojun Li, Yaguang Wu,
- Abstract summary: Unit Region.
of floorplans is a unified and compact geometry-aware encoding representation for various applications.
Our representation can be flexibly adapted to different applications with the sliced unit regions.
- Score: 6.019073491990928
- License:
- Abstract: We present the Unit Region Encoding of floorplans, which is a unified and compact geometry-aware encoding representation for various applications, ranging from interior space planning, floorplan metric learning to floorplan generation tasks. The floorplans are represented as the latent encodings on a set of boundary-adaptive unit region partition based on the clustering of the proposed geometry-aware density map. The latent encodings are extracted by a trained network (URE-Net) from the input dense density map and other available semantic maps. Compared to the over-segmented rasterized images and the room-level graph structures, our representation can be flexibly adapted to different applications with the sliced unit regions while achieving higher accuracy performance and better visual quality. We conduct a variety of experiments and compare to the state-of-the-art methods on the aforementioned applications to validate the superiority of our representation, as well as extensive ablation studies to demonstrate the effect of our slicing choices.
Related papers
- LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes [3.482371041476053]
This paper presents an end-to-end deep graph message-passing network, LMSeg, designed to efficiently and accurately perform semantic segmentation on large-scale 3D landscape meshes.
The hierarchical and local pooling of the barycentric graph, along with the effective geometry aggregation modules, enable fast inference and accurate segmentation of small-sized and irregular mesh objects.
arXiv Detail & Related papers (2024-07-05T07:55:06Z) - Mapping High-level Semantic Regions in Indoor Environments without
Object Recognition [50.624970503498226]
The present work proposes a method for semantic region mapping via embodied navigation in indoor environments.
To enable region identification, the method uses a vision-to-language model to provide scene information for mapping.
By projecting egocentric scene understanding into the global frame, the proposed method generates a semantic map as a distribution over possible region labels at each location.
arXiv Detail & Related papers (2024-03-11T18:09:50Z) - RegionGPT: Towards Region Understanding Vision Language Model [88.42271128373191]
RegionGPT (short as RGPT) is a novel framework designed for complex region-level captioning and understanding.
We develop an automated region caption data generation pipeline, enriching the training set with detailed region-level captions.
We demonstrate that a universal RGPT model can be effectively applied and significantly enhancing performance across a range of region-level tasks.
arXiv Detail & Related papers (2024-03-04T18:58:08Z) - Jointly Learning Representations for Map Entities via Heterogeneous
Graph Contrastive Learning [38.415692986360995]
We propose a novel method named HOME-GCL for learning representations of multiple categories of map entities.
Our approach utilizes a heterogeneous map entity graph (HOME graph) that integrates both road segments and land parcels into a unified framework.
To the best of our knowledge, HOME-GCL is the first attempt to jointly learn representations for road segments and land parcels using a unified model.
arXiv Detail & Related papers (2024-02-09T01:47:18Z) - LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts [107.11267074981905]
We propose a semantically controllable layout-AWare diffusion model, termed LAW-Diffusion.
We show that LAW-Diffusion yields the state-of-the-art generative performance, especially with coherent object relations.
arXiv Detail & Related papers (2023-08-13T08:06:18Z) - R-MAE: Regions Meet Masked Autoencoders [113.73147144125385]
We explore regions as a potential visual analogue of words for self-supervised image representation learning.
Inspired by Masked Autoencoding (MAE), a generative pre-training baseline, we propose masked region autoencoding to learn from groups of pixels or regions.
arXiv Detail & Related papers (2023-06-08T17:56:46Z) - Cross-view Geo-localization via Learning Disentangled Geometric Layout
Correspondence [11.823147814005411]
Cross-view geo-localization aims to estimate the location of a query ground image by matching it to a reference geo-tagged aerial images database.
Recent works achieve outstanding progress on cross-view geo-localization benchmarks.
However, existing methods still suffer from poor performance on the cross-area benchmarks.
arXiv Detail & Related papers (2022-12-08T04:54:01Z) - LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
Propagation in Transformers [60.51925353387151]
We propose a novel module named Local Context Propagation (LCP) to exploit the message passing between neighboring local regions.
We use the overlap points of adjacent local regions as intermediaries, then re-weight the features of these shared points from different local regions before passing them to the next layers.
The proposed method is applicable to different tasks and outperforms various transformer-based methods in benchmarks including 3D shape classification and dense prediction tasks.
arXiv Detail & Related papers (2022-10-23T15:43:01Z) - Learning Implicit Feature Alignment Function for Semantic Segmentation [51.36809814890326]
Implicit Feature Alignment function (IFA) is inspired by the rapidly expanding topic of implicit neural representations.
We show that IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.
Our method can be combined with improvement on various architectures, and it achieves state-of-the-art accuracy trade-off on common benchmarks.
arXiv Detail & Related papers (2022-06-17T09:40:14Z) - PSSNet: Planarity-sensible Semantic Segmentation of Large-scale Urban
Meshes [3.058685580689605]
We introduce a novel deep learning-based framework to interpret 3D urban scenes represented as textured meshes.
Our framework achieves semantic segmentation in two steps: planarity-sensible over-segmentation followed by semantic classification.
Our approach outperforms the state-of-the-art methods in terms of boundary quality and mean IoU (intersection over union)
arXiv Detail & Related papers (2022-02-07T14:16:10Z) - Image Stitching Based on Planar Region Consensus [22.303750435673752]
We propose a new image stitching method which stitches images by allowing for the alignment of a set of matched dominant planar regions.
We use rich semantic information directly from RGB images to extract planar image regions with a deep Convolutional Neural Network (CNN)
Our method can deal with different situations and outperforms the state-of-the-arts on challenging scenes.
arXiv Detail & Related papers (2020-07-06T13:07:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.