FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol
Spotting
- URL: http://arxiv.org/abs/2105.07147v1
- Date: Sat, 15 May 2021 06:01:11 GMT
- Title: FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol
Spotting
- Authors: Zhiwen Fan, Lingjie Zhu, Honghua Li, Xiaohao Chen, Siyu Zhu, Ping Tan
- Abstract summary: We present FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over 10,000 floor plans.
We propose a novel method by combining Graph Convolutional Networks (GCNs) with Convolutional Neural Networks (CNNs)
The proposed CNN-GCN method achieved state-of-the-art (SOTA) performance on the task of semantic symbol spotting.
- Score: 38.987494792258694
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Access to large and diverse computer-aided design (CAD) drawings is critical
for developing symbol spotting algorithms. In this paper, we present
FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over
10,000 floor plans, ranging from residential to commercial buildings. CAD
drawings in the dataset are all represented as vector graphics, which enable us
to provide line-grained annotations of 30 object categories. Equipped by such
annotations, we introduce the task of panoptic symbol spotting, which requires
to spot not only instances of countable things, but also the semantic of
uncountable stuff. Aiming to solve this task, we propose a novel method by
combining Graph Convolutional Networks (GCNs) with Convolutional Neural
Networks (CNNs), which captures both non-Euclidean and Euclidean features and
can be trained end-to-end. The proposed CNN-GCN method achieved
state-of-the-art (SOTA) performance on the task of semantic symbol spotting,
and help us build a baseline network for the panoptic symbol spotting task. Our
contributions are three-fold: 1) to the best of our knowledge, the presented
CAD drawing dataset is the first of its kind; 2) the panoptic symbol spotting
task considers the spotting of both thing instances and stuff semantic as one
recognition problem; and 3) we presented a baseline solution to the panoptic
symbol spotting task based on a novel CNN-GCN method, which achieved SOTA
performance on semantic symbol spotting. We believe that these contributions
will boost research in related areas.
Related papers
- Multiview Scene Graph [7.460438046915524]
A proper scene representation is central to the pursuit of spatial intelligence.
We propose to build Multiview Scene Graphs (MSG) from unposed images.
MSG represents a scene topologically with interconnected place and object nodes.
arXiv Detail & Related papers (2024-10-15T02:04:05Z) - Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images [1.5736099356327244]
We propose to label and spot symbols from CAD images that are converted from CAD drawings.
The advantage of spotting symbols from CAD images lies in the low requirement of labelers and the low-cost annotation.
Based on the keypoints detection, we propose a symbol grouping method to redraw the rectangle symbols in CAD images.
arXiv Detail & Related papers (2024-04-17T01:35:52Z) - Symbol as Points: Panoptic Symbol Spotting via Point-based
Representation [18.61469313164712]
This work studies the problem of panoptic symbol spotting in computer-aided design (CAD) drawings.
We take a different approach, which treats graphic primitives as a set of 2D points that are locally connected.
Specifically, we utilize a point transformer to extract the primitive features and append a mask2former-like spotting head to predict the final output.
arXiv Detail & Related papers (2024-01-19T08:44:52Z) - SGAligner : 3D Scene Alignment with Scene Graphs [84.01002998166145]
Building 3D scene graphs has emerged as a topic in scene representation for several embodied AI applications.
We focus on the fundamental problem of aligning pairs of 3D scene graphs whose overlap can range from zero to partial.
We propose SGAligner, the first method for aligning pairs of 3D scene graphs that is robust to in-the-wild scenarios.
arXiv Detail & Related papers (2023-04-28T14:39:22Z) - GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD
Drawings [0.0]
Spotting graphical symbols from the computer-aided design (CAD) drawings is essential to many industrial applications.
By treating each CAD drawing as a graph, we propose a novel graph attention network GAT-CADNet.
The proposed GAT-CADNet is intuitive yet effective and manages to solve the panoptic symbol spotting problem in one consolidated network.
arXiv Detail & Related papers (2022-01-03T13:08:28Z) - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical
Understanding of Outdoor Scene [76.4183572058063]
We present a richly-annotated 3D point cloud dataset for multiple outdoor scene understanding tasks.
The dataset has been point-wisely annotated with both hierarchical and instance-based labels.
We formulate a hierarchical learning problem for 3D point cloud segmentation and propose a measurement evaluating consistency across various hierarchies.
arXiv Detail & Related papers (2020-08-11T19:10:32Z) - Symbol Spotting on Digital Architectural Floor Plans Using a Deep
Learning-based Framework [76.70609932823149]
This paper focuses on symbol spotting on real-world digital architectural floor plans with a deep learning (DL)-based framework.
We propose a training strategy based on tiles, avoiding many issues particular to DL-based object detection networks.
Experiments on real-world floor plans demonstrate that our method successfully detects architectural symbols with low intra-class similarity and of variable graphical complexity.
arXiv Detail & Related papers (2020-06-01T03:16:05Z) - 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure
Prior [50.73148041205675]
The goal of the Semantic Scene Completion (SSC) task is to simultaneously predict a completed 3D voxel representation of volumetric occupancy and semantic labels of objects in the scene from a single-view observation.
We propose to devise a new geometry-based strategy to embed depth information with low-resolution voxel representation.
Our proposed geometric embedding works better than the depth feature learning from habitual SSC frameworks.
arXiv Detail & Related papers (2020-03-31T09:33:46Z) - Cross-GCN: Enhancing Graph Convolutional Network with $k$-Order Feature
Interactions [153.6357310444093]
Graph Convolutional Network (GCN) is an emerging technique that performs learning and reasoning on graph data.
We argue that existing designs of GCN forgo modeling cross features, making GCN less effective for tasks or data where cross features are important.
We design a new operator named Cross-feature Graph Convolution, which explicitly models the arbitrary-order cross features with complexity linear to feature dimension and order size.
arXiv Detail & Related papers (2020-03-05T13:05:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.