Symbol Spotting on Digital Architectural Floor Plans Using a Deep
Learning-based Framework
- URL: http://arxiv.org/abs/2006.00684v1
- Date: Mon, 1 Jun 2020 03:16:05 GMT
- Title: Symbol Spotting on Digital Architectural Floor Plans Using a Deep
Learning-based Framework
- Authors: Alireza Rezvanifar, Melissa Cote, Alexandra Branzan Albu
- Abstract summary: This paper focuses on symbol spotting on real-world digital architectural floor plans with a deep learning (DL)-based framework.
We propose a training strategy based on tiles, avoiding many issues particular to DL-based object detection networks.
Experiments on real-world floor plans demonstrate that our method successfully detects architectural symbols with low intra-class similarity and of variable graphical complexity.
- Score: 76.70609932823149
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This papers focuses on symbol spotting on real-world digital architectural
floor plans with a deep learning (DL)-based framework. Traditional on-the-fly
symbol spotting methods are unable to address the semantic challenge of
graphical notation variability, i.e. low intra-class symbol similarity, an
issue that is particularly important in architectural floor plan analysis. The
presence of occlusion and clutter, characteristic of real-world plans, along
with a varying graphical symbol complexity from almost trivial to highly
complex, also pose challenges to existing spotting methods. In this paper, we
address all of the above issues by leveraging recent advances in DL and
adapting an object detection framework based on the You-Only-Look-Once (YOLO)
architecture. We propose a training strategy based on tiles, avoiding many
issues particular to DL-based object detection networks related to the relative
small size of symbols compared to entire floor plans, aspect ratios, and data
augmentation. Experiments on real-world floor plans demonstrate that our method
successfully detects architectural symbols with low intra-class similarity and
of variable graphical complexity, even in the presence of heavy occlusion and
clutter. Additional experiments on the public SESYD dataset confirm that our
proposed approach can deal with various degradation and noise levels and
outperforms other symbol spotting methods.
Related papers
- ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection [70.11264880907652]
Recent object (COD) attempts to segment objects visually blended into their surroundings, which is extremely complex and difficult in real-world scenarios.
We propose an effective unified collaborative pyramid network that mimics human behavior when observing vague images and camouflaged zooming in and out.
Our framework consistently outperforms existing state-of-the-art methods in image and video COD benchmarks.
arXiv Detail & Related papers (2023-10-31T06:11:23Z) - Symbolic Visual Reinforcement Learning: A Scalable Framework with
Object-Level Abstraction and Differentiable Expression Search [63.3745291252038]
We propose DiffSES, a novel symbolic learning approach that discovers discrete symbolic policies.
By using object-level abstractions instead of raw pixel-level inputs, DiffSES is able to leverage the simplicity and scalability advantages of symbolic expressions.
Our experiments demonstrate that DiffSES is able to generate symbolic policies that are simpler and more scalable than state-of-the-art symbolic RL methods.
arXiv Detail & Related papers (2022-12-30T17:50:54Z) - Spatial Reasoning for Few-Shot Object Detection [21.3564383157159]
We propose a spatial reasoning framework that detects novel objects with only a few training examples in a context.
We employ a graph convolutional network as the RoIs and their relatedness are defined as nodes and edges, respectively.
We demonstrate that the proposed method significantly outperforms the state-of-the-art methods and verify its efficacy through extensive ablation studies.
arXiv Detail & Related papers (2022-11-02T12:38:08Z) - FuNNscope: Visual microscope for interactively exploring the loss
landscape of fully connected neural networks [77.34726150561087]
We show how to explore high-dimensional landscape characteristics of neural networks.
We generalize observations on small neural networks to more complex systems.
An interactive dashboard opens up a number of possible application networks.
arXiv Detail & Related papers (2022-04-09T16:41:53Z) - PatchMVSNet: Patch-wise Unsupervised Multi-View Stereo for
Weakly-Textured Surface Reconstruction [2.9896482273918434]
This paper proposes robust loss functions leveraging constraints beneath multi-view images to alleviate matching ambiguity.
Our strategy can be implemented with arbitrary depth estimation frameworks and can be trained with arbitrary large-scale MVS datasets.
Our method reaches the performance of the state-of-the-art methods on popular benchmarks, like DTU, Tanks and Temples and ETH3D.
arXiv Detail & Related papers (2022-03-04T07:05:23Z) - FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol
Spotting [38.987494792258694]
We present FloorPlanCAD, a large-scale real-world CAD drawing dataset containing over 10,000 floor plans.
We propose a novel method by combining Graph Convolutional Networks (GCNs) with Convolutional Neural Networks (CNNs)
The proposed CNN-GCN method achieved state-of-the-art (SOTA) performance on the task of semantic symbol spotting.
arXiv Detail & Related papers (2021-05-15T06:01:11Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Spatial-spectral Hyperspectral Image Classification via Multiple Random
Anchor Graphs Ensemble Learning [88.60285937702304]
This paper proposes a novel spatial-spectral HSI classification method via multiple random anchor graphs ensemble learning (RAGE)
Firstly, the local binary pattern is adopted to extract the more descriptive features on each selected band, which preserves local structures and subtle changes of a region.
Secondly, the adaptive neighbors assignment is introduced in the construction of anchor graph, to reduce the computational complexity.
arXiv Detail & Related papers (2021-03-25T09:31:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.