Related papers: Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection

Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection

URL: http://arxiv.org/abs/2503.09187v1
Date: Wed, 12 Mar 2025 09:29:10 GMT
Title: Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection
Authors: Qipeng Mei, Dimitri Bulatov, Dorota Iwaszczuk,
Abstract summary: This study presents a novel approach for roof detail extraction and vectorization using remote sensing images.<n>We adapt the YOLOv8 OBB model, originally designed for rotated object detection, to extract roof edges effectively.<n> Experiments conducted on the Melville and Hausdorff datasets highlight the method's effectiveness.
Score: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: This study presents a novel approach for roof detail extraction and vectorization using remote sensing images. Unlike previous geometric-primitive-based methods that rely on the detection of corners, our method focuses on edge detection as the primary mechanism for roof reconstruction, while utilizing geometric relationships to define corners and faces. We adapt the YOLOv8 OBB model, originally designed for rotated object detection, to extract roof edges effectively. Our method demonstrates robustness against noise and occlusion, leading to precise vectorized representations of building roofs. Experiments conducted on the SGA and Melville datasets highlight the method's effectiveness. At the raster level, our model outperforms the state-of-the-art foundation segmentation model (SAM), achieving a mIoU between 0.85 and 1 for most samples and an ovIoU close to 0.97. At the vector level, evaluation using the Hausdorff distance, PolyS metric, and our raster-vector-metric demonstrates significant improvements after polygonization, with a close approximation to the reference data. The method successfully handles diverse roof structures and refines edge gaps, even on complex roof structures of new, excluded from training datasets. Our findings underscore the potential of this approach to address challenges in automatic roof structure vectorization, supporting various applications such as urban terrain reconstruction.

Related papers

LDPoly: Latent Diffusion for Polygonal Road Outline Extraction in Large-Scale Topographic Mapping [5.093758132026397]
We introduce LDPoly, the first framework for extracting polygonal road outlines from high-resolution aerial images. We evaluate LDPoly on a new benchmark dataset, Map2ImLas, which contains detailed polygonal annotations for various topographic objects in several Dutch regions.
arXiv Detail & Related papers (2025-04-29T11:13:33Z)
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers [59.0181939916084]
Traditional 3D networks mainly focus on local geometric details and ignore the topological structure between local geometries. We propose a novel Priors Distillation (RPD) method to extract priors from the well-trained transformers on massive images. Experiments on the PointDA-10 and the Sim-to-Real datasets verify that the proposed method consistently achieves the state-of-the-art performance of UDA for point cloud classification.
arXiv Detail & Related papers (2024-07-26T06:29:09Z)
Enhancing Polygonal Building Segmentation via Oriented Corners [0.3749861135832072]
This paper introduces a novel deep convolutional neural network named OriCornerNet, which directly extracts delineated building polygons from input images. Our approach involves a deep model that predicts building footprint masks, corners, and orientation vectors that indicate directions toward adjacent corners. Performance evaluations conducted on SpaceNet Vegas and CrowdAI-small datasets demonstrate the competitive efficacy of our approach.
arXiv Detail & Related papers (2024-07-17T01:59:06Z)
DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection. It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor. Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z)
CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction [20.04081992616026]
We present an enhanced corner representation method for structured reconstruction. It better reconstructs fine-grained structures, such as adjacent corners and tiny edges. It outperforms the state-of-the-art model by +1.9%@F-1 on Corner and +3.0%@F-1 on Edge.
arXiv Detail & Related papers (2023-04-14T11:51:26Z)
Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery [68.3565370706598]
We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions.
arXiv Detail & Related papers (2023-03-20T15:47:05Z)
BuildMapper: A Fully Learnable Framework for Vectorized Building Contour Extraction [3.862461804734488]
We propose the first end-to-end learnable building contour extraction framework, named BuildMapper. BuildMapper can directly and efficiently delineate building polygons just as a human does. We show that BuildMapper can achieve a state-of-the-art performance, with a higher mask average precision (AP) and boundary AP than both segmentation-based and contour-based methods.
arXiv Detail & Related papers (2022-11-07T08:58:35Z)
Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects. We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection. We propose to model the rotated objects as Gaussian distributions. We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z)
6DOF Pose Estimation of a 3D Rigid Object based on Edge-enhanced Point Pair Features [20.33119373900788]
We propose an efficient 6D pose estimation method based on the point pair feature (PPF) framework. A pose hypothesis validation approach is proposed to resolve the symmetric ambiguity by calculating edge matching degree.
arXiv Detail & Related papers (2022-09-17T07:05:50Z)
Quantization in Relative Gradient Angle Domain For Building Polygon Estimation [88.80146152060888]
CNN approaches often generate imprecise building morphologies including noisy edges and round corners. We propose a module that uses prior knowledge of building corners to create angular and concise building polygons from CNN segmentation outputs. Experimental results demonstrate that our method refines CNN output from a rounded approximation to a more clear-cut angular shape of the building footprint.
arXiv Detail & Related papers (2020-07-10T21:33:06Z)
Refined Plane Segmentation for Cuboid-Shaped Objects by Leveraging Edge Detection [63.942632088208505]
We propose a post-processing algorithm to align the segmented plane masks with edges detected in the image. This allows us to increase the accuracy of state-of-the-art approaches, while limiting ourselves to cuboid-shaped objects.
arXiv Detail & Related papers (2020-03-28T18:51:43Z)

This list is automatically generated from the titles and abstracts of the papers in this site.