Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived
Orthophoto And Digital Surface Model
- URL: http://arxiv.org/abs/2204.04139v1
- Date: Fri, 8 Apr 2022 15:49:35 GMT
- Title: Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived
Orthophoto And Digital Surface Model
- Authors: Shengxi Gui, Rongjun Qin, Yang Tang
- Abstract summary: We describe an open-source tool, called SAT2LOD2, built on a minorly modified version of our recently published work.
SAT2LoD2 is a fully open-source and GUI (Graphics User Interface) based software, coded in Python.
- Score: 7.219077740523683
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deriving LoD2 models from orthophoto and digital surface models (DSM)
reconstructed from satellite images is a challenging task. Existing solutions
are mostly system approaches that require complicated step-wise processes,
including not only heuristic geometric operations, but also high-level steps
such as machine learning-based semantic segmentation and building detection.
Here in this paper, we describe an open-source tool, called SAT2LOD2, built
based on a minorly modified version of our recently published work. SAT2LoD2 is
a fully open-source and GUI (Graphics User Interface) based software, coded in
Python, which takes an orthophoto and DSM as inputs, and outputs individual
building models, and it can additionally take road network shapefiles, and
customized classification maps to further improve the reconstruction results.
We further improve the robustness of the method by 1) intergrading building
segmentation based on HRNetV2 into our software; and 2) having implemented a
decision strategy to identify complex buildings and directly generate mesh to
avoid erroneous LoD2 reconstruction from a system point of view. The software
can process a moderate level of data (around 5000*5000 size of orthophoto and
DSM) using a PC with a graphics card supporting CUDA. Furthermore, the GUI is
self-contained and stores the intermediate processing results facilitating
researchers to learn the process easily and reuse intermediate files as needed.
The updated codes and software are available under this GitHub page:
https://github.com/GDAOSU/LOD2BuildingModel.
Related papers
- Exploring Diffusion Transformer Designs via Grafting [82.91123758506876]
We present grafting, a simple approach for editing pretrained diffusion transformers (DiTs) to materialize new architectures under small compute budgets.<n>We show that new diffusion model designs can be explored by grafting pretrained DiTs, with edits ranging from operator replacement to architecture restructuring.
arXiv Detail & Related papers (2025-06-05T17:59:40Z) - A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios [1.2093553114715083]
A-ScoRe is an Attention-based model which leverage attention on descriptor map level to produce meaningful and high-semantic 2D descriptors.
Results show our methods achieve comparable performance with State-of-the-art methods on multiple benchmark while being light-weighted and much more flexible.
arXiv Detail & Related papers (2025-03-18T07:39:50Z) - Flemme: A Flexible and Modular Learning Platform for Medical Images [5.086862917025204]
Flemme is a FLExible and Modular learning platform for MEdical images.
We construct encoders using building blocks based on convolution, transformer, and state-space model (SSM) to process both 2D and 3D image patches.
arXiv Detail & Related papers (2024-08-18T05:47:33Z) - Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans [1.0436971860292366]
We propose two novel pixel-wise segmentation methods based on the MDA-Unet and MACU-Net architectures.
The proposed methods are compared with two other state-of-the-art techniques and several benchmark datasets.
On the commonly used CubiCasa benchmark dataset, our methods have achieved the mean F1 score of 0.86 over five examined classes.
arXiv Detail & Related papers (2024-08-02T18:36:45Z) - Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain [48.440691680864745]
We introduce a new neural network architecture, termed LoGoNet, with a tailored self-supervised learning (SSL) method.
LoGoNet integrates a novel feature extractor within a U-shaped architecture, leveraging Large Kernel Attention (LKA) and a dual encoding strategy.
We propose a novel SSL method tailored for 3D images to compensate for the lack of large labeled datasets.
arXiv Detail & Related papers (2024-02-09T05:06:58Z) - VMamba: Visual State Space Model [92.83984290020891]
VMamba is a vision backbone that works in linear time complexity.
At the core of VMamba lies a stack of Visual State-Space (VSS) blocks with the 2D Selective Scan (SS2D) module.
arXiv Detail & Related papers (2024-01-18T17:55:39Z) - Inferring 3D change detection from bitemporal optical images [6.050310428775564]
We propose two novel networks, able to solve simultaneously the 2D and 3D CD tasks.
The aim of this work is to lay the foundations for the development of DL algorithms able to automatically infer an elevation (3D) CD map.
The code and the 3DCD dataset are available at urlhttps://sites.google.com/uniroma1.it/3dchangedetection/home-page.
arXiv Detail & Related papers (2022-05-31T15:53:33Z) - Deep Learning for Real Time Satellite Pose Estimation on Low Power Edge
TPU [58.720142291102135]
In this paper we propose a pose estimation software exploiting neural network architectures.
We show how low power machine learning accelerators could enable Artificial Intelligence exploitation in space.
arXiv Detail & Related papers (2022-04-07T08:53:18Z) - Multi-Stage Progressive Image Restoration [167.6852235432918]
We propose a novel synergistic design that can optimally balance these competing goals.
Our main proposal is a multi-stage architecture, that progressively learns restoration functions for the degraded inputs.
The resulting tightly interlinked multi-stage architecture, named as MPRNet, delivers strong performance gains on ten datasets.
arXiv Detail & Related papers (2021-02-04T18:57:07Z) - Boundary-Aware Segmentation Network for Mobile and Web Applications [60.815545591314915]
Boundary-Aware Network (BASNet) is integrated with a predict-refine architecture and a hybrid loss for highly accurate image segmentation.
BASNet runs at over 70 fps on a single GPU which benefits many potential real applications.
Based on BASNet, we further developed two (close to) commercial applications: AR COPY & PASTE, in which BASNet is augmented reality for "COPY" and "PASTING" real-world objects, and OBJECT CUT, which is a web-based tool for automatic object background removal.
arXiv Detail & Related papers (2021-01-12T19:20:26Z) - AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation [96.29533512606078]
We present AutoPose, a novel neural architecture search(NAS) framework.
It is capable of automatically discovering multiple parallel branches of cross-scale connections towards accurate and high-resolution 2D human pose estimation.
arXiv Detail & Related papers (2020-08-16T22:27:43Z) - Making DensePose fast and light [78.49552144907513]
Existing neural network models capable of solving this task are heavily parameterized.
To enable Dense Pose inference on the end device with current models, one needs to support an expensive server-side infrastructure and have a stable internet connection.
In this work, we target the problem of redesigning the DensePose R-CNN model's architecture so that the final network retains most of its accuracy but becomes more light-weight and fast.
arXiv Detail & Related papers (2020-06-26T19:42:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.