Data Fusion for Multi-Task Learning of Building Extraction and Height
Estimation
- URL: http://arxiv.org/abs/2308.02960v1
- Date: Sat, 5 Aug 2023 22:16:19 GMT
- Title: Data Fusion for Multi-Task Learning of Building Extraction and Height
Estimation
- Authors: Saad Ahmed Jamal, Arioluwa Aribisala
- Abstract summary: This paper attempts a multitask-learning method of building extraction and height estimation using both optical and radar satellite imagery.
Contrary to the initial goal of multitask learning, this paper reports the individual implementation of the building extraction and height estimation under constraints.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In accordance with the urban reconstruction problem proposed by the DFC23
Track 2 Contest, this paper attempts a multitask-learning method of building
extraction and height estimation using both optical and radar satellite
imagery. Contrary to the initial goal of multitask learning which could
potentially give a superior solution by reusing features and forming implicit
constraints between multiple tasks, this paper reports the individual
implementation of the building extraction and height estimation under
constraints. The baseline results for the building extraction and the height
estimation significantly increased after designed experiments.
Related papers
- Estimate the building height at a 10-meter resolution based on Sentinel data [5.080045077714947]
This study established a set of spatial-spectral-temporal feature databases.
It combined SAR data provided by Sentinel-1, optical data provided by Sentinel-2, and shape data provided by building footprints.
The statistical indicators on the time scale are extracted to form a rich database of 160 features.
arXiv Detail & Related papers (2024-05-02T03:53:59Z) - RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering
Assisted Distillation [50.35403070279804]
3D occupancy prediction is an emerging task that aims to estimate the occupancy states and semantics of 3D scenes using multi-view images.
We propose RadOcc, a Rendering assisted distillation paradigm for 3D Occupancy prediction.
arXiv Detail & Related papers (2023-12-19T03:39:56Z) - Multi-task deep learning for large-scale building detail extraction from
high-resolution satellite imagery [13.544826927121992]
Multi-task Building Refiner (MT-BR) is an adaptable neural network tailored for simultaneous extraction of building details from satellite imagery.
For large-scale applications, we devise a novel spatial sampling scheme that strategically selects limited but representative image samples.
MT-BR consistently outperforms other state-of-the-art methods in extracting building details across various metrics.
arXiv Detail & Related papers (2023-10-29T04:43:30Z) - HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View
Building Extraction and Height Estimation [13.09940764764909]
We propose a Height-hierarchy Guided Dual-decoder Network (HGDNet) to estimate building height.
Under the guidance of synthesized discrete height-hierarchy nDSM, auxiliary height-hierarchical building extraction branch enhance the height estimation branch.
Additional two-stage cascade architecture is adopted to achieve more accurate building extraction.
arXiv Detail & Related papers (2023-08-10T07:03:32Z) - LIGHT: Joint Individual Building Extraction and Height Estimation from
Satellite Images through a Unified Multitask Learning Network [8.09909901104654]
Building extraction and height estimation are two important basic tasks in remote sensing image interpretation.
Most of the existing research regards the two tasks as independent studies.
In this work, we combine the individuaL buIlding extraction and heiGHt estimation through a unified multiTask learning network.
arXiv Detail & Related papers (2023-04-03T15:48:24Z) - Hierarchical Imitation Learning with Vector Quantized Models [77.67190661002691]
We propose to use reinforcement learning to identify subgoals in expert trajectories.
We build a vector-quantized generative model for the identified subgoals to perform subgoal-level planning.
In experiments, the algorithm excels at solving complex, long-horizon decision-making problems outperforming state-of-the-art.
arXiv Detail & Related papers (2023-01-30T15:04:39Z) - Geometry Uncertainty Projection Network for Monocular 3D Object
Detection [138.24798140338095]
We propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages.
Specifically, a GUP module is proposed to obtains the geometry-guided uncertainty of the inferred depth.
At the training stage, we propose a Hierarchical Task Learning strategy to reduce the instability caused by error amplification.
arXiv Detail & Related papers (2021-07-29T06:59:07Z) - Unsupervised Scale-consistent Depth Learning from Video [131.3074342883371]
We propose a monocular depth estimator SC-Depth, which requires only unlabelled videos for training.
Thanks to the capability of scale-consistent prediction, we show that our monocular-trained deep networks are readily integrated into the ORB-SLAM2 system.
The proposed hybrid Pseudo-RGBD SLAM shows compelling results in KITTI, and it generalizes well to the KAIST dataset without additional training.
arXiv Detail & Related papers (2021-05-25T02:17:56Z) - A Multi-Task Deep Learning Framework for Building Footprint Segmentation [0.0]
We propose a joint optimization scheme for the task of building footprint delineation.
We also introduce two auxiliary tasks; image reconstruction and building footprint boundary segmentation.
In particular, we propose a deep multi-task learning (MTL) based unified fully convolutional framework.
arXiv Detail & Related papers (2021-04-19T15:07:27Z) - Neural Architecture Search From Fr\'echet Task Distance [50.9995960884133]
We show how the distance between a target task and each task in a given set of baseline tasks can be used to reduce the neural architecture search space for the target task.
The complexity reduction in search space for task-specific architectures is achieved by building on the optimized architectures for similar tasks instead of doing a full search without using this side information.
arXiv Detail & Related papers (2021-03-23T20:43:31Z) - Counting from Sky: A Large-scale Dataset for Remote Sensing Object
Counting and A Benchmark Method [52.182698295053264]
We are interested in counting dense objects from remote sensing images. Compared with object counting in a natural scene, this task is challenging in the following factors: large scale variation, complex cluttered background, and orientation arbitrariness.
To address these issues, we first construct a large-scale object counting dataset with remote sensing images, which contains four important geographic objects.
We then benchmark the dataset by designing a novel neural network that can generate a density map of an input image.
arXiv Detail & Related papers (2020-08-28T03:47:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.