Related papers: Detailed Aerial Mapping of Photovoltaic Power Plants Through Semantically Significant Keypoints

Detailed Aerial Mapping of Photovoltaic Power Plants Through Semantically Significant Keypoints

URL: http://arxiv.org/abs/2510.04840v2
Date: Mon, 03 Nov 2025 13:29:54 GMT
Title: Detailed Aerial Mapping of Photovoltaic Power Plants Through Semantically Significant Keypoints
Authors: Viktor Kozák, Jan Chudoba, Libor Přeučil,
Abstract summary: An accurate and up-to-date model of a photovoltaic (PV) power plant is essential for its optimal operation and maintenance.<n>This work introduces a novel approach for PV power plant mapping based on aerial overview images.<n>The presented mapping method takes advantage of the structural layout of the power plants to achieve detailed modeling down to the level of individual PV modules.
Score: 0.5505634045241289
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: An accurate and up-to-date model of a photovoltaic (PV) power plant is essential for its optimal operation and maintenance. However, such a model may not be easily available. This work introduces a novel approach for PV power plant mapping based on aerial overview images. It enables the automation of the mapping process while removing the reliance on third-party data. The presented mapping method takes advantage of the structural layout of the power plants to achieve detailed modeling down to the level of individual PV modules. The approach relies on visual segmentation of PV modules in overview images and the inference of structural information in each image, assigning modules to individual benches, rows, and columns. We identify visual keypoints related to the layout and use these to merge detections from multiple images while maintaining their structural integrity. The presented method was experimentally verified and evaluated on two different power plants. The final fusion of 3D positions and semantic structures results in a compact georeferenced model suitable for power plant maintenance.

Related papers

Solar PV Installation Potential Assessment on Building Facades Based on Vision and Language Foundation Models [11.037550898765502]
This study introduces SF-SPA (Semantic Facade Solar-PV Assessment), an automated framework that transforms street-view photographs into quantitative PV deployment assessments.<n>The approach combines com puter vision and artificial intelligence techniques to address three key challenges: perspective distortion correction, semantic understanding of facade elements, and spatial reasoning for PV layout optimization.
arXiv Detail & Related papers (2025-10-01T11:51:28Z)
Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection [0.6291443816903801]
This paper presents a novel localization pipeline that integrates PV module detection with UAV navigation.<n> Detections are used to identify the power plant structures in the image and associate these with the power plant model.<n>We present three distinct methods for visual segmentation of PV modules based on traditional computer vision, deep learning, and their fusion.
arXiv Detail & Related papers (2025-01-24T15:48:41Z)
SHIC: Shape-Image Correspondences with no Keypoint Supervision [106.99157362200867]
Canonical surface mapping generalizes keypoint detection by assigning each pixel of an object to a corresponding point in a 3D template. Popularised by DensePose for the analysis of humans, authors have attempted to apply the concept to more categories. We introduce SHIC, a method to learn canonical maps without manual supervision which achieves better results than supervised methods for most categories.
arXiv Detail & Related papers (2024-07-26T17:58:59Z)
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling [6.646508986504754]
We introduce S3Former, designed to segment solar panels from aerial imagery and provide size and location information.<n>S3Former features a Masked Attention Mask Transformer incorporating a self-supervised learning pretrained backbone.<n>We evaluate S3Former using diverse datasets, demonstrate improvement state-of-the-art models.
arXiv Detail & Related papers (2024-05-07T16:56:21Z)
AutoPV: Automated photovoltaic forecasts with limited information using an ensemble of pre-trained models [0.20999222360659608]
We propose a new method for day-ahead PV power generation forecasts called AutoPV. AutoPV is a weighted ensemble of forecasting models that represent different PV mounting configurations. For a real-world data set with 11 PV plants, the accuracy of AutoPV is comparable to a model trained on two years of data and outperforms an incrementally trained model.
arXiv Detail & Related papers (2022-12-13T18:29:03Z)
SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization [59.732036564862796]
We propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning. The proposed two modules are light-weighted and can be plugged into any transformer network and trained end-to-end easily. Experiments and analyses demonstrate that the proposed SIM-Trans achieves state-of-the-art performance on fine-grained visual categorization benchmarks.
arXiv Detail & Related papers (2022-08-31T03:00:07Z)
ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond [76.35955924137986]
We propose a Vision Transformer Advanced by Exploring intrinsic IB from convolutions, i.e., ViTAE. ViTAE has several spatial pyramid reduction modules to downsample and embed the input image into tokens with rich multi-scale context. We obtain the state-of-the-art classification performance, i.e., 88.5% Top-1 classification accuracy on ImageNet validation set and the best 91.2% Top-1 accuracy on ImageNet real validation set.
arXiv Detail & Related papers (2022-02-21T10:40:05Z)
VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion [68.68537312256144]
VoRTX is an end-to-end volumetric 3D reconstruction network using transformers for wide-baseline, multi-view feature fusion. We train our model on ScanNet and show that it produces better reconstructions than state-of-the-art methods.
arXiv Detail & Related papers (2021-12-01T02:18:11Z)
PnP-DETR: Towards Efficient Visual Analysis with Transformers [146.55679348493587]
Recently, DETR pioneered the solution vision tasks with transformers, it directly translates the image feature map into the object result. Recent transformer-based image recognition model andTT show consistent efficiency gain.
arXiv Detail & Related papers (2021-09-15T01:10:30Z)
Segmentation of cell-level anomalies in electroluminescence images of photovoltaic modules [0.0]
We propose an end-to-end deep learning pipeline that detects, locates and segments cell-level anomalies from entire photovoltaic modules. The proposed modular pipeline combines three deep learning techniques: 1. object detection (modified Faster-RNN), 2. image classification (EfficientNet) and 3. weakly supervised segmentation (autoencoder)
arXiv Detail & Related papers (2021-06-21T10:17:40Z)
TransVG: End-to-End Visual Grounding with Transformers [102.11922622103613]
We present a transformer-based framework for visual grounding, namely TransVG, to address the task of grounding a language query to an image. We show that the complex fusion modules can be replaced by a simple stack of transformer encoder layers with higher performance.
arXiv Detail & Related papers (2021-04-17T13:35:24Z)

This list is automatically generated from the titles and abstracts of the papers in this site.