Related papers: Object Detection in Aerial Imagery

Object Detection in Aerial Imagery

URL: http://arxiv.org/abs/2211.15479v1
Date: Tue, 15 Nov 2022 11:22:18 GMT
Title: Object Detection in Aerial Imagery
Authors: Dmitry Demidov, Rushali Grandhe, Salem AlMarri
Abstract summary: We show the performance of two-stage, one-stage and attention based object detectors on the iSAID dataset. We also show a comparative study highlighting the pros and cons of different models in aerial imagery setting.
Score: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Object detection in natural images has achieved remarkable results over the years. However, a similar progress has not yet been observed in aerial object detection due to several challenges, such as high resolution images, instances scale variation, class imbalance etc. We show the performance of two-stage, one-stage and attention based object detectors on the iSAID dataset. Furthermore, we describe some modifications and analysis performed for different models - a) In two stage detector: introduced weighted attention based FPN, class balanced sampler and density prediction head. b) In one stage detector: used weighted focal loss and introduced FPN. c) In attention based detector: compare single,multi-scale attention and demonstrate effect of different backbones. Finally, we show a comparative study highlighting the pros and cons of different models in aerial imagery setting.

Related papers

Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models [68.90917438865078]
Deepfake techniques for facial synthesis and editing pose serious risks for generative models. In this paper, we investigate how detection performance varies across model backbones, types, and datasets. We introduce Contrastive Blur, which enhances performance on facial images, and MINDER, which addresses noise type bias, balancing performance across domains.
arXiv Detail & Related papers (2024-11-28T13:04:45Z)
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++. Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation. Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z)
Object Detection in Aerial Images in Scarce Data Regimes [0.0]
Small objects, more numerous in aerial images, are the cause for the apparent performance gap between natural and aerial images. We propose a scale-adaptive box similarity criterion, that improves the training and evaluation of FSOD methods. We also contribute to generic FSOD with two distinct approaches based on metric learning and fine-tuning.
arXiv Detail & Related papers (2023-10-16T14:16:47Z)
On the Robustness of Object Detection Models in Aerial Images [37.50307094643692]
We introduce two novel benchmarks based on DOTA-v1.0. The first benchmark encompasses 19 prevalent corruptions, while the second focuses on cloud-corrupted images. We find that enhanced model architectures, larger networks, well-crafted modules, and judicious data augmentation strategies collectively enhance the robustness of aerial object detection models.
arXiv Detail & Related papers (2023-08-29T15:16:51Z)
ODAM: Gradient-based instance-specific visual explanations for object detection [51.476702316759635]
gradient-weighted Object Detector Activation Maps (ODAM) ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. We propose Odam-NMS, which considers the information of the model's explanation for each prediction to distinguish duplicate detected objects.
arXiv Detail & Related papers (2023-04-13T09:20:26Z)
Active Gaze Control for Foveal Scene Exploration [124.11737060344052]
We propose a methodology to emulate how humans and robots with foveal cameras would explore a scene. The proposed method achieves an increase in detection F1-score of 2-3 percentage points for the same number of gaze shifts.
arXiv Detail & Related papers (2022-08-24T14:59:28Z)
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection [65.30079184700755]
This study addresses the issue of fusing infrared and visible images that appear differently for object detection. Previous approaches discover commons underlying the two modalities and fuse upon the common space either by iterative optimization or deep networks. This paper proposes a bilevel optimization formulation for the joint problem of fusion and detection, and then unrolls to a target-aware Dual Adversarial Learning (TarDAL) network for fusion and a commonly used detection network.
arXiv Detail & Related papers (2022-03-30T11:44:56Z)
Focus-and-Detect: A Small Object Detection Framework for Aerial Images [1.911678487931003]
We propose a two-stage object detection framework called "Focus-and-Detect" The first stage generates clusters of objects constituting the focused regions. The second stage, which is also an object detector network, predicts objects within the focal regions. Results indicate that the proposed two-stage framework achieves an AP score of 42.06 on VisDrone validation dataset.
arXiv Detail & Related papers (2022-03-24T10:43:56Z)
Validation of object detection in UAV-based images using synthetic data [9.189702268557483]
Machine learning (ML) models for UAV-based detection are often validated using data curated for tasks unrelated to the UAV application. Such errors arise due to differences in imaging conditions between images from UAVs and images in training. Our work is focused on understanding the impact of different UAV-based imaging conditions on detection performance by using synthetic data generated using a game engine.
arXiv Detail & Related papers (2022-01-17T20:56:56Z)
RelationRS: Relationship Representation Network for Object Detection in Aerial Images [15.269897893563417]
We propose a relationship representation network for object detection in aerial images (RelationRS) The dual relationship module learns the potential relationship between features of different scales and learns the relationship between different scenes from different patches in a same iteration. The bridging visual representations module (BVR) is introduced into the field of aerial images to improve the object detection effect in images with complex backgrounds.
arXiv Detail & Related papers (2021-10-13T14:02:33Z)
Object-aware Contrastive Learning for Debiased Scene Representation [74.30741492814327]
We develop a novel object-aware contrastive learning framework that localizes objects in a self-supervised manner. We also introduce two data augmentations based on ContraCAM, object-aware random crop and background mixup, which reduce contextual and background biases during contrastive self-supervised learning.
arXiv Detail & Related papers (2021-07-30T19:24:07Z)
Perceiving Traffic from Aerial Images [86.994032967469]
We propose an object detection method called Butterfly Detector that is tailored to detect objects in aerial images. We evaluate our Butterfly Detector on two publicly available UAV datasets (UAVDT and VisDrone 2019) and show that it outperforms previous state-of-the-art methods while remaining real-time.
arXiv Detail & Related papers (2020-09-16T11:37:43Z)

This list is automatically generated from the titles and abstracts of the papers in this site.