On the Importance of Large Objects in CNN Based Object Detection
Algorithms
- URL: http://arxiv.org/abs/2311.11714v1
- Date: Mon, 20 Nov 2023 12:32:32 GMT
- Title: On the Importance of Large Objects in CNN Based Object Detection
Algorithms
- Authors: Ahmed Ben Saad (CB), Gabriele Facciolo (CB), Axel Davy (CB)
- Abstract summary: We highlight the importance of large objects in learning features that are critical for all sizes.
We show that giving more weight to large objects leads to improved detection scores across all object sizes.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection models, a prominent class of machine learning algorithms,
aim to identify and precisely locate objects in images or videos. However, this
task might yield uneven performances sometimes caused by the objects sizes and
the quality of the images and labels used for training. In this paper, we
highlight the importance of large objects in learning features that are
critical for all sizes. Given these findings, we propose to introduce a
weighting term into the training loss. This term is a function of the object
area size. We show that giving more weight to large objects leads to improved
detection scores across all object sizes and so an overall improvement in
Object Detectors performances (+2 p.p. of mAP on small objects, +2 p.p. on
medium and +4 p.p. on large on COCO val 2017 with InternImage-T). Additional
experiments and ablation studies with different models and on a different
dataset further confirm the robustness of our findings.
Related papers
- Correlation of Object Detection Performance with Visual Saliency and Depth Estimation [0.09208007322096533]
This paper investigates the correlations between object detection accuracy and two fundamental visual tasks: depth prediction and visual saliency prediction.
Our analysis reveals significant variations in these correlations across object categories, with larger objects showing correlation values up to three times higher than smaller objects.
These findings suggest incorporating visual saliency features into object detection architectures could be more beneficial than depth information.
arXiv Detail & Related papers (2024-11-05T06:34:19Z) - Visual Context-Aware Person Fall Detection [52.49277799455569]
We present a segmentation pipeline to semi-automatically separate individuals and objects in images.
Background objects such as beds, chairs, or wheelchairs can challenge fall detection systems, leading to false positive alarms.
We demonstrate that object-specific contextual transformations during training effectively mitigate this challenge.
arXiv Detail & Related papers (2024-04-11T19:06:36Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Uncertainty Aware Active Learning for Reconfiguration of Pre-trained
Deep Object-Detection Networks for New Target Domains [0.0]
Object detection is one of the most important and fundamental aspects of computer vision tasks.
To obtain training data for object detection model efficiently, many datasets opt to obtain their unannotated data in video format.
Annotating every frame from a video is costly and inefficient since many frames contain very similar information for the model to learn from.
In this paper, we proposed a novel active learning algorithm for object detection models to tackle this problem.
arXiv Detail & Related papers (2023-03-22T17:14:10Z) - ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object
Detection [24.085715205081385]
We denote size of an object as the number of pixels it covers in an image and size imbalance as the over-representation of certain sizes of objects in a dataset.
We propose a novel ARchitectUre-agnostic BAlanced Loss (ARUBA) that can be applied as a plugin on top of any object detection model.
arXiv Detail & Related papers (2022-10-10T11:28:16Z) - Object Detection in Aerial Images with Uncertainty-Aware Graph Network [61.02591506040606]
We propose a novel uncertainty-aware object detection framework with a structured-graph, where nodes and edges are denoted by objects.
We refer to our model as Uncertainty-Aware Graph network for object DETection (UAGDet)
arXiv Detail & Related papers (2022-08-23T07:29:03Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Scale Normalized Image Pyramids with AutoFocus for Object Detection [75.71320993452372]
A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales.
We propose an efficient spatial sub-sampling scheme which only operates on fixed-size sub-regions likely to contain objects.
The resulting algorithm is referred to as AutoFocus and results in a 2.5-5 times speed-up during inference when used with SNIP.
arXiv Detail & Related papers (2021-02-10T18:57:53Z) - Quantum Tensor Network in Machine Learning: An Application to Tiny
Object Classification [39.027985567917455]
In our work, we apply quantum spin model to image classification and bring the theory into the scenario of tiny object classification.
In the end, our experimental results indicate that tensor network models are effective for tiny object classification problem and potentially will beat state-of-the-art.
arXiv Detail & Related papers (2021-01-08T18:33:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.