Real-Time Object Detection in Occluded Environment with Background
Cluttering Effects Using Deep Learning
- URL: http://arxiv.org/abs/2401.00986v1
- Date: Tue, 2 Jan 2024 01:30:03 GMT
- Title: Real-Time Object Detection in Occluded Environment with Background
Cluttering Effects Using Deep Learning
- Authors: Syed Muhammad Aamir, Hongbin Ma, Malak Abid Ali Khan, Muhammad Aaqib
- Abstract summary: We concentrate on deep learning models for real-time detection of cars and tanks in an occluded environment with a cluttered background.
The developed method makes the custom dataset and employs a preprocessing technique to clean the noisy dataset.
The accuracy and frame per second of the SSD-Mobilenet v2 model are higher than YOLO V3 and YOLO V4.
- Score: 0.8192907805418583
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Detection of small, undetermined moving objects or objects in an occluded
environment with a cluttered background is the main problem of computer vision.
This greatly affects the detection accuracy of deep learning models. To
overcome these problems, we concentrate on deep learning models for real-time
detection of cars and tanks in an occluded environment with a cluttered
background employing SSD and YOLO algorithms and improved precision of
detection and reduce problems faced by these models. The developed method makes
the custom dataset and employs a preprocessing technique to clean the noisy
dataset. For training the developed model we apply the data augmentation
technique to balance and diversify the data. We fine-tuned, trained, and
evaluated these models on the established dataset by applying these techniques
and highlighting the results we got more accurately than without applying these
techniques. The accuracy and frame per second of the SSD-Mobilenet v2 model are
higher than YOLO V3 and YOLO V4. Furthermore, by employing various techniques
like data enhancement, noise reduction, parameter optimization, and model
fusion we improve the effectiveness of detection and recognition. We further
added a counting algorithm, and target attributes experimental comparison, and
made a graphical user interface system for the developed model with features of
object counting, alerts, status, resolution, and frame per second.
Subsequently, to justify the importance of the developed method analysis of
YOLO V3, V4, and SSD were incorporated. Which resulted in the overall
completion of the proposed method.
Related papers
- Optimizing YOLO Architectures for Optimal Road Damage Detection and Classification: A Comparative Study from YOLOv7 to YOLOv10 [0.0]
This paper presents a comprehensive workflow for road damage detection using deep learning models.
To accommodate hardware limitations, large images are cropped, and lightweight models are utilized.
The proposed approach employs multiple model architectures, including a custom YOLOv7 model with Coordinate Attention layers and a Tiny YOLOv7 model.
arXiv Detail & Related papers (2024-10-10T22:55:12Z) - SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients [0.8873228457453465]
Small object detection in aerial imagery presents significant challenges in computer vision.
Traditional methods using transformer-based models often face limitations stemming from the lack of specialized databases.
This paper introduces two innovative approaches that significantly enhance detection and segmentation capabilities for small aerial objects.
arXiv Detail & Related papers (2024-05-02T19:47:08Z) - FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with
Pre-trained Vision-Language Models [62.663113296987085]
Few-shot class-incremental learning aims to mitigate the catastrophic forgetting issue when a model is incrementally trained on limited data.
We introduce two novel components: the Redundant Feature Eliminator (RFE) and the Spatial Noise Compensator (SNC)
Considering the imbalance in existing 3D datasets, we also propose new evaluation metrics that offer a more nuanced assessment of a 3D FSCIL model.
arXiv Detail & Related papers (2023-12-28T14:52:07Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Randomize to Generalize: Domain Randomization for Runway FOD Detection [1.4249472316161877]
Tiny Object Detection is challenging due to small size, low resolution, occlusion, background clutter, lighting conditions and small object-to-image ratio.
We propose a novel two-stage methodology Synthetic Image Augmentation (SRIA) to enhance generalization capabilities of models encountering 2D datasets.
We report that detection accuracy improved from an initial 41% to 92% for OOD test set.
arXiv Detail & Related papers (2023-09-23T05:02:31Z) - YOLO v3: Visual and Real-Time Object Detection Model for Smart
Surveillance Systems(3s) [0.0]
This paper proposes an object detection model for cyber-physical systems known as Smart Surveillance Systems (3s)
A transfer learning approach was implemented for this research to reduce training time and computing resources.
The proposed model's results performed exceedingly well in detecting objects in surveillance footages.
arXiv Detail & Related papers (2022-09-26T06:34:12Z) - SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video
Anomaly Detection [108.57862846523858]
We revisit the self-supervised multi-task learning framework, proposing several updates to the original method.
We modernize the 3D convolutional backbone by introducing multi-head self-attention modules.
In our attempt to further improve the model, we study additional self-supervised learning tasks, such as predicting segmentation maps.
arXiv Detail & Related papers (2022-07-16T19:25:41Z) - Learning-based Point Cloud Registration for 6D Object Pose Estimation in
the Real World [55.7340077183072]
We tackle the task of estimating the 6D pose of an object from point cloud data.
Recent learning-based approaches to addressing this task have shown great success on synthetic datasets.
We analyze the causes of these failures, which we trace back to the difference between the feature distributions of the source and target point clouds.
arXiv Detail & Related papers (2022-03-29T07:55:04Z) - RandomRooms: Unsupervised Pre-training from Synthetic Shapes and
Randomized Layouts for 3D Object Detection [138.2892824662943]
A promising solution is to make better use of the synthetic dataset, which consists of CAD object models, to boost the learning on real datasets.
Recent work on 3D pre-training exhibits failure when transfer features learned on synthetic objects to other real-world applications.
In this work, we put forward a new method called RandomRooms to accomplish this objective.
arXiv Detail & Related papers (2021-08-17T17:56:12Z) - Secrets of 3D Implicit Object Shape Reconstruction in the Wild [92.5554695397653]
Reconstructing high-fidelity 3D objects from sparse, partial observation is crucial for various applications in computer vision, robotics, and graphics.
Recent neural implicit modeling methods show promising results on synthetic or dense datasets.
But, they perform poorly on real-world data that is sparse and noisy.
This paper analyzes the root cause of such deficient performance of a popular neural implicit model.
arXiv Detail & Related papers (2021-01-18T03:24:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.