Light Weight Character and Shape Recognition for Autonomous Drones
- URL: http://arxiv.org/abs/2208.06804v1
- Date: Sun, 14 Aug 2022 08:22:41 GMT
- Title: Light Weight Character and Shape Recognition for Autonomous Drones
- Authors: Neetigya Poddar, Shruti Jain
- Abstract summary: We propose an object detection and classification pipeline which prevents false positives and minimizes misclassification of alphanumeric characters and shapes in aerial images.
Our method makes use of traditional computer vision techniques and unsupervised machine learning methods for identifying region proposals.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There has been an extensive use of Unmanned Aerial Vehicles in search and
rescue missions to distribute first aid kits and food packets. It is important
that these UAVs are able to identify and distinguish the markers from one
another for effective distribution. One of the common ways to mark the
locations is via the use of characters superimposed on shapes of various colors
which gives rise to wide variety of markers based on combination of different
shapes, characters, and their respective colors.
In this paper, we propose an object detection and classification pipeline
which prevents false positives and minimizes misclassification of alphanumeric
characters and shapes in aerial images. Our method makes use of traditional
computer vision techniques and unsupervised machine learning methods for
identifying region proposals, segmenting the image targets and removing false
positives. We make use of a computationally light model for classification,
making it easy to be deployed on any aerial vehicle.
Related papers
- An Application-Agnostic Automatic Target Recognition System Using Vision Language Models [32.858386851006316]
We present a novel Automatic Target Recognition (ATR) system using open-vocabulary object detection and classification models.
A primary advantage of this approach is that target classes can be defined just before runtime by a non-technical end user.
Nuances in the desired targets can be expressed in natural language, which is useful for unique targets with little or no training data.
arXiv Detail & Related papers (2024-11-05T20:16:15Z) - Intelligent Known and Novel Aircraft Recognition -- A Shift from
Classification to Similarity Learning for Combat Identification [0.7841783185186357]
This research addresses the accurate recognition of Novel/Unknown types of aircraft in addition to Known types.
Traditional methods, human expert-driven combat identification and image classification, fall short in identifying Novel classes.
Our methodology employs similarity learning to discern features of a broad spectrum of military and civilian aircraft.
arXiv Detail & Related papers (2024-02-26T11:08:26Z) - OsmLocator: locating overlapping scatter marks with a non-training
generative perspective [48.50108853199417]
Locating overlapping marks faces many difficulties such as no texture, less contextual information, hallow shape and tiny size.
Here, we formulate it as a optimization problem on clustering-based re-visualization from a non-training generative perspective.
We especially built a dataset named 2023 containing hundreds of scatter images with different markers and various levels of overlapping severity, and tested the proposed method and compared it to existing methods.
arXiv Detail & Related papers (2023-12-18T12:39:48Z) - DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised
Transformers for Weakly Supervised Object Localization [13.412674368913747]
Discriminative Pseudo-label Sampling (DiPS) is introduced to leverage class-agnostic maps for weakly-supervised object localization.
DiPS relies on a pre-trained classifier to identify the most discriminative regions of each attention map.
It provides a rich pool of diverse and discriminative proposals to cover different parts of the object.
arXiv Detail & Related papers (2023-10-09T22:52:43Z) - A Black-Box Attack on Optical Character Recognition Systems [0.0]
Adversarial machine learning is an emerging area showing the vulnerability of deep learning models.
In this paper, we propose a simple yet efficient attack method, Efficient Combinatorial Black-box Adversarial Attack, on binary image classifiers.
We validate the efficiency of the attack technique on two different data sets and three classification networks, demonstrating its performance.
arXiv Detail & Related papers (2022-08-30T14:36:27Z) - LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of
Feature Similarity [49.84167231111667]
Existing works in self-supervised landmark detection are based on learning dense (pixel-level) feature representations from an image.
We introduce an approach to enhance the learning of dense equivariant representations in a self-supervised fashion.
We show that having such a prior in the feature extractor helps in landmark detection, even under drastically limited number of annotations.
arXiv Detail & Related papers (2022-04-06T17:48:18Z) - Attention Model Enhanced Network for Classification of Breast Cancer
Image [54.83246945407568]
AMEN is formulated in a multi-branch fashion with pixel-wised attention model and classification submodular.
To focus more on subtle detail information, the sample image is enhanced by the pixel-wised attention map generated from former branch.
Experiments conducted on three benchmark datasets demonstrate the superiority of the proposed method under various scenarios.
arXiv Detail & Related papers (2020-10-07T08:44:21Z) - Perceiving Traffic from Aerial Images [86.994032967469]
We propose an object detection method called Butterfly Detector that is tailored to detect objects in aerial images.
We evaluate our Butterfly Detector on two publicly available UAV datasets (UAVDT and VisDrone 2019) and show that it outperforms previous state-of-the-art methods while remaining real-time.
arXiv Detail & Related papers (2020-09-16T11:37:43Z) - Orientation-aware Vehicle Re-identification with Semantics-guided Part
Attention Network [33.712450134663236]
We propose a dedicated Semantics-guided Part Attention Network (SPAN) to robustly predict part attention masks for different views of vehicles.
With the help of part attention masks, we can extract discriminative features in each part separately.
Then we introduce Co-occurrence Part-attentive Distance Metric (CPDM) which places greater emphasis on co-occurrence vehicle parts.
arXiv Detail & Related papers (2020-08-26T07:33:09Z) - CSI: Novelty Detection via Contrastive Learning on Distributionally
Shifted Instances [77.28192419848901]
We propose a simple, yet effective method named contrasting shifted instances (CSI)
In addition to contrasting a given sample with other instances as in conventional contrastive learning methods, our training scheme contrasts the sample with distributionally-shifted augmentations of itself.
Our experiments demonstrate the superiority of our method under various novelty detection scenarios.
arXiv Detail & Related papers (2020-07-16T08:32:56Z) - The Devil is in the Details: Self-Supervised Attention for Vehicle
Re-Identification [75.3310894042132]
Self-supervised Attention for Vehicle Re-identification (SAVER) is a novel approach to effectively learn vehicle-specific discriminative features.
We show that SAVER improves upon the state-of-the-art on challenging VeRi, VehicleID, Vehicle-1M and VERI-Wild datasets.
arXiv Detail & Related papers (2020-04-14T02:24:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.