Detection, Recognition, and Tracking: A Survey
- URL: http://arxiv.org/abs/2203.11900v1
- Date: Tue, 22 Mar 2022 17:11:24 GMT
- Title: Detection, Recognition, and Tracking: A Survey
- Authors: Shiyao Chen and Dale Chen-Song
- Abstract summary: In Computer Vision and Multimedia, it is increasingly important to detect, recognize and track objects in images and/or videos.
Many applications, such as facial recognition, surveillance, animation, are used for tracking features and/or people.
This literature review focuses on some novel techniques on object detection and recognition, and how to apply tracking algorithms to the detected features to track the objects' movements.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For humans, object detection, recognition, and tracking are innate. These
provide the ability for human to perceive their environment and objects within
their environment. This ability however doesn't translate well in computers. In
Computer Vision and Multimedia, it is becoming increasingly more important to
detect, recognize and track objects in images and/or videos. Many of these
applications, such as facial recognition, surveillance, animation, are used for
tracking features and/or people. However, these tasks prove challenging for
computers to do effectively, as there is a significant amount of data to parse
through. Therefore, many techniques and algorithms are needed and therefore
researched to try to achieve human like perception. In this literature review,
we focus on some novel techniques on object detection and recognition, and how
to apply tracking algorithms to the detected features to track the objects'
movements.
Related papers
- Computer Vision for Primate Behavior Analysis in the Wild [61.08941894580172]
Video-based behavioral monitoring has great potential for transforming how we study animal cognition and behavior.
There is still a fairly large gap between the exciting prospects and what can actually be achieved in practice today.
arXiv Detail & Related papers (2024-01-29T18:59:56Z) - Neural feels with neural fields: Visuo-tactile perception for in-hand
manipulation [57.60490773016364]
We combine vision and touch sensing on a multi-fingered hand to estimate an object's pose and shape during in-hand manipulation.
Our method, NeuralFeels, encodes object geometry by learning a neural field online and jointly tracks it by optimizing a pose graph problem.
Our results demonstrate that touch, at the very least, refines and, at the very best, disambiguates visual estimates during in-hand manipulation.
arXiv Detail & Related papers (2023-12-20T22:36:37Z) - Object Recognition in Different Lighting Conditions at Various Angles by
Deep Learning Method [0.0]
Existing computer vision and object detection methods rely on neural networks and deep learning.
This article aims to classify the name of the various object based on the position of an object's detected box.
We find that this model's accuracy through recognition is mainly influenced by the proportion of objects and the number of samples.
arXiv Detail & Related papers (2022-10-18T06:23:26Z) - Video-based Human Action Recognition using Deep Learning: A Review [4.976815699476327]
Human action recognition is an important application domain in computer vision.
Deep learning has been given particular attention by the computer vision community.
This paper presents an overview of the current state-of-the-art in action recognition using video analysis with deep learning techniques.
arXiv Detail & Related papers (2022-08-07T17:12:12Z) - Single Object Tracking Research: A Survey [44.24280758718638]
This paper presents the rationale and works of two most popular tracking frameworks in past ten years.
We present some deep learning based tracking methods categorized by different network structures.
We also introduce some classical strategies for handling the challenges in tracking problem.
arXiv Detail & Related papers (2022-04-25T02:59:15Z) - Classifying Human Activities with Inertial Sensors: A Machine Learning
Approach [0.0]
Human Activity Recognition (HAR) is an ongoing research topic.
It has applications in medical support, sports, fitness, social networking, human-computer interfaces, senior care, entertainment, surveillance, and the list goes on.
We examined and analyzed different Machine Learning and Deep Learning approaches for Human Activity Recognition using inertial sensor data of smartphones.
arXiv Detail & Related papers (2021-11-09T08:17:33Z) - Analysis of voxel-based 3D object detection methods efficiency for
real-time embedded systems [93.73198973454944]
Two popular voxel-based 3D object detection methods are studied in this paper.
Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances.
Our findings suggest that a considerable part of the computations of existing methods is focused on locations of the scene that do not contribute with successful detection.
arXiv Detail & Related papers (2021-05-21T12:40:59Z) - Careful with That! Observation of Human Movements to Estimate Objects
Properties [106.925705883949]
We focus on the features of human motor actions that communicate insights on the weight of an object.
Our final goal is to enable a robot to autonomously infer the degree of care required in object handling.
arXiv Detail & Related papers (2021-03-02T08:14:56Z) - Camouflaged Object Detection and Tracking: A Survey [7.056222499095849]
We review the existing camouflaged object detection and tracking techniques using computer vision algorithms from the theoretical point of view.
This article also addresses several issues of interest as well as future research direction on this area.
arXiv Detail & Related papers (2020-12-25T14:15:45Z) - Detecting Invisible People [58.49425715635312]
We re-purpose tracking benchmarks and propose new metrics for the task of detecting invisible objects.
We demonstrate that current detection and tracking systems perform dramatically worse on this task.
Second, we build dynamic models that explicitly reason in 3D, making use of observations produced by state-of-the-art monocular depth estimation networks.
arXiv Detail & Related papers (2020-12-15T16:54:45Z) - What Can You Learn from Your Muscles? Learning Visual Representation
from Human Interactions [50.435861435121915]
We use human interaction and attention cues to investigate whether we can learn better representations compared to visual-only representations.
Our experiments show that our "muscly-supervised" representation outperforms a visual-only state-of-the-art method MoCo.
arXiv Detail & Related papers (2020-10-16T17:46:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.