CNN-Based Action Recognition and Pose Estimation for Classifying Animal
Behavior from Videos: A Survey
- URL: http://arxiv.org/abs/2301.06187v1
- Date: Sun, 15 Jan 2023 20:54:44 GMT
- Title: CNN-Based Action Recognition and Pose Estimation for Classifying Animal
Behavior from Videos: A Survey
- Authors: Michael Perez and Corey Toler-Franklin
- Abstract summary: Action recognition, classifying activities performed by one or more subjects in a trimmed video, forms the basis of many techniques.
Deep learning models for human action recognition have progressed over the last decade.
Recent interest in research that incorporates deep learning-based action recognition for classification has increased.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Classifying the behavior of humans or animals from videos is important in
biomedical fields for understanding brain function and response to stimuli.
Action recognition, classifying activities performed by one or more subjects in
a trimmed video, forms the basis of many of these techniques. Deep learning
models for human action recognition have progressed significantly over the last
decade. Recently, there is an increased interest in research that incorporates
deep learning-based action recognition for animal behavior classification.
However, human action recognition methods are more developed. This survey
presents an overview of human action recognition and pose estimation methods
that are based on convolutional neural network (CNN) architectures and have
been adapted for animal behavior classification in neuroscience. Pose
estimation, estimating joint positions from an image frame, is included because
it is often applied before classifying animal behavior. First, we provide
foundational information on algorithms that learn spatiotemporal features
through 2D, two-stream, and 3D CNNs. We explore motivating factors that
determine optimizers, loss functions and training procedures, and compare their
performance on benchmark datasets. Next, we review animal behavior frameworks
that use or build upon these methods, organized by the level of supervision
they require. Our discussion is uniquely focused on the technical evolution of
the underlying CNN models and their architectural adaptations (which we
illustrate), rather than their usability in a neuroscience lab. We conclude by
discussing open research problems, and possible research directions. Our survey
is designed to be a resource for researchers developing fully unsupervised
animal behavior classification systems of which there are only a few examples
in the literature.
Related papers
- Animal Behavior Analysis Methods Using Deep Learning: A Survey [3.9086052572489653]
State-of-the-art deep learning models have demonstrated remarkable accuracy in classifying various forms of animal data.
The article culminates in a comprehensive discussion of key research directions within deep learning that hold potential for advancing the field of animal behavior studies.
arXiv Detail & Related papers (2024-05-22T21:17:54Z) - Computer Vision for Primate Behavior Analysis in the Wild [61.08941894580172]
Video-based behavioral monitoring has great potential for transforming how we study animal cognition and behavior.
There is still a fairly large gap between the exciting prospects and what can actually be achieved in practice today.
arXiv Detail & Related papers (2024-01-29T18:59:56Z) - Language Knowledge-Assisted Representation Learning for Skeleton-Based
Action Recognition [71.35205097460124]
How humans understand and recognize the actions of others is a complex neuroscientific problem.
LA-GCN proposes a graph convolution network using large-scale language models (LLM) knowledge assistance.
arXiv Detail & Related papers (2023-05-21T08:29:16Z) - Review on Social Behavior Analysis of Laboratory Animals: From
Methodologies to Applications [0.0]
We explore a variety of behaviour detection algorithms, covering traditional vision methods, statistical methods and deep learning methods.
The objective of this work is to provide a thorough investigation of related work, furnishing biologists with a scratch of efficient animal behaviour detection methods.
arXiv Detail & Related papers (2022-06-25T13:40:35Z) - Guiding Visual Attention in Deep Convolutional Neural Networks Based on
Human Eye Movements [0.0]
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological vision.
Recent advances in deep learning seem to decrease this similarity.
We investigate a purely data-driven approach to obtain useful models.
arXiv Detail & Related papers (2022-06-21T17:59:23Z) - Overcoming the Domain Gap in Neural Action Representations [60.47807856873544]
3D pose data can now be reliably extracted from multi-view video sequences without manual intervention.
We propose to use it to guide the encoding of neural action representations together with a set of neural and behavioral augmentations.
To reduce the domain gap, during training, we swap neural and behavioral data across animals that seem to be performing similar actions.
arXiv Detail & Related papers (2021-12-02T12:45:46Z) - Overcoming the Domain Gap in Contrastive Learning of Neural Action
Representations [60.47807856873544]
A fundamental goal in neuroscience is to understand the relationship between neural activity and behavior.
We generated a new multimodal dataset consisting of the spontaneous behaviors generated by fruit flies.
This dataset and our new set of augmentations promise to accelerate the application of self-supervised learning methods in neuroscience.
arXiv Detail & Related papers (2021-11-29T15:27:51Z) - Affect Analysis in-the-wild: Valence-Arousal, Expressions, Action Units
and a Unified Framework [83.21732533130846]
The paper focuses on large in-the-wild databases, i.e., Aff-Wild and Aff-Wild2.
It presents the design of two classes of deep neural networks trained with these databases.
A novel multi-task and holistic framework is presented which is able to jointly learn and effectively generalize and perform affect recognition.
arXiv Detail & Related papers (2021-03-29T17:36:20Z) - Transferring Dense Pose to Proximal Animal Classes [83.84439508978126]
We show that it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes.
We do this by establishing a DensePose model for the new animal which is also geometrically aligned to humans.
We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach.
arXiv Detail & Related papers (2020-02-28T21:43:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.