Group Activity Recognition in Computer Vision: A Comprehensive Review,
Challenges, and Future Perspectives
- URL: http://arxiv.org/abs/2307.13541v1
- Date: Tue, 25 Jul 2023 14:44:41 GMT
- Title: Group Activity Recognition in Computer Vision: A Comprehensive Review,
Challenges, and Future Perspectives
- Authors: Chuanchuan Wang, Ahmad Sufril Azlan Mohamed
- Abstract summary: Group activity recognition is a hot topic in computer vision.
Recognizing activities through group relationships plays a vital role in group activity recognition.
This work examines the progress in technology for recognizing group activities.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Group activity recognition is a hot topic in computer vision. Recognizing
activities through group relationships plays a vital role in group activity
recognition. It holds practical implications in various scenarios, such as
video analysis, surveillance, automatic driving, and understanding social
activities. The model's key capabilities encompass efficiently modeling
hierarchical relationships within a scene and accurately extracting distinctive
spatiotemporal features from groups. Given this technology's extensive
applicability, identifying group activities has garnered significant research
attention. This work examines the current progress in technology for
recognizing group activities, with a specific focus on global interactivity and
activities. Firstly, we comprehensively review the pertinent literature and
various group activity recognition approaches, from traditional methodologies
to the latest methods based on spatial structure, descriptors, non-deep
learning, hierarchical recurrent neural networks (HRNN), relationship models,
and attention mechanisms. Subsequently, we present the relational network and
relational architectures for each module. Thirdly, we investigate methods for
recognizing group activity and compare their performance with state-of-the-art
technologies. We summarize the existing challenges and provide comprehensive
guidance for newcomers to understand group activity recognition. Furthermore,
we review emerging perspectives in group activity recognition to explore new
directions and possibilities.
Related papers
- Visual-Geometric Collaborative Guidance for Affordance Learning [63.038406948791454]
We propose a visual-geometric collaborative guided affordance learning network that incorporates visual and geometric cues.
Our method outperforms the representative models regarding objective metrics and visual quality.
arXiv Detail & Related papers (2024-10-15T07:35:51Z) - On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey [82.49623756124357]
Zero-shot image recognition (ZSIR) aims at empowering models to recognize and reason in unseen domains.
This paper presents a broad review of recent advances in element-wise ZSIR.
We first attempt to integrate the three basic ZSIR tasks of object recognition, compositional recognition, and foundation model-based open-world recognition into a unified element-wise perspective.
arXiv Detail & Related papers (2024-08-09T05:49:21Z) - CogNarr Ecosystem: Facilitating Group Cognition at Scale [0.0]
This concept paper proposes the CogNarr ecosystem, which is designed to facilitate functional cognition in the large-group setting.
A key perspective is to view a group as an organism that uses some form of cognitive architecture to sense the world.
arXiv Detail & Related papers (2024-07-11T08:59:17Z) - Open Problem: Active Representation Learning [0.0]
We introduce the concept of Active Representation Learning, a novel class of problems that intertwines exploration and representation learning within partially observable environments.
We extend ideas from Active Simultaneous Localization and Mapping (active SLAM), and translate them to scientific discovery problems, exemplified by adaptive microscopy.
arXiv Detail & Related papers (2024-06-06T08:23:22Z) - Graph Enhanced Reinforcement Learning for Effective Group Formation in Collaborative Problem Solving [3.392758494801288]
This study addresses the challenge of forming effective groups in collaborative problem-solving environments.
We propose a novel approach leveraging graph theory and reinforcement learning.
arXiv Detail & Related papers (2024-03-15T04:04:40Z) - Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions [51.71245032890532]
We propose methods enabling an agent acting upon the world to learn internal representations of sensory information consistent with actions that modify it.
In contrast to existing work, our approach does not require prior knowledge of the group and does not restrict the set of actions the agent can perform.
arXiv Detail & Related papers (2022-07-25T11:22:48Z) - Co-Located Human-Human Interaction Analysis using Nonverbal Cues: A
Survey [71.43956423427397]
We aim to identify the nonverbal cues and computational methodologies resulting in effective performance.
This survey differs from its counterparts by involving the widest spectrum of social phenomena and interaction settings.
Some major observations are: the most often used nonverbal cue, computational method, interaction environment, and sensing approach are speaking activity, support vector machines, and meetings composed of 3-4 persons equipped with microphones and cameras, respectively.
arXiv Detail & Related papers (2022-07-20T13:37:57Z) - Hunting Group Clues with Transformers for Social Group Activity
Recognition [3.1061678033205635]
Social group activity recognition requires recognizing multiple sub-group activities and identifying group members.
Most existing methods tackle both tasks by refining region features and then summarizing them into activity features.
We propose to leverage attention modules in transformers to generate effective social group features.
Our method is designed in such a way that the attention modules identify and then aggregate features relevant to social group activities.
arXiv Detail & Related papers (2022-07-12T01:46:46Z) - Improving Facial Attribute Recognition by Group and Graph Learning [34.39507051712628]
Exploiting the relationships between attributes is a key challenge for improving facial attribute recognition.
In this work, we are concerned with two types of correlations that are spatial and non-spatial relationships.
We propose a unified network called Multi-scale Group and Graph Network.
arXiv Detail & Related papers (2021-05-28T13:36:28Z) - Unsupervised Discriminative Embedding for Sub-Action Learning in Complex
Activities [54.615003524001686]
This paper proposes a novel approach for unsupervised sub-action learning in complex activities.
The proposed method maps both visual and temporal representations to a latent space where the sub-actions are learnt discriminatively.
We show that the proposed combination of visual-temporal embedding and discriminative latent concepts allow to learn robust action representations in an unsupervised setting.
arXiv Detail & Related papers (2021-04-30T20:07:27Z) - Recent Progress in Appearance-based Action Recognition [73.6405863243707]
Action recognition is a task to identify various human actions in a video.
Recent appearance-based methods have achieved promising progress towards accurate action recognition.
arXiv Detail & Related papers (2020-11-25T10:18:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.