Real-time Trajectory-based Social Group Detection
- URL: http://arxiv.org/abs/2304.05678v1
- Date: Wed, 12 Apr 2023 08:01:43 GMT
- Title: Real-time Trajectory-based Social Group Detection
- Authors: Simindokht Jahangard, Munawar Hayat and Hamid Rezatofighi
- Abstract summary: We propose a simple and efficient framework for social group detection.
Our approach explores the impact of motion trajectory on social grouping and utilizes a novel, reliable, and fast data-driven method.
Our experiments on the popular JRDBAct dataset reveal noticeable improvements in performance, with relative improvements ranging from 2% to 11%.
- Score: 22.86110112028644
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Social group detection is a crucial aspect of various robotic applications,
including robot navigation and human-robot interactions. To date, a range of
model-based techniques have been employed to address this challenge, such as
the F-formation and trajectory similarity frameworks. However, these approaches
often fail to provide reliable results in crowded and dynamic scenarios. Recent
advancements in this area have mainly focused on learning-based methods, such
as deep neural networks that use visual content or human pose. Although visual
content-based methods have demonstrated promising performance on large-scale
datasets, their computational complexity poses a significant barrier to their
practical use in real-time applications. To address these issues, we propose a
simple and efficient framework for social group detection. Our approach
explores the impact of motion trajectory on social grouping and utilizes a
novel, reliable, and fast data-driven method. We formulate the individuals in a
scene as a graph, where the nodes are represented by LSTM-encoded trajectories
and the edges are defined by the distances between each pair of tracks. Our
framework employs a modified graph transformer module and graph clustering
losses to detect social groups. Our experiments on the popular JRDBAct dataset
reveal noticeable improvements in performance, with relative improvements
ranging from 2% to 11%. Furthermore, our framework is significantly faster,
with up to 12x faster inference times compared to state-of-the-art methods
under the same computation resources. These results demonstrate that our
proposed method is suitable for real-time robotic applications.
Related papers
- Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition [6.468510459310326]
We introduce the concept of sample space in place recognition and demonstrate how different sampling techniques affect the query process and overall performance.
We then present a novel sampling approach for LiDAR-based place recognition, which focuses on redundancy and information preservation in the hyper-dimensional descriptor space.
arXiv Detail & Related papers (2024-10-03T16:29:47Z) - POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation [76.67608003501479]
We introduce and specify an evaluation protocol defining a range of domain-related metrics computed on the basics of the primary evaluation indicators.
The results of such a comparison, which involves a variety of state-of-the-art MARL, search-based, and hybrid methods, are presented.
arXiv Detail & Related papers (2024-07-20T16:37:21Z) - Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation [55.65482030032804]
Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning.
We propose a systematic relational reasoning approach with explicit inference of the underlying dynamically evolving relational structures.
Our approach infers dynamically evolving relation graphs and hypergraphs to capture the evolution of relations, which the trajectory predictor employs to generate future states.
arXiv Detail & Related papers (2024-01-22T18:58:22Z) - Surface EMG-Based Inter-Session/Inter-Subject Gesture Recognition by
Leveraging Lightweight All-ConvNet and Transfer Learning [17.535392299244066]
Gesture recognition using low-resolution instantaneous HD-sEMG images opens up new avenues for the development of more fluid and natural muscle-computer interfaces.
The data variability between inter-session and inter-subject scenarios presents a great challenge.
Existing approaches employed very large and complex deep ConvNet or 2SRNN-based domain adaptation methods to approximate the distribution shift caused by these inter-session and inter-subject data variability.
We propose a lightweight All-ConvNet+TL model that leverages lightweight All-ConvNet and transfer learning (TL) for the enhancement of inter-session and inter-subject gesture recognition
arXiv Detail & Related papers (2023-05-13T21:47:55Z) - Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
Action Recognition [88.34182299496074]
Action labels are only available on a source dataset, but unavailable on a target dataset in the training stage.
We utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.
By segmenting and permuting temporal segments or human body parts, we design two self-supervised learning classification tasks.
arXiv Detail & Related papers (2022-07-17T07:05:39Z) - Domain Adaptive Robotic Gesture Recognition with Unsupervised
Kinematic-Visual Data Alignment [60.31418655784291]
We propose a novel unsupervised domain adaptation framework which can simultaneously transfer multi-modality knowledge, i.e., both kinematic and visual data, from simulator to real robot.
It remedies the domain gap with enhanced transferable features by using temporal cues in videos, and inherent correlations in multi-modal towards recognizing gesture.
Results show that our approach recovers the performance with great improvement gains, up to 12.91% in ACC and 20.16% in F1score without using any annotations in real robot.
arXiv Detail & Related papers (2021-03-06T09:10:03Z) - Straggler-Resilient Federated Learning: Leveraging the Interplay Between
Statistical Accuracy and System Heterogeneity [57.275753974812666]
Federated learning involves learning from data samples distributed across a network of clients while the data remains local.
In this paper, we propose a novel straggler-resilient federated learning method that incorporates statistical characteristics of the clients' data to adaptively select the clients in order to speed up the learning procedure.
arXiv Detail & Related papers (2020-12-28T19:21:14Z) - Memory Group Sampling Based Online Action Recognition Using Kinetic
Skeleton Features [4.674689979981502]
We propose two core ideas to handle the online action recognition problem.
First, we combine the spatial and temporal skeleton features to depict the actions.
Second, we propose a memory group sampling method to combine the previous action frames and current action frames.
Third, an improved 1D CNN network is employed for training and testing using the features from sampled frames.
arXiv Detail & Related papers (2020-11-01T16:43:08Z) - Human Trajectory Forecasting in Crowds: A Deep Learning Perspective [89.4600982169]
We present an in-depth analysis of existing deep learning-based methods for modelling social interactions.
We propose two knowledge-based data-driven methods to effectively capture these social interactions.
We develop a large scale interaction-centric benchmark TrajNet++, a significant yet missing component in the field of human trajectory forecasting.
arXiv Detail & Related papers (2020-07-07T17:19:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.