Convolutions for Spatial Interaction Modeling
- URL: http://arxiv.org/abs/2104.07182v1
- Date: Thu, 15 Apr 2021 00:41:30 GMT
- Title: Convolutions for Spatial Interaction Modeling
- Authors: Zhaoen Su, Chao Wang, David Bradley, Carlos Vallespi-Gonzalez, Carl
Wellington, Nemanja Djuric
- Abstract summary: We consider the problem of spatial interaction modeling in the context of predicting the motion of actors around autonomous vehicles.
We revisit convolutions and show that they can demonstrate comparable performance to graph networks in modeling spatial interactions with lower latency.
- Score: 9.408751013132624
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: In many different fields interactions between objects play a critical role in
determining their behavior. Graph neural networks (GNNs) have emerged as a
powerful tool for modeling interactions, although often at the cost of adding
considerable complexity and latency. In this paper, we consider the problem of
spatial interaction modeling in the context of predicting the motion of actors
around autonomous vehicles, and investigate alternative approaches to GNNs. We
revisit convolutions and show that they can demonstrate comparable performance
to graph networks in modeling spatial interactions with lower latency, thus
providing an effective and efficient alternative in time-critical systems.
Moreover, we propose a novel interaction loss to further improve the
interaction modeling of the considered methods.
Related papers
- Disentangled Neural Relational Inference for Interpretable Motion
Prediction [38.40799770648501]
We develop a variational auto-encoder framework that integrates graph-based representations and timesequence models.
Our model infers dynamic interaction graphs augmented with interpretable edge features that characterize the interactions.
We validate our approach through extensive experiments on both simulated and real-world datasets.
arXiv Detail & Related papers (2024-01-07T22:49:24Z) - Interactive Autonomous Navigation with Internal State Inference and
Interactivity Estimation [58.21683603243387]
We propose three auxiliary tasks with relational-temporal reasoning and integrate them into the standard Deep Learning framework.
These auxiliary tasks provide additional supervision signals to infer the behavior patterns other interactive agents.
Our approach achieves robust and state-of-the-art performance in terms of standard evaluation metrics.
arXiv Detail & Related papers (2023-11-27T18:57:42Z) - Inferring Relational Potentials in Interacting Systems [56.498417950856904]
We propose Neural Interaction Inference with Potentials (NIIP) as an alternative approach to discover such interactions.
NIIP assigns low energy to the subset of trajectories which respect the relational constraints observed.
It allows trajectory manipulation, such as interchanging interaction types across separately trained models, as well as trajectory forecasting.
arXiv Detail & Related papers (2023-10-23T00:44:17Z) - Neural Relational Inference with Fast Modular Meta-learning [25.313516707169498]
Graph neural networks (GNNs) are effective models for many dynamical systems consisting of entities and relations.
Relational inference is the problem of inferring these interactions and learning the dynamics from observational data.
We frame relational inference as a textitmodular meta-learning problem, where neural modules are trained to be composed in different ways to solve many tasks.
arXiv Detail & Related papers (2023-10-10T21:05:13Z) - Persistent-Transient Duality: A Multi-mechanism Approach for Modeling
Human-Object Interaction [58.67761673662716]
Humans are highly adaptable, swiftly switching between different modes to handle different tasks, situations and contexts.
In Human-object interaction (HOI) activities, these modes can be attributed to two mechanisms: (1) the large-scale consistent plan for the whole activity and (2) the small-scale children interactive actions that start and end along the timeline.
This work proposes to model two concurrent mechanisms that jointly control human motion.
arXiv Detail & Related papers (2023-07-24T12:21:33Z) - Dynamic Scenario Representation Learning for Motion Forecasting with
Heterogeneous Graph Convolutional Recurrent Networks [25.383615554172778]
We resort to dynamic heterogeneous graphs to model the evolving scenario.
We design a novel heterogeneous graphal recurrent network, aggregating diverse interaction information.
With a motion forecasting decoder, our model predicts realistic and multi-modal future trajectories of agents.
arXiv Detail & Related papers (2023-03-08T04:10:04Z) - Temporal Relevance Analysis for Video Action Models [70.39411261685963]
We first propose a new approach to quantify the temporal relationships between frames captured by CNN-based action models.
We then conduct comprehensive experiments and in-depth analysis to provide a better understanding of how temporal modeling is affected.
arXiv Detail & Related papers (2022-04-25T19:06:48Z) - VIRT: Improving Representation-based Models for Text Matching through
Virtual Interaction [50.986371459817256]
We propose a novel textitVirtual InteRacTion mechanism, termed as VIRT, to enable full and deep interaction modeling in representation-based models.
VIRT asks representation-based encoders to conduct virtual interactions to mimic the behaviors as interaction-based models do.
arXiv Detail & Related papers (2021-12-08T09:49:28Z) - Multi-agent Trajectory Prediction with Fuzzy Query Attention [15.12743751614964]
Trajectory prediction for scenes with multiple agents is a challenging problem in numerous domains such as traffic prediction, pedestrian tracking and path planning.
We present a general architecture to address this challenge which models the crucial inductive biases of motion, namely, inertia, relative motion, intents and interactions.
arXiv Detail & Related papers (2020-10-29T19:12:12Z) - EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational
Reasoning [41.42230144157259]
We propose a generic trajectory forecasting framework with explicit relational structure recognition and prediction via latent interaction graphs.
Considering the uncertainty of future behaviors, the model is designed to provide multi-modal prediction hypotheses.
We introduce a double-stage training pipeline which not only improves training efficiency and accelerates convergence, but also enhances model performance.
arXiv Detail & Related papers (2020-03-31T02:49:23Z) - Spatial-Temporal Transformer Networks for Traffic Flow Forecasting [74.76852538940746]
We propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) to improve the accuracy of long-term traffic forecasting.
Specifically, we present a new variant of graph neural networks, named spatial transformer, by dynamically modeling directed spatial dependencies.
The proposed model enables fast and scalable training over a long range spatial-temporal dependencies.
arXiv Detail & Related papers (2020-01-09T10:21:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.