Decentralized policy learning with partial observation and mechanical
constraints for multiperson modeling
- URL: http://arxiv.org/abs/2007.03155v2
- Date: Fri, 1 Dec 2023 14:19:31 GMT
- Title: Decentralized policy learning with partial observation and mechanical
constraints for multiperson modeling
- Authors: Keisuke Fujii, Naoya Takeishi, Yoshinobu Kawahara, Kazuya Takeda
- Abstract summary: We propose sequential generative models with partial observation and mechanical constraints in a decentralized manner.
Our approach can be used as a multi-agent simulator to generate realistic trajectories using real-world data.
- Score: 14.00358511581803
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Extracting the rules of real-world multi-agent behaviors is a current
challenge in various scientific and engineering fields. Biological agents
independently have limited observation and mechanical constraints; however,
most of the conventional data-driven models ignore such assumptions, resulting
in lack of biological plausibility and model interpretability for behavioral
analyses. Here we propose sequential generative models with partial observation
and mechanical constraints in a decentralized manner, which can model agents'
cognition and body dynamics, and predict biologically plausible behaviors. We
formulate this as a decentralized multi-agent imitation-learning problem,
leveraging binary partial observation and decentralized policy models based on
hierarchical variational recurrent neural networks with physical and
biomechanical penalties. Using real-world basketball and soccer datasets, we
show the effectiveness of our method in terms of the constraint violations,
long-term trajectory prediction, and partial observation. Our approach can be
used as a multi-agent simulator to generate realistic trajectories using
real-world data.
Related papers
- Causal Representation Learning from Multimodal Biological Observations [57.00712157758845]
We aim to develop flexible identification conditions for multimodal data.
We establish identifiability guarantees for each latent component, extending the subspace identification results from prior work.
Our key theoretical ingredient is the structural sparsity of the causal connections among distinct modalities.
arXiv Detail & Related papers (2024-11-10T16:40:27Z) - Back to Bayesics: Uncovering Human Mobility Distributions and Anomalies with an Integrated Statistical and Neural Framework [14.899157568336731]
DeepBayesic is a novel framework that integrates Bayesian principles with deep neural networks to model the underlying distributions.
We evaluate our approach on several mobility datasets, demonstrating significant improvements over state-of-the-art anomaly detection methods.
arXiv Detail & Related papers (2024-10-01T19:02:06Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z) - EINNs: Epidemiologically-Informed Neural Networks [75.34199997857341]
We introduce a new class of physics-informed neural networks-EINN-crafted for epidemic forecasting.
We investigate how to leverage both the theoretical flexibility provided by mechanistic models as well as the data-driven expressability afforded by AI models.
arXiv Detail & Related papers (2022-02-21T18:59:03Z) - Learning interaction rules from multi-animal trajectories via augmented
behavioral models [8.747278400158718]
Granger causality is a practical framework for analyzing the interactions from observed time-series data.
This framework ignores the structures of the generative process in animal behaviors.
We propose a new framework for learning Granger causality from multi-animal trajectories.
arXiv Detail & Related papers (2021-07-12T11:33:56Z) - Multi-Agent Imitation Learning with Copulas [102.27052968901894]
Multi-agent imitation learning aims to train multiple agents to perform tasks from demonstrations by learning a mapping between observations and actions.
In this paper, we propose to use copula, a powerful statistical tool for capturing dependence among random variables, to explicitly model the correlation and coordination in multi-agent systems.
Our proposed model is able to separately learn marginals that capture the local behavioral patterns of each individual agent, as well as a copula function that solely and fully captures the dependence structure among agents.
arXiv Detail & Related papers (2021-07-10T03:49:41Z) - The Role of Isomorphism Classes in Multi-Relational Datasets [6.419762264544509]
We show that isomorphism leakage overestimates performance in multi-relational inference.
We propose isomorphism-aware synthetic benchmarks for model evaluation.
We also demonstrate that isomorphism classes can be utilised through a simple prioritisation scheme.
arXiv Detail & Related papers (2020-09-30T12:15:24Z) - Prediction with Approximated Gaussian Process Dynamical Models [7.678864239473703]
We present approximated GPDMs which are Markov and analyze their control theoretical properties.
The outcomes are illustrated with numerical examples that show the power of the approximated models.
arXiv Detail & Related papers (2020-06-25T16:51:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.