Persistent-Transient Duality: A Multi-mechanism Approach for Modeling
Human-Object Interaction
- URL: http://arxiv.org/abs/2307.12729v1
- Date: Mon, 24 Jul 2023 12:21:33 GMT
- Title: Persistent-Transient Duality: A Multi-mechanism Approach for Modeling
Human-Object Interaction
- Authors: Hung Tran, Vuong Le, Svetha Venkatesh, Truyen Tran
- Abstract summary: Humans are highly adaptable, swiftly switching between different modes to handle different tasks, situations and contexts.
In Human-object interaction (HOI) activities, these modes can be attributed to two mechanisms: (1) the large-scale consistent plan for the whole activity and (2) the small-scale children interactive actions that start and end along the timeline.
This work proposes to model two concurrent mechanisms that jointly control human motion.
- Score: 58.67761673662716
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Humans are highly adaptable, swiftly switching between different modes to
progressively handle different tasks, situations and contexts. In Human-object
interaction (HOI) activities, these modes can be attributed to two mechanisms:
(1) the large-scale consistent plan for the whole activity and (2) the
small-scale children interactive actions that start and end along the timeline.
While neuroscience and cognitive science have confirmed this multi-mechanism
nature of human behavior, machine modeling approaches for human motion are
trailing behind. While attempted to use gradually morphing structures (e.g.,
graph attention networks) to model the dynamic HOI patterns, they miss the
expeditious and discrete mode-switching nature of the human motion. To bridge
that gap, this work proposes to model two concurrent mechanisms that jointly
control human motion: the Persistent process that runs continually on the
global scale, and the Transient sub-processes that operate intermittently on
the local context of the human while interacting with objects. These two
mechanisms form an interactive Persistent-Transient Duality that
synergistically governs the activity sequences. We model this conceptual
duality by a parent-child neural network of Persistent and Transient channels
with a dedicated neural module for dynamic mechanism switching. The framework
is trialed on HOI motion forecasting. On two rich datasets and a wide variety
of settings, the model consistently delivers superior performances, proving its
suitability for the challenge.
Related papers
- in2IN: Leveraging individual Information to Generate Human INteractions [29.495166514135295]
We introduce in2IN, a novel diffusion model for human-human motion generation conditioned on individual descriptions.
We also propose DualMDM, a model composition technique that combines the motions generated with in2IN and the motions generated by a single-person motion prior pre-trained on HumanML3D.
arXiv Detail & Related papers (2024-04-15T17:59:04Z) - ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions [66.87211993793807]
We present ReMoS, a denoising diffusion based model that synthesizes full body motion of a person in two person interaction scenario.
We demonstrate ReMoS across challenging two person scenarios such as pair dancing, Ninjutsu, kickboxing, and acrobatics.
We also contribute the ReMoCap dataset for two person interactions containing full body and finger motions.
arXiv Detail & Related papers (2023-11-28T18:59:52Z) - InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint [67.6297384588837]
We introduce a novel controllable motion generation method, InterControl, to encourage the synthesized motions maintaining the desired distance between joint pairs.
We demonstrate that the distance between joint pairs for human-wise interactions can be generated using an off-the-shelf Large Language Model.
arXiv Detail & Related papers (2023-11-27T14:32:33Z) - Stochastic Multi-Person 3D Motion Forecasting [21.915057426589744]
We deal with the ignored real-world complexities in prior work on human motion forecasting.
Our framework is general; we instantiate it with different generative models.
Our approach produces diverse and accurate multi-person predictions, significantly outperforming the state of the art.
arXiv Detail & Related papers (2023-06-08T17:59:09Z) - InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions [49.097973114627344]
We present InterGen, an effective diffusion-based approach that incorporates human-to-human interactions into the motion diffusion process.
We first contribute a multimodal dataset, named InterHuman. It consists of about 107M frames for diverse two-person interactions, with accurate skeletal motions and 23,337 natural language descriptions.
We propose a novel representation for motion input in our interaction diffusion model, which explicitly formulates the global relations between the two performers in the world frame.
arXiv Detail & Related papers (2023-04-12T08:12:29Z) - Task-Oriented Human-Object Interactions Generation with Implicit Neural
Representations [61.659439423703155]
TOHO: Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations.
Our method generates continuous motions that are parameterized only by the temporal coordinate.
This work takes a step further toward general human-scene interaction simulation.
arXiv Detail & Related papers (2023-03-23T09:31:56Z) - Persistent-Transient Duality in Human Behavior Modeling [58.67761673662716]
We propose to model the persistent-transient duality in human behavior using a parent-child multi-channel neural network.
Our model shows superior performances in human-object interaction motion prediction.
arXiv Detail & Related papers (2022-04-21T04:29:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.