Related papers: Motion In-Betweening for Densely Interacting Characters

URL: http://arxiv.org/abs/2510.00314v1
Date: Tue, 30 Sep 2025 22:11:39 GMT
Title: Motion In-Betweening for Densely Interacting Characters
Authors: Xiaotang Zhang, Ziyi Chang, Qianhui Men, Hubert P. H. Shum
Abstract summary: Motion in-betweening is the problem to synthesize movement between keyposes. We present a method for long-horizon interaction in-betweening that enables two characters to engage and respond to one another naturally.
Score: 17.863671809124295
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Motion in-betweening is the problem to synthesize movement between keyposes. Traditional research focused primarily on single characters. Extending them to densely interacting characters is highly challenging, as it demands precise spatial-temporal correspondence between the characters to maintain the interaction, while creating natural transitions towards predefined keyposes. In this research, we present a method for long-horizon interaction in-betweening that enables two characters to engage and respond to one another naturally. To effectively represent and synthesize interactions, we propose a novel solution called Cross-Space In-Betweening, which models the interactions of each character across different conditioning representation spaces. We further observe that the significantly increased constraints in interacting characters heavily limit the solution space, leading to degraded motion quality and diminished interaction over time. To enable long-horizon synthesis, we present two solutions to maintain long-term interaction and motion quality, thereby keeping synthesis in the stable region of the solution space. We first sustain interaction quality by identifying periodic interaction patterns through adversarial learning. We further maintain the motion quality by learning to refine the drifted latent space and prevent pose error accumulation. We demonstrate that our approach produces realistic, controllable, and long-horizon in-between motions of two characters with dynamic boxing and dancing actions across multiple keyposes, supported by extensive quantitative evaluations and user studies.

Related papers

Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models [80.28579390566298]
We introduce Interact2Ar, a text-conditioned autoregressive diffusion model for generating full-body, human-human interactions. Hand kinematics are incorporated through dedicated parallel branches, enabling high-fidelity full-body generation. Our model enables a series of downstream applications, including temporal motion composition, real-time adaptation to disturbances, and extension beyond dyadic to multi-person scenarios.
arXiv Detail & Related papers (2025-12-22T18:59:50Z)
MoReact: Generating Reactive Motion from Textual Descriptions [57.642436102978245]
MoReact is a diffusion-based method designed to disentangle the generation of global trajectories and local motions sequentially. Our experiments, utilizing data adapted from a two-person motion dataset, demonstrate the efficacy of our approach.
arXiv Detail & Related papers (2025-09-28T14:31:41Z)
InterAct: A Large-Scale Dataset of Dynamic, Expressive and Interactive Activities between Two People in Daily Scenarios [40.42003202491803]
We propose to simultaneously model two people's activities, and target objective-driven, dynamic, and semantically consistent interactions. We capture a new multi-modal dataset dubbed InterAct composed of 241 motion sequences. InterAct contains diverse and complex motions of individuals and interesting and relatively long-term interaction patterns barely seen before.
arXiv Detail & Related papers (2025-09-06T15:36:47Z)
InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild [65.29569330744056]
We present Interleaved Learning for Motion Synthesis (InterSyn), a novel framework that targets the generation of realistic interaction motions. InterSyn employs an interleaved learning strategy to capture the natural, dynamic interactions and nuanced coordination inherent in real-world scenarios.
arXiv Detail & Related papers (2025-08-14T03:00:06Z)
Large-Scale Multi-Character Interaction Synthesis [13.992868723420836]
We propose a conditional generative pipeline comprising a coordinatable multi-character interaction space for interaction synthesis and a transition planning network for coordinations. Existing datasets either do not have multiple characters or do not have close and dense interactions.
arXiv Detail & Related papers (2025-05-20T08:49:27Z)
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer [24.166147954731652]
Multi-person interactive motion generation is a critical yet under-explored domain in computer character animation. Current research often employs separate module branches for individual motions, leading to a loss of interaction information. We propose a novel, unified approach that models multi-person motions and their interactions within a single latent space.
arXiv Detail & Related papers (2024-12-21T15:35:50Z)
Two-Person Interaction Augmentation with Skeleton Priors [16.65884142618145]
We propose a new deep learning method for two-body skeletal interaction motion augmentation. Our system can learn effectively from a relatively small amount of data and generalize to drastically different skeleton sizes.
arXiv Detail & Related papers (2024-04-08T13:11:57Z)
ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions [66.87211993793807]
We present ReMoS, a denoising diffusion based model that synthesizes full body motion of a person in two person interaction scenario. We demonstrate ReMoS across challenging two person scenarios such as pair dancing, Ninjutsu, kickboxing, and acrobatics. We also contribute the ReMoCap dataset for two person interactions containing full body and finger motions.
arXiv Detail & Related papers (2023-11-28T18:59:52Z)
InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint [67.6297384588837]
We introduce a novel controllable motion generation method, InterControl, to encourage the synthesized motions maintaining the desired distance between joint pairs. We demonstrate that the distance between joint pairs for human-wise interactions can be generated using an off-the-shelf Large Language Model.
arXiv Detail & Related papers (2023-11-27T14:32:33Z)
Interaction Transformer for Human Reaction Generation [61.22481606720487]
We propose a novel interaction Transformer (InterFormer) consisting of a Transformer network with both temporal and spatial attentions. Our method is general and can be used to generate more complex and long-term interactions.
arXiv Detail & Related papers (2022-07-04T19:30:41Z)

This list is automatically generated from the titles and abstracts of the papers on this site.