ArticFlow: Generative Simulation of Articulated Mechanisms
- URL: http://arxiv.org/abs/2511.17883v1
- Date: Sat, 22 Nov 2025 02:19:53 GMT
- Title: ArticFlow: Generative Simulation of Articulated Mechanisms
- Authors: Jiong Lin, Jinchen Ruan, Hod Lipson,
- Abstract summary: ArticFlow is a two-stage flow matching framework that learns a controllable velocity field from noise to target point sets under explicit action control.<n>On MuJoCo Menagerie, ArticFlow functions both as a generative model and as a neural simulator.
- Score: 5.861206243996455
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advances in generative models have produced strong results for static 3D shapes, whereas articulated 3D generation remains challenging due to action-dependent deformations and limited datasets. We introduce ArticFlow, a two-stage flow matching framework that learns a controllable velocity field from noise to target point sets under explicit action control. ArticFlow couples (i) a latent flow that transports noise to a shape-prior code and (ii) a point flow that transports points conditioned on the action and the shape prior, enabling a single model to represent diverse articulated categories and generalize across actions. On MuJoCo Menagerie, ArticFlow functions both as a generative model and as a neural simulator: it predicts action-conditioned kinematics from a compact prior and synthesizes novel morphologies via latent interpolation. Compared with object-specific simulators and an action-conditioned variant of static point-cloud generators, ArticFlow achieves higher kinematic accuracy and better shape quality. Results show that action-conditioned flow matching is a practical route to controllable and high-quality articulated mechanism generation.
Related papers
- TriC-Motion: Tri-Domain Causal Modeling Grounded Text-to-Motion Generation [61.94780858309546]
Tri-Domain Causal Text-to-Motion Generation (TriC-Motion) is a novel diffusion-based framework integrating spatial-domain modeling with causal intervention.<n>TriC-Motion achieves superior performance compared to state-of-the-art methods, attaining an outstanding R@1 of 0.612 on the HumanML3D dataset.
arXiv Detail & Related papers (2026-02-09T10:12:13Z) - URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model [76.08429266631823]
We propose an end-to-end automatic reconstruction framework based on a 3D multimodal large language model (MLLM)<n>URDF-Anything utilizes an autoregressive prediction framework based on point-cloud and text multimodal input to jointly optimize geometric segmentation and kinematic parameter prediction.<n> Experiments on both simulated and real-world datasets demonstrate that our method significantly outperforms existing approaches.
arXiv Detail & Related papers (2025-11-02T13:45:51Z) - FlowMotion: Target-Predictive Conditional Flow Matching for Jitter-Reduced Text-Driven Human Motion Generation [0.6554326244334868]
FlowMotion incorporates a training objective that focuses on more accurately predicting target motion in 3D human motion generation.<n>FlowMotion achieves state-of-the-art jitter performance, achieving the best jitter in the KIT dataset and the second-best jitter in the HumanML3D dataset.
arXiv Detail & Related papers (2025-04-02T03:55:21Z) - Not-So-Optimal Transport Flows for 3D Point Cloud Generation [58.164908756416615]
Learning generative models of 3D point clouds is one of the fundamental problems in 3D generative learning.<n>In this paper, we analyze the recently proposed equivariant OT flows that learn permutation invariant generative models for point-based molecular data.<n>We show that our proposed model outperforms prior diffusion- and flow-based approaches on a wide range of unconditional generation and shape completion.
arXiv Detail & Related papers (2025-02-18T02:37:34Z) - ActionFlow: Equivariant, Accurate, and Efficient Policies with Spatially Symmetric Flow Matching [20.20511152176522]
ActionFlow is a policy class that integrates spatial symmetry inductive biases.
On the representation level, ActionFlow introduces an SE(3) Invariant Transformer architecture.
For action generation, ActionFlow leverages Flow Matching, a state-of-the-art deep generative model.
arXiv Detail & Related papers (2024-09-06T19:30:36Z) - Generalizable Implicit Motion Modeling for Video Frame Interpolation [51.966062283735596]
Motion is critical in flow-based Video Frame Interpolation (VFI)<n>We introduce General Implicit Motion Modeling (IMM), a novel and effective approach to motion modeling VFI.<n>Our GIMM can be easily integrated with existing flow-based VFI works by supplying accurately modeled motion.
arXiv Detail & Related papers (2024-07-11T17:13:15Z) - SemlaFlow -- Efficient 3D Molecular Generation with Latent Attention and Equivariant Flow Matching [43.56824843205882]
Semla is a scalable E(3)-equivariant message passing architecture.<n>SemlaFlow is trained to generate a joint distribution over atom types, coordinates, bond types and formal charges.<n>Our model produces state-of-the-art results on benchmark datasets with as few as 20 sampling steps.
arXiv Detail & Related papers (2024-06-11T13:51:51Z) - Guided Flows for Generative Modeling and Decision Making [55.42634941614435]
We show that Guided Flows significantly improves the sample quality in conditional image generation and zero-shot text synthesis-to-speech.
Notably, we are first to apply flow models for plan generation in the offline reinforcement learning setting ax speedup in compared to diffusion models.
arXiv Detail & Related papers (2023-11-22T15:07:59Z) - Deep Probabilistic Movement Primitives with a Bayesian Aggregator [4.796643369294991]
Movement primitives are trainable parametric models that reproduce robotic movements starting from a limited set of demonstrations.
This paper proposes a deep movement primitive architecture that encodes all the operations above and uses a Bayesian context aggregator.
arXiv Detail & Related papers (2023-07-11T09:34:15Z) - From Zero to Turbulence: Generative Modeling for 3D Flow Simulation [45.626346087828765]
We propose to approach turbulent flow simulation as a generative task directly learning the manifold of all possible turbulent flow states without relying on any initial flow state.
Our generative model captures the distribution of turbulent flows caused by unseen objects and generates high-quality, realistic samples for downstream applications.
arXiv Detail & Related papers (2023-05-29T18:20:28Z) - MoCo-Flow: Neural Motion Consensus Flow for Dynamic Humans in Stationary
Monocular Cameras [98.40768911788854]
We introduce MoCo-Flow, a representation that models the dynamic scene using a 4D continuous time-variant function.
At the heart of our work lies a novel optimization formulation, which is constrained by a motion consensus regularization on the motion flow.
We extensively evaluate MoCo-Flow on several datasets that contain human motions of varying complexity.
arXiv Detail & Related papers (2021-06-08T16:03:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.