Measuring and Modeling Physical Intrinsic Motivation
- URL: http://arxiv.org/abs/2305.13452v3
- Date: Mon, 7 Aug 2023 19:57:38 GMT
- Title: Measuring and Modeling Physical Intrinsic Motivation
- Authors: Julio Martinez, Felix Binder, Haoliang Wang, Nick Haber, Judith Fan,
Daniel L. K. Yamins
- Abstract summary: Humans are interactive agents driven to seek out situations with interesting physical dynamics.
We first collect ratings of how interesting humans find a variety of physics scenarios.
We then model human interestingness responses by implementing various hypotheses of intrinsic motivation.
- Score: 4.995872423496944
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Humans are interactive agents driven to seek out situations with interesting
physical dynamics. Here we formalize the functional form of physical intrinsic
motivation. We first collect ratings of how interesting humans find a variety
of physics scenarios. We then model human interestingness responses by
implementing various hypotheses of intrinsic motivation including models that
rely on simple scene features to models that depend on forward physics
prediction. We find that the single best predictor of human responses is
adversarial reward, a model derived from physical prediction loss. We also find
that simple scene feature models do not generalize their prediction of human
responses across all scenarios. Finally, linearly combining the adversarial
model with the number of collisions in a scene leads to the greatest
improvement in predictivity of human responses, suggesting humans are driven
towards scenarios that result in high information gain and physical activity.
Related papers
- WoW: Towards a World omniscient World model Through Embodied Interaction [83.43543124512719]
Authentic physical intuition of the world model must be grounded in extensive, causally rich interactions with the real world.<n>We present WoW, a generative world model trained on 2 million robot interaction trajectories.<n>We establish WoWBench, a new benchmark focused on physical consistency and causal reasoning in video.
arXiv Detail & Related papers (2025-09-26T17:59:07Z) - Whole-Body Conditioned Egocentric Video Prediction [98.94980209293776]
We train models to Predict Ego-centric Video from human Actions (PEVA)<n>By conditioning on kinematic pose trajectories, structured by the joint hierarchy of the body, our model learns to simulate how physical human actions shape the environment from a first-person point of view.<n>Our work represents an initial attempt to tackle the challenges of modeling complex real-world environments and embodied agent behaviors with video prediction from the perspective of a human.
arXiv Detail & Related papers (2025-06-26T17:59:59Z) - Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli [10.978614683038758]
We evaluate a broad range of optical flow models and a neuroscience inspired motion energy model for zero-shot figure-ground segmentation.
We find that a cross section of 40 deep optical flow models trained on different datasets struggle to estimate motion patterns in random dot videos.
This neuroscience-inspired model successfully addresses the lack of human-like zero-shot generalization to random dot stimuli in current computer vision models.
arXiv Detail & Related papers (2024-11-03T09:59:45Z) - HUMOS: Human Motion Model Conditioned on Body Shape [54.20419874234214]
We introduce a new approach to develop a generative motion model based on body shape.
We show that it's possible to train this model using unpaired data.
The resulting model generates diverse, physically plausible, and dynamically stable human motions.
arXiv Detail & Related papers (2024-09-05T23:50:57Z) - Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption [64.07607726562841]
Existing multi-person human reconstruction approaches mainly focus on recovering accurate poses or avoiding penetration.
In this work, we tackle the task of reconstructing closely interactive humans from a monocular video.
We propose to leverage knowledge from proxemic behavior and physics to compensate the lack of visual information.
arXiv Detail & Related papers (2024-04-17T11:55:45Z) - FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction [39.810254311528354]
We introduce the FORCE model, a kinematic approach for diverse, nuanced human-object interactions by modeling physical attributes.
Our key insight is that human motion is dictated by the interrelation between the force exerted by the human and the perceived resistance.
Experiments also demonstrate incorporating human force facilitates learning multi-class motion.
arXiv Detail & Related papers (2024-03-17T14:52:05Z) - Physion++: Evaluating Physical Scene Understanding that Requires Online
Inference of Different Physical Properties [100.19685489335828]
This work proposes a novel dataset and benchmark, termed Physion++, to rigorously evaluate visual physical prediction in artificial systems.
We test scenarios where accurate prediction relies on estimates of properties such as mass, friction, elasticity, and deformability.
We evaluate the performance of a number of state-of-the-art prediction models that span a variety of levels of learning vs. built-in knowledge, and compare that performance to a set of human predictions.
arXiv Detail & Related papers (2023-06-27T17:59:33Z) - Neural Foundations of Mental Simulation: Future Prediction of Latent
Representations on Dynamic Scenes [3.2744507958793143]
We combine a goal-driven modeling approach with dense neurophysiological data and human behavioral readouts to impinge on this question.
Specifically, we construct and evaluate several classes of sensory-cognitive networks to predict the future state of rich, ethologically-relevant environments.
We find strong differentiation across these model classes in their ability to predict neural and behavioral data both within and across diverse environments.
arXiv Detail & Related papers (2023-05-19T15:56:06Z) - Learn to Predict How Humans Manipulate Large-sized Objects from
Interactive Motions [82.90906153293585]
We propose a graph neural network, HO-GCN, to fuse motion data and dynamic descriptors for the prediction task.
We show the proposed network that consumes dynamic descriptors can achieve state-of-the-art prediction results and help the network better generalize to unseen objects.
arXiv Detail & Related papers (2022-06-25T09:55:39Z) - Physion: Evaluating Physical Prediction from Vision in Humans and
Machines [46.19008633309041]
We present a visual and physical prediction benchmark that precisely measures this capability.
We compare an array of algorithms on their ability to make diverse physical predictions.
We find that graph neural networks with access to the physical state best capture human behavior.
arXiv Detail & Related papers (2021-06-15T16:13:39Z) - 3D Human motion anticipation and classification [8.069283749930594]
We propose a novel sequence-to-sequence model for human motion prediction and feature learning.
Our model learns to predict multiple future sequences of human poses from the same input sequence.
We show that it takes less than half the number of epochs to train an activity recognition network by using the feature learned from the discriminator.
arXiv Detail & Related papers (2020-12-31T00:19:39Z) - Visual Grounding of Learned Physical Models [66.04898704928517]
Humans intuitively recognize objects' physical properties and predict their motion, even when the objects are engaged in complicated interactions.
We present a neural model that simultaneously reasons about physics and makes future predictions based on visual and dynamics priors.
Experiments show that our model can infer the physical properties within a few observations, which allows the model to quickly adapt to unseen scenarios and make accurate predictions into the future.
arXiv Detail & Related papers (2020-04-28T17:06:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.