Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
- URL: http://arxiv.org/abs/2510.04978v3
- Date: Sun, 19 Oct 2025 02:43:11 GMT
- Title: Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
- Authors: Kun Xiang, Terry Jingchen Zhang, Yinya Huang, Jixi He, Zirong Liu, Yueling Tang, Ruizhe Zhou, Lijing Luo, Youpeng Wen, Xiuwei Chen, Bingqian Lin, Jianhua Han, Hang Xu, Hanhui Li, Bin Dong, Xiaodan Liang,
- Abstract summary: We advocate for intelligent systems that ground learning in both physical principles and embodied reasoning processes.<n>Our synthesis envisions next-generation world models capable of explaining physical phenomena and predicting future states.
- Score: 57.44526951497041
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The rapid advancement of embodied intelligence and world models has intensified efforts to integrate physical laws into AI systems, yet physical perception and symbolic physics reasoning have developed along separate trajectories without a unified bridging framework. This work provides a comprehensive overview of physical AI, establishing clear distinctions between theoretical physics reasoning and applied physical understanding while systematically examining how physics-grounded methods enhance AI's real-world comprehension across structured symbolic reasoning, embodied systems, and generative models. Through rigorous analysis of recent advances, we advocate for intelligent systems that ground learning in both physical principles and embodied reasoning processes, transcending pattern recognition toward genuine understanding of physical laws. Our synthesis envisions next-generation world models capable of explaining physical phenomena and predicting future states, advancing safe, generalizable, and interpretable AI systems. We maintain a continuously updated resource at https://github.com/AI4Phys/Awesome-AI-for-Physics.
Related papers
- PAI-Bench: A Comprehensive Benchmark For Physical AI [70.22914615084215]
Video generative models often struggle to maintain physically coherent dynamics.<n>Multi-modal large language models exhibit limited performance in forecasting and causal interpretation.<n>These observations suggest that current systems are still at an early stage in handling the perceptual and predictive demands of Physical AI.
arXiv Detail & Related papers (2025-12-01T18:47:39Z) - AI4X Roadmap: Artificial Intelligence for the advancement of scientific pursuit and its future directions [65.44445343399126]
We look at AI-enabled science across biology, chemistry, climate science, mathematics, materials science, physics, self-driving laboratories and unconventional computing.<n>Several shared themes emerge: the need for diverse and trustworthy data, transferable electronic-structure and interatomic models, AI systems integrated into end-to-end scientific synthesis.<n>Across domains, we highlight how large foundation models, active learning and self-driving laboratories can close loops between prediction and validation.
arXiv Detail & Related papers (2025-11-26T02:10:28Z) - Digital Gene: Learning about the Physical World through Analytic Concepts [54.21005370169846]
AI systems still struggle when it comes to understanding and interacting with the physical world.<n>This research introduces the idea of analytic concept.<n>It provides machine intelligence a portal to perceive, reason about, and interact with the physical world.
arXiv Detail & Related papers (2025-04-05T13:22:11Z) - Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning [77.6397528430433]
We present Physical AI models that can understand the physical world and generate appropriate embodied decisions.<n>To represent physical common sense, we use a hierarchical ontology that captures fundamental knowledge about space, time, and physics.<n>For embodied reasoning, we rely on a two-dimensional ontology that generalizes across different physical embodiments.
arXiv Detail & Related papers (2025-03-18T22:06:58Z) - Generative Physical AI in Vision: A Survey [78.07014292304373]
Gene Artificial Intelligence (AI) has rapidly advanced the field of computer vision by enabling machines to create and interpret visual data with unprecedented sophistication.<n>This transformation builds upon a foundation of generative models to produce realistic images, videos, and 3D/4D content.<n>As generative models evolve to increasingly integrate physical realism and dynamic simulation, their potential to function as "world simulators" expands.
arXiv Detail & Related papers (2025-01-19T03:19:47Z) - A Survey on Machine Learning Approaches for Modelling Intuitive Physics [1.3190581566723918]
intuitive physics is a cognitive ability that is commonly known as intuitive physics.
Many of the contemporary approaches in modelling intuitive physics for machine cognition have been inspired by literature from cognitive science.
This paper presents a comprehensive survey of recent advances and techniques in intuitive physics-inspired deep learning approaches for physical reasoning.
arXiv Detail & Related papers (2022-02-14T04:44:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.