PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
- URL: http://arxiv.org/abs/2509.12989v1
- Date: Tue, 16 Sep 2025 11:54:37 GMT
- Title: PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
- Authors: Xu Zheng, Chenfei Liao, Ziqiao Weng, Kaiyu Lei, Zihao Dongfang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Lu Qi, Li Chen, Danda Pani Paudel, Kailun Yang, Linfeng Zhang, Luc Van Gool, Xuming Hu,
- Abstract summary: This talk presents an emerging trend in the embodied AI era: the rapid development of omnidirectional vision.<n>We highlight recent breakthroughs in omnidirectional generation, omnidirectional perception, omnidirectional understanding, and related datasets.<n>We propose an ideal panoramic system architecture in the embodied AI era, PANORAMA, which consists of four key subsystems.
- Score: 92.63017552735103
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Omnidirectional vision, using 360-degree vision to understand the environment, has become increasingly critical across domains like robotics, industrial inspection, and environmental monitoring. Compared to traditional pinhole vision, omnidirectional vision provides holistic environmental awareness, significantly enhancing the completeness of scene perception and the reliability of decision-making. However, foundational research in this area has historically lagged behind traditional pinhole vision. This talk presents an emerging trend in the embodied AI era: the rapid development of omnidirectional vision, driven by growing industrial demand and academic interest. We highlight recent breakthroughs in omnidirectional generation, omnidirectional perception, omnidirectional understanding, and related datasets. Drawing on insights from both academia and industry, we propose an ideal panoramic system architecture in the embodied AI era, PANORAMA, which consists of four key subsystems. Moreover, we offer in-depth opinions related to emerging trends and cross-community impacts at the intersection of panoramic vision and embodied AI, along with the future roadmap and open challenges. This overview synthesizes state-of-the-art advancements and outlines challenges and opportunities for future research in building robust, general-purpose omnidirectional AI systems in the embodied AI era.
Related papers
- One Flight Over the Gap: A Survey from Perspective to Panoramic Vision [117.80970697177025]
This survey reviews recent panoramic vision techniques with a particular emphasis on the perspective-to-panorama adaptation.<n>We first revisit the panoramic imaging pipeline and projection methods to build the prior knowledge required for analyzing the structural disparities.<n>Building on this, we cover 20+ representative tasks drawn from more than 300 research papers in two dimensions.
arXiv Detail & Related papers (2025-09-04T17:59:10Z) - Edge General Intelligence Through World Models and Agentic AI: Fundamentals, Solutions, and Challenges [87.02855999212817]
Edge General Intelligence (EGI) represents a transformative evolution of edge computing, where distributed agents possess the capability to perceive, reason, and act autonomously.<n>World models act as proactive internal simulators that not only predict but also actively imagine future trajectories, reason under uncertainty, and plan multi-step actions with foresight.<n>This survey bridges the gap by offering a comprehensive analysis of how world models can empower agentic artificial intelligence (AI) systems at the edge.
arXiv Detail & Related papers (2025-08-13T07:29:40Z) - A Review of Vision-Based Assistive Systems for Visually Impaired People: Technologies, Applications, and Future Directions [1.9253333342733674]
This paper presents a review of recent advances in assistive systems designed for the visually impaired.<n>It focuses on state-of-the-art technologies in obstacle detection, navigation, and user interaction.<n>Emerging trends and future directions in visual guidance systems are discussed.
arXiv Detail & Related papers (2025-05-20T12:47:07Z) - Toward Embodied AGI: A Review of Embodied AI and the Road Ahead [49.10409405708527]
Embodied Artificial General Intelligence (AGI) is often envisioned as inherently embodied.<n>This paper introduces a systematic taxonomy of Embodied AGI spanning five levels (L1-L5)<n>We propose a conceptual framework for an L3+ robotic brain, offering both a technical outlook and a foundation for future exploration.
arXiv Detail & Related papers (2025-05-20T11:42:26Z) - Challenges and Trends in Egocentric Vision: A Survey [11.593894126370724]
Egocentric vision captures visual and multimodal data through cameras or sensors worn on the human body.<n>This paper provides a comprehensive survey of the research on egocentric vision understanding.<n>By summarizing the latest advancements, we anticipate the broad applications of egocentric vision technologies in fields such as augmented reality, virtual reality, and embodied intelligence.
arXiv Detail & Related papers (2025-03-19T14:51:27Z) - Diffusion-Based Visual Art Creation: A Survey and New Perspectives [51.522935314070416]
This survey explores the emerging realm of diffusion-based visual art creation, examining its development from both artistic and technical perspectives.
Our findings reveal how artistic requirements are transformed into technical challenges and highlight the design and application of diffusion-based methods within visual art creation.
We aim to shed light on the mechanisms through which AI systems emulate and possibly, enhance human capacities in artistic perception and creativity.
arXiv Detail & Related papers (2024-08-22T04:49:50Z) - Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI [116.8199519880327]
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General Intelligence (AGI)<n>In this survey, we give a comprehensive exploration of the latest advancements in Embodied AI.
arXiv Detail & Related papers (2024-07-09T14:14:47Z) - Vision-based Learning for Drones: A Survey [1.280979348722635]
Drones as advanced cyber-physical systems are undergoing a transformative shift with the advent of vision-based learning.
This review offers a comprehensive overview of vision-based learning in drones, emphasizing its pivotal role in enhancing their operational capabilities.
We explore various applications of vision-based drones with learning capabilities, ranging from single-agent systems to more complex multi-agent and heterogeneous system scenarios.
arXiv Detail & Related papers (2023-12-08T12:57:13Z) - Edge AI for Internet of Energy: Challenges and Perspectives [5.267662071764103]
The digital landscape of the Internet of Energy (IoE) is on the brink of a revolutionary transformation with the integration of edge Artificial Intelligence (AI)
This comprehensive review elucidates the promise and potential that edge AI holds for reshaping the IoE ecosystem.
arXiv Detail & Related papers (2023-11-28T15:01:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.