A Survey and Framework of Cooperative Perception: From Heterogeneous
Singleton to Hierarchical Cooperation
- URL: http://arxiv.org/abs/2208.10590v1
- Date: Mon, 22 Aug 2022 20:47:35 GMT
- Title: A Survey and Framework of Cooperative Perception: From Heterogeneous
Singleton to Hierarchical Cooperation
- Authors: Zhengwei Bai, Guoyuan Wu, Matthew J. Barth, Yongkang Liu, Emrah Akin
Sisbot, Kentaro Oguchi, Zhitong Huang
- Abstract summary: This paper reviews the research progress on Cooperative Perception (CP) and proposes a unified CP framework.
CP is born to unlock the bottleneck of perception for driving automation.
A Hierarchical CP framework is proposed, followed by a review of existing datasets and Simulators to sketch an overall landscape of CP.
- Score: 14.525705886707089
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Perceiving the environment is one of the most fundamental keys to enabling
Cooperative Driving Automation (CDA), which is regarded as the revolutionary
solution to addressing the safety, mobility, and sustainability issues of
contemporary transportation systems. Although an unprecedented evolution is now
happening in the area of computer vision for object perception,
state-of-the-art perception methods are still struggling with sophisticated
real-world traffic environments due to the inevitably physical occlusion and
limited receptive field of single-vehicle systems. Based on multiple spatially
separated perception nodes, Cooperative Perception (CP) is born to unlock the
bottleneck of perception for driving automation. In this paper, we
comprehensively review and analyze the research progress on CP and, to the best
of our knowledge, this is the first time to propose a unified CP framework.
Architectures and taxonomy of CP systems based on different types of sensors
are reviewed to show a high-level description of the workflow and different
structures for CP systems. Node structure, sensor modality, and fusion schemes
are reviewed and analyzed with comprehensive literature to provide detailed
explanations of specific methods. A Hierarchical CP framework is proposed,
followed by a review of existing Datasets and Simulators to sketch an overall
landscape of CP. Discussion highlights the current opportunities, open
challenges, and anticipated future trends.
Related papers
- Deconstructing Human-AI Collaboration: Agency, Interaction, and Adaptation [9.36651659099834]
We propose a new unified set of dimensions through which to analyze and describe human-AI systems.
Our conceptual model is centered around three high-level aspects - agency, interaction, and adaptation.
arXiv Detail & Related papers (2024-04-18T10:12:18Z) - A Holistic Framework Towards Vision-based Traffic Signal Control with
Microscopic Simulation [53.39174966020085]
Traffic signal control (TSC) is crucial for reducing traffic congestion that leads to smoother traffic flow, reduced idling time, and mitigated CO2 emissions.
In this study, we explore the computer vision approach for TSC that modulates on-road traffic flows through visual observation.
We introduce a holistic traffic simulation framework called TrafficDojo towards vision-based TSC and its benchmarking.
arXiv Detail & Related papers (2024-03-11T16:42:29Z) - Towards Vehicle-to-everything Autonomous Driving: A Survey on
Collaborative Perception [40.90789787242417]
Vehicle-to-everything (V2X) autonomous driving opens up a promising direction for developing a new generation of intelligent transportation systems.
Collaborative perception (CP) as an essential component to achieve V2X can overcome the inherent limitations of individual perception.
We provide a comprehensive review of CP methods for V2X scenarios, bringing a profound and in-depth understanding to the community.
arXiv Detail & Related papers (2023-08-31T13:28:32Z) - Viewpoint Generation using Feature-Based Constrained Spaces for Robot
Vision Systems [63.942632088208505]
This publication outlines the generation of viewpoints as a geometrical problem and introduces a generalized theoretical framework for solving it.
A $mathcalC$-space can be understood as the topological space that a viewpoint constraint spans, where the sensor can be positioned for acquiring a feature while fulfilling the regarded constraint.
The introduced $mathcalC$-spaces are characterized based on generic domain and viewpoint constraints models to ease the transferability of the present framework to different applications and robot vision systems.
arXiv Detail & Related papers (2023-06-12T08:57:15Z) - Cooperverse: A Mobile-Edge-Cloud Framework for Universal Cooperative
Perception with Mixed Connectivity and Automation [15.195933965761645]
We formulate a universal CP system into an optimization problem and a mobile-edge-cloud framework called Cooperverse.
A Dynamic Feature Sharing (DFS) methodology is introduced to support this CP system under certain constraints.
Experiments have been conducted based on a high-fidelity CP platform and the results show that the Cooperverse framework is effective for dynamic node engagement.
arXiv Detail & Related papers (2023-02-06T21:30:08Z) - Exploring Contextual Representation and Multi-Modality for End-to-End
Autonomous Driving [58.879758550901364]
Recent perception systems enhance spatial understanding with sensor fusion but often lack full environmental context.
We introduce a framework that integrates three cameras to emulate the human field of view, coupled with top-down bird-eye-view semantic data to enhance contextual representation.
Our method achieves displacement error by 0.67m in open-loop settings, surpassing current methods by 6.9% on the nuScenes dataset.
arXiv Detail & Related papers (2022-10-13T05:56:20Z) - Infrastructure-Based Object Detection and Tracking for Cooperative
Driving Automation: A Survey [16.20885642028316]
Infrastructure-based object detection and tracking systems can enhance the perception capability for connected vehicles.
Discussions conducted to point out current opportunities, open problems, and anticipated future trends.
arXiv Detail & Related papers (2022-01-28T00:55:24Z) - Self-supervised Video Object Segmentation by Motion Grouping [79.13206959575228]
We develop a computer vision system able to segment objects by exploiting motion cues.
We introduce a simple variant of the Transformer to segment optical flow frames into primary objects and the background.
We evaluate the proposed architecture on public benchmarks (DAVIS2016, SegTrackv2, and FBMS59)
arXiv Detail & Related papers (2021-04-15T17:59:32Z) - Systemic formalisation of Cyber-Physical-Social System (CPSS): A
systematic literature review [0.0]
The concept of CPSS has been around for over a decade and it has gained increasing attention over the past few years.
The exploration to conceptualise the notion of CPSS has been partially addressed in few scientific literatures.
This work aims at addressing these issues by first exploring and analysing scientific literature to understand the complete spectrum of CPSS.
arXiv Detail & Related papers (2021-04-11T22:31:57Z) - Investigating Bi-Level Optimization for Learning and Vision from a
Unified Perspective: A Survey and Beyond [114.39616146985001]
In machine learning and computer vision fields, despite the different motivations and mechanisms, a lot of complex problems contain a series of closely related subproblms.
In this paper, we first uniformly express these complex learning and vision problems from the perspective of Bi-Level Optimization (BLO)
Then we construct a value-function-based single-level reformulation and establish a unified algorithmic framework to understand and formulate mainstream gradient-based BLO methodologies.
arXiv Detail & Related papers (2021-01-27T16:20:23Z) - Towards an Interface Description Template for AI-enabled Systems [77.34726150561087]
Reuse is a common system architecture approach that seeks to instantiate a system architecture with existing components.
There is currently no framework that guides the selection of necessary information to assess their portability to operate in a system different than the one for which the component was originally purposed.
We present ongoing work on establishing an interface description template that captures the main information of an AI-enabled component.
arXiv Detail & Related papers (2020-07-13T20:30:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.