InfoCom: Kilobyte-Scale Communication-Efficient Collaborative Perception with Information Bottleneck
- URL: http://arxiv.org/abs/2512.10305v1
- Date: Thu, 11 Dec 2025 05:51:02 GMT
- Title: InfoCom: Kilobyte-Scale Communication-Efficient Collaborative Perception with Information Bottleneck
- Authors: Quanmin Wei, Penglin Dai, Wei Li, Bingyi Liu, Xiao Wu,
- Abstract summary: InfoCom is an information-aware framework establishing the pioneering theoretical foundation for communication-efficient collaborative perception.<n>Experiments across multiple datasets demonstrate that InfoCom achieves near-lossless perception while reducing communication overhead from megabyte to kilobyte-scale, representing 440-fold and 90-fold reductions per agent compared to Where2comm and ERMVP, respectively.
- Score: 14.691852809650323
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Precise environmental perception is critical for the reliability of autonomous driving systems. While collaborative perception mitigates the limitations of single-agent perception through information sharing, it encounters a fundamental communication-performance trade-off. Existing communication-efficient approaches typically assume MB-level data transmission per collaboration, which may fail due to practical network constraints. To address these issues, we propose InfoCom, an information-aware framework establishing the pioneering theoretical foundation for communication-efficient collaborative perception via extended Information Bottleneck principles. Departing from mainstream feature manipulation, InfoCom introduces a novel information purification paradigm that theoretically optimizes the extraction of minimal sufficient task-critical information under Information Bottleneck constraints. Its core innovations include: i) An Information-Aware Encoding condensing features into minimal messages while preserving perception-relevant information; ii) A Sparse Mask Generation identifying spatial cues with negligible communication cost; and iii) A Multi-Scale Decoding that progressively recovers perceptual information through mask-guided mechanisms rather than simple feature reconstruction. Comprehensive experiments across multiple datasets demonstrate that InfoCom achieves near-lossless perception while reducing communication overhead from megabyte to kilobyte-scale, representing 440-fold and 90-fold reductions per agent compared to Where2comm and ERMVP, respectively.
Related papers
- Send Less, Perceive More: Masked Quantized Point Cloud Communication for Loss-Tolerant Collaborative Perception [38.10779821259225]
We introduce QPoint2Comm, a quantized point-cloud communication framework that dramatically reduces bandwidth.<n>QPoint2Comm directly communicates quantized point-cloud indices using a shared codebook.<n>We employ a masked training strategy that simulates random packet loss, allowing the model to maintain strong performance even under severe transmission failures.
arXiv Detail & Related papers (2026-02-25T08:00:48Z) - Rate-Distortion Optimized Communication for Collaborative Perception [47.737814518681326]
We introduce a pragmatic rate-distortion theory for multi-agent collaboration, specifically formulated to analyze performance-communication trade-off.<n>We propose RDcomm, a communication-efficient collaborative perception framework that introduces two key innovations.<n>Experiments on 3D object detection and BEV segmentation demonstrate that RDcomm achieves state-of-the-art accuracy on DAIR-V2X and OPV2V, while reducing communication volume by up to 108 times.
arXiv Detail & Related papers (2025-09-26T07:21:32Z) - Wireless Communication as an Information Sensor for Multi-agent Cooperative Perception: A Survey [4.570256768472447]
Cooperative perception extends the perception capabilities of autonomous vehicles by enabling multi-agent information sharing via Vehicle-to-Everything (V2X) communication.<n>This survey focuses on three key dimensions: information representation, information fusion, and large-scale deployment.
arXiv Detail & Related papers (2025-04-30T12:23:57Z) - Integration of Communication and Computational Imaging [49.2442836992307]
We propose a novel framework that integrates communication and computational imaging (ICCI) for remote perception.
ICCI framework performs a full-link information transfer optimization, aiming to minimize information loss from the generation of the information source to the execution of the final vision tasks.
An 80 km 27-band hyperspectral video perception with a rate of 30 fps is experimentally achieved.
arXiv Detail & Related papers (2024-10-25T09:19:59Z) - Semantic Communication for Cooperative Perception using HARQ [51.148203799109304]
We leverage an importance map to distill critical semantic information, introducing a cooperative perception semantic communication framework.
To counter the challenges posed by time-varying multipath fading, our approach incorporates the use of frequency-division multiplexing (OFDM) along with channel estimation and equalization strategies.
We introduce a novel semantic error detection method that is integrated with our semantic communication framework in the spirit of hybrid automatic repeated request (HARQ)
arXiv Detail & Related papers (2024-08-29T08:53:26Z) - Vector Symbolic Open Source Information Discovery [0.41468088383213214]
Large language models (transformers) facilitate semantic data and metadata alignment but are inefficient in CJIIM settings.
We demonstrate a novel integration of transformer models with VSA, combining the power of the former for semantic matching with the compactness and representational structure of the latter.
This work was carried out as a bridge between previous low technology readiness level (TRL) research and future higher-TRL technology demonstration and deployment.
arXiv Detail & Related papers (2024-08-20T11:05:56Z) - Rate-Distortion-Perception Theory for Semantic Communication [73.04341519955223]
We study the achievable data rate of semantic communication under the symbol distortion and semantic perception constraints.
We observe that there exists cases that the receiver can directly infer the semantic information source satisfying certain distortion and perception constraints.
arXiv Detail & Related papers (2023-12-09T02:04:32Z) - Semantic-Native Communication: A Simplicial Complex Perspective [50.099494681671224]
We study semantic communication from a topological space perspective.
A transmitter first maps its data into a $k$-order simplicial complex and then learns its high-order correlations.
The receiver decodes the structure and infers the missing or distorted data.
arXiv Detail & Related papers (2022-10-30T22:33:44Z) - Multi-agent Communication with Graph Information Bottleneck under
Limited Bandwidth (a position paper) [92.11330289225981]
In many real-world scenarios, communication can be expensive and the bandwidth of the multi-agent system is subject to certain constraints.
Redundant messages who occupy the communication resources can block the transmission of informative messages and thus jeopardize the performance.
We propose a novel multi-agent communication module, CommGIB, which effectively compresses the structure information and node information in the communication graph to deal with bandwidth-constrained settings.
arXiv Detail & Related papers (2021-12-20T07:53:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.