Bridging the Domain Gap for Multi-Agent Perception
- URL: http://arxiv.org/abs/2210.08451v1
- Date: Sun, 16 Oct 2022 05:29:21 GMT
- Title: Bridging the Domain Gap for Multi-Agent Perception
- Authors: Runsheng Xu, Jinlong Li, Xiaoyu Dong, Hongkai Yu, Jiaqi Ma
- Abstract summary: We propose the first lightweight framework to bridge domain gaps for multi-agent perception.
Our framework consists of a learnable feature resizer to align features in multiple dimensions and a sparse cross-domain transformer for domain adaption.
Our method can effectively bridge the gap for features from different domains and outperform other baseline methods significantly by at least 8% for point-cloud-based 3D object detection.
- Score: 19.724227909352976
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing multi-agent perception algorithms usually select to share deep
neural features extracted from raw sensing data between agents, achieving a
trade-off between accuracy and communication bandwidth limit. However, these
methods assume all agents have identical neural networks, which might not be
practical in the real world. The transmitted features can have a large domain
gap when the models differ, leading to a dramatic performance drop in
multi-agent perception. In this paper, we propose the first lightweight
framework to bridge such domain gaps for multi-agent perception, which can be a
plug-in module for most existing systems while maintaining confidentiality. Our
framework consists of a learnable feature resizer to align features in multiple
dimensions and a sparse cross-domain transformer for domain adaption. Extensive
experiments on the public multi-agent perception dataset V2XSet have
demonstrated that our method can effectively bridge the gap for features from
different domains and outperform other baseline methods significantly by at
least 8% for point-cloud-based 3D object detection.
Related papers
- Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation [59.41178047749177]
We focus on multi-domain Neural Machine Translation, with the goal of developing efficient models which can handle data from various domains seen during training and are robust to domains unseen during training.
We hypothesize that Sparse Mixture-of-Experts (SMoE) models are a good fit for this task, as they enable efficient model scaling.
We conduct a series of experiments aimed at validating the utility of SMoE for the multi-domain scenario, and find that a straightforward width scaling of Transformer is a simpler and surprisingly more efficient approach in practice, and reaches the same performance level as SMoE.
arXiv Detail & Related papers (2024-07-01T09:45:22Z) - Multimodal 3D Object Detection on Unseen Domains [37.142470149311904]
Domain adaptation approaches assume access to unannotated samples from the test distribution to address this problem.
We propose CLIX$text3D$, a multimodal fusion and supervised contrastive learning framework for 3D object detection.
We show that CLIX$text3D$ yields state-of-the-art domain generalization performance under multiple dataset shifts.
arXiv Detail & Related papers (2024-04-17T21:47:45Z) - Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception
from Independent Private Sources [39.22864188785987]
The diverse agents in multi-agent perception systems may be from different companies. Each company might use the identical classic neural network architecture based encoder for feature extraction.
The data silos by the above Distribution Gap could result in a significant performance decline in multi-agent perception.
We introduce the Feature Distribution-aware Aggregation (FDA) framework for cross-domain learning to mitigate the above Distribution Gap in multi-agent perception.
arXiv Detail & Related papers (2024-02-06T20:09:52Z) - Compositional Semantic Mix for Domain Adaptation in Point Cloud
Segmentation [65.78246406460305]
compositional semantic mixing represents the first unsupervised domain adaptation technique for point cloud segmentation.
We present a two-branch symmetric network architecture capable of concurrently processing point clouds from a source domain (e.g. synthetic) and point clouds from a target domain (e.g. real-world)
arXiv Detail & Related papers (2023-08-28T14:43:36Z) - Robust Domain Adaptive Object Detection with Unified Multi-Granularity Alignment [59.831917206058435]
Domain adaptive detection aims to improve the generalization of detectors on target domain.
Recent approaches achieve domain adaption through feature alignment in different granularities via adversarial learning.
We introduce a unified multi-granularity alignment (MGA)-based detection framework for domain-invariant feature learning.
arXiv Detail & Related papers (2023-01-01T08:38:07Z) - Multi-Granularity Alignment Domain Adaptation for Object Detection [33.32519045960187]
Domain adaptive object detection is challenging due to distinctive data distribution between source domain and target domain.
We propose a unified multi-granularity alignment based object detection framework towards domain-invariant feature learning.
arXiv Detail & Related papers (2022-03-31T09:05:06Z) - META: Mimicking Embedding via oThers' Aggregation for Generalizable
Person Re-identification [68.39849081353704]
Domain generalizable (DG) person re-identification (ReID) aims to test across unseen domains without access to the target domain data at training time.
This paper presents a new approach called Mimicking Embedding via oThers' Aggregation (META) for DG ReID.
arXiv Detail & Related papers (2021-12-16T08:06:50Z) - Multi-Source Domain Adaptation for Object Detection [52.87890831055648]
We propose a unified Faster R-CNN based framework, termed Divide-and-Merge Spindle Network (DMSN)
DMSN can simultaneously enhance domain innative and preserve discriminative power.
We develop a novel pseudo learning algorithm to approximate optimal parameters of pseudo target subset.
arXiv Detail & Related papers (2021-06-30T03:17:20Z) - AFAN: Augmented Feature Alignment Network for Cross-Domain Object
Detection [90.18752912204778]
Unsupervised domain adaptation for object detection is a challenging problem with many real-world applications.
We propose a novel augmented feature alignment network (AFAN) which integrates intermediate domain image generation and domain-adversarial training.
Our approach significantly outperforms the state-of-the-art methods on standard benchmarks for both similar and dissimilar domain adaptations.
arXiv Detail & Related papers (2021-06-10T05:01:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.