Multi-view Sensor Fusion by Integrating Model-based Estimation and Graph
Learning for Collaborative Object Localization
- URL: http://arxiv.org/abs/2011.07704v2
- Date: Sun, 7 Mar 2021 01:31:36 GMT
- Title: Multi-view Sensor Fusion by Integrating Model-based Estimation and Graph
Learning for Collaborative Object Localization
- Authors: Peng Gao, Rui Guo, Hongsheng Lu and Hao Zhang
- Abstract summary: Collaborative object localization aims to collaboratively estimate locations of objects observed from multiple views or perspectives.
To enable collaborative localization, several model-based state estimation and learning-based localization methods have been developed.
We introduce a novel graph filter approach that integrates graph learning and model-based estimation to perform multi-view sensor fusion.
- Score: 22.57544305097723
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Collaborative object localization aims to collaboratively estimate locations
of objects observed from multiple views or perspectives, which is a critical
ability for multi-agent systems such as connected vehicles. To enable
collaborative localization, several model-based state estimation and
learning-based localization methods have been developed. Given their
encouraging performance, model-based state estimation often lacks the ability
to model the complex relationships among multiple objects, while learning-based
methods are typically not able to fuse the observations from an arbitrary
number of views and cannot well model uncertainty. In this paper, we introduce
a novel spatiotemporal graph filter approach that integrates graph learning and
model-based estimation to perform multi-view sensor fusion for collaborative
object localization. Our approach models complex object relationships using a
new spatiotemporal graph representation and fuses multi-view observations in a
Bayesian fashion to improve location estimation under uncertainty. We evaluate
our approach in the applications of connected autonomous driving and multiple
pedestrian localization. Experimental results show that our approach
outperforms previous techniques and achieves the state-of-the-art performance
on collaboration localization.
Related papers
- Zero-Shot Object-Centric Representation Learning [72.43369950684057]
We study current object-centric methods through the lens of zero-shot generalization.
We introduce a benchmark comprising eight different synthetic and real-world datasets.
We find that training on diverse real-world images improves transferability to unseen scenarios.
arXiv Detail & Related papers (2024-08-17T10:37:07Z) - Corpus Considerations for Annotator Modeling and Scaling [9.263562546969695]
We show that the commonly used user token model consistently outperforms more complex models.
Our findings shed light on the relationship between corpus statistics and annotator modeling performance.
arXiv Detail & Related papers (2024-04-02T22:27:24Z) - FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects [55.77542145604758]
FoundationPose is a unified foundation model for 6D object pose estimation and tracking.
Our approach can be instantly applied at test-time to a novel object without fine-tuning.
arXiv Detail & Related papers (2023-12-13T18:28:09Z) - Reconstructing Spatiotemporal Data with C-VAEs [49.1574468325115]
Conditional continuous representation of moving regions is commonly used.
In this work, we explore the capabilities of Conditional Varitemporal Autoencoder (C-VAE) models to generate realistic representations of regions' evolution.
arXiv Detail & Related papers (2023-07-12T15:34:10Z) - GenPose: Generative Category-level Object Pose Estimation via Diffusion
Models [5.1998359768382905]
We propose a novel solution by reframing categorylevel object pose estimation as conditional generative modeling.
Our approach achieves state-of-the-art performance on the REAL275 dataset, surpassing 50% and 60% on strict 5d2cm and 5d5cm metrics.
arXiv Detail & Related papers (2023-06-18T11:45:42Z) - Structured Cooperative Learning with Graphical Model Priors [98.53322192624594]
We study how to train personalized models for different tasks on decentralized devices with limited local data.
We propose "Structured Cooperative Learning (SCooL)", in which a cooperation graph across devices is generated by a graphical model.
We evaluate SCooL and compare it with existing decentralized learning methods on an extensive set of benchmarks.
arXiv Detail & Related papers (2023-06-16T02:41:31Z) - Federated Learning of Models Pre-Trained on Different Features with
Consensus Graphs [19.130197923214123]
Learning an effective global model on private and decentralized datasets has become an increasingly important challenge of machine learning.
We propose a feature fusion approach that extracts local representations from local models and incorporates them into a global representation that improves the prediction performance.
This paper presents solutions to these problems and demonstrates them in real-world applications on time series data such as power grids and traffic networks.
arXiv Detail & Related papers (2023-06-02T02:24:27Z) - Graph state-space models [19.88814714919019]
State-space models are used to describe time series and operate by maintaining an updated representation of the system state from which predictions are made.
The manuscript aims, for the first time, for the first time filling this gap by matching unattended state data where the functional graph capturing latent dependencies is learned directly from data and is allowed to change over time.
An encoder-decoder architecture is proposed to learn the state-space model end-to-end on a downstream task.
arXiv Detail & Related papers (2023-01-04T18:15:07Z) - Generalization Properties of Retrieval-based Models [50.35325326050263]
Retrieval-based machine learning methods have enjoyed success on a wide range of problems.
Despite growing literature showcasing the promise of these models, the theoretical underpinning for such models remains underexplored.
We present a formal treatment of retrieval-based models to characterize their generalization ability.
arXiv Detail & Related papers (2022-10-06T00:33:01Z) - Asynchronous Collaborative Localization by Integrating Spatiotemporal
Graph Learning with Model-Based Estimation [22.63837164001751]
Collaborative localization is an essential capability for a team of robots such as connected vehicles to collaboratively estimate object locations.
To enable collaborative localization, four key challenges must be addressed, including modeling complex relationships between observed objects.
We introduce a novel approach that integrates uncertainty-aware graph learning model and state estimation for a team of robots to collaboratively localize objects.
arXiv Detail & Related papers (2021-11-05T22:48:13Z) - Multi-Agent Imitation Learning with Copulas [102.27052968901894]
Multi-agent imitation learning aims to train multiple agents to perform tasks from demonstrations by learning a mapping between observations and actions.
In this paper, we propose to use copula, a powerful statistical tool for capturing dependence among random variables, to explicitly model the correlation and coordination in multi-agent systems.
Our proposed model is able to separately learn marginals that capture the local behavioral patterns of each individual agent, as well as a copula function that solely and fully captures the dependence structure among agents.
arXiv Detail & Related papers (2021-07-10T03:49:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.