Joint Forecasting of Panoptic Segmentations with Difference Attention
- URL: http://arxiv.org/abs/2204.07157v1
- Date: Thu, 14 Apr 2022 17:59:32 GMT
- Title: Joint Forecasting of Panoptic Segmentations with Difference Attention
- Authors: Colin Graber, Cyril Jazra, Wenjie Luo, Liangyan Gui, Alexander Schwing
- Abstract summary: We study a new panoptic segmentation forecasting model that jointly forecasts all object instances in a scene.
We evaluate the proposed model on the Cityscapes and AIODrive datasets.
- Score: 72.03470153917189
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Forecasting of a representation is important for safe and effective autonomy.
For this, panoptic segmentations have been studied as a compelling
representation in recent work. However, recent state-of-the-art on panoptic
segmentation forecasting suffers from two issues: first, individual object
instances are treated independently of each other; second, individual object
instance forecasts are merged in a heuristic manner. To address both issues, we
study a new panoptic segmentation forecasting model that jointly forecasts all
object instances in a scene using a transformer model based on 'difference
attention.' It further refines the predictions by taking depth estimates into
account. We evaluate the proposed model on the Cityscapes and AIODrive
datasets. We find difference attention to be particularly suitable for
forecasting because the difference of quantities like locations enables a model
to explicitly reason about velocities and acceleration. Because of this, we
attain state-of-the-art on panoptic segmentation forecasting metrics.
Related papers
- Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction [22.667895324575824]
We introduce a Directed Acyclic Graph-based structure, which we term Socio-Temporal Graph (STG), to explicitly capture pair-wise socio-temporal interactions.
We design an attention-based model named STGformer that affords an end-to-end pipeline to learn the structure of the STGs for trajectory prediction.
Our solution achieves overall state-of-the-art prediction accuracy in two large-scale benchmark datasets.
arXiv Detail & Related papers (2023-12-22T01:48:09Z) - Scene Informer: Anchor-based Occlusion Inference and Trajectory
Prediction in Partially Observable Environments [43.28710918095309]
Navigating complex and dynamic environments requires autonomous vehicles to reason about both visible and occluded regions.
This involves predicting the future motion of observed agents, inferring occluded ones, and modeling their interactions.
We introduce the Scene Informer, a unified approach for predicting both observed agent trajectories and inferring occludeds in a partially observable setting.
arXiv Detail & Related papers (2023-09-25T06:16:09Z) - Towards Self-Supervised Category-Level Object Pose and Size Estimation [121.28537953301951]
This work presents a self-supervised framework for category-level object pose and size estimation from a single depth image.
We leverage the geometric consistency residing in point clouds of the same shape for self-supervision.
arXiv Detail & Related papers (2022-03-06T06:02:30Z) - Parameter Decoupling Strategy for Semi-supervised 3D Left Atrium
Segmentation [0.0]
We present a novel semi-supervised segmentation model based on parameter decoupling strategy to encourage consistent predictions from diverse views.
Our method has achieved a competitive result over the state-of-the-art semisupervised methods on the Atrial Challenge dataset.
arXiv Detail & Related papers (2021-09-20T14:51:42Z) - RAIN: Reinforced Hybrid Attention Inference Network for Motion
Forecasting [34.54878390622877]
We propose a generic motion forecasting framework with dynamic key information selection and ranking based on a hybrid attention mechanism.
The framework is instantiated to handle multi-agent trajectory prediction and human motion forecasting tasks.
We validate the framework on both synthetic simulations and motion forecasting benchmarks in different domains.
arXiv Detail & Related papers (2021-08-03T06:30:30Z) - Instance-Level Relative Saliency Ranking with Graph Reasoning [126.09138829920627]
We present a novel unified model to segment salient instances and infer relative saliency rank order.
A novel loss function is also proposed to effectively train the saliency ranking branch.
experimental results demonstrate that our proposed model is more effective than previous methods.
arXiv Detail & Related papers (2021-07-08T13:10:42Z) - Attentional Prototype Inference for Few-Shot Segmentation [128.45753577331422]
We propose attentional prototype inference (API), a probabilistic latent variable framework for few-shot segmentation.
We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.
We conduct extensive experiments on four benchmarks, where our proposal obtains at least competitive and often better performance than state-of-the-art prototype-based methods.
arXiv Detail & Related papers (2021-05-14T06:58:44Z) - Panoptic Segmentation Forecasting [71.75275164959953]
Our goal is to forecast the near future given a set of recent observations.
We think this ability to forecast, i.e., to anticipate, is integral for the success of autonomous agents.
We develop a two-component model: one component learns the dynamics of the background stuff by anticipating odometry, the other one anticipates the dynamics of detected things.
arXiv Detail & Related papers (2021-04-08T17:59:16Z) - Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction [57.56466850377598]
Reasoning over visual data is a desirable capability for robotics and vision-based applications.
In this paper, we present a framework on graph to uncover relationships in different objects in the scene for reasoning about pedestrian intent.
Pedestrian intent, defined as the future action of crossing or not-crossing the street, is a very crucial piece of information for autonomous vehicles.
arXiv Detail & Related papers (2020-02-20T18:50:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.