SwinUNet3D -- A Hierarchical Architecture for Deep Traffic Prediction
using Shifted Window Transformers
- URL: http://arxiv.org/abs/2201.06390v1
- Date: Mon, 17 Jan 2022 12:58:45 GMT
- Title: SwinUNet3D -- A Hierarchical Architecture for Deep Traffic Prediction
using Shifted Window Transformers
- Authors: Alabi Bojesomo and Hasan Al Marzouqi and Panos Liatsis
- Abstract summary: In this paper, we explore the use of vision transformer in a UNet setting.
We completely remove all convolution-based building blocks in UNet, while using 3D shifted transformer in both encoder and decoder branches.
The proposed network is tested on the data provided by Traffic Map Movie Forecasting Challenge 2021(Traffic4cast), held in the competition of Neural Information Processing Systems (eurIPS)
- Score: 5.414308305392762
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Traffic forecasting is an important element of mobility management, an
important key that drives the logistics industry. Over the years, lots of work
have been done in Traffic forecasting using time series as well as
spatiotemporal dynamic forecasting. In this paper, we explore the use of vision
transformer in a UNet setting. We completely remove all convolution-based
building blocks in UNet, while using 3D shifted window transformer in both
encoder and decoder branches. In addition, we experiment with the use of
feature mixing just before patch encoding to control the inter-relationship of
the feature while avoiding contraction of the depth dimension of our
spatiotemporal input. The proposed network is tested on the data provided by
Traffic Map Movie Forecasting Challenge 2021(Traffic4cast2021), held in the
competition track of Neural Information Processing Systems (NeurIPS).
Traffic4cast2021 task is to predict an hour (6 frames) of traffic conditions
(volume and average speed)from one hour of given traffic state (12 frames
averaged in 5 minutes time span). Source code is available online at
https://github.com/bojesomo/Traffic4Cast2021-SwinUNet3D.
Related papers
- Transport-Hub-Aware Spatial-Temporal Adaptive Graph Transformer for
Traffic Flow Prediction [10.722455633629883]
We propose a Transport-Hub-aware spatial-temporal adaptive graph transFormer for traffic flow prediction.
Specifically, we first design a novel spatial self-attention module to capture the dynamic spatial dependencies.
We also employ a temporal self-attention module to detect dynamic temporal patterns in the traffic flow data.
arXiv Detail & Related papers (2023-10-12T13:44:35Z) - PDFormer: Propagation Delay-Aware Dynamic Long-Range Transformer for
Traffic Flow Prediction [78.05103666987655]
spatial-temporal Graph Neural Network (GNN) models have emerged as one of the most promising methods to solve this problem.
We propose a novel propagation delay-aware dynamic long-range transFormer, namely PDFormer, for accurate traffic flow prediction.
Our method can not only achieve state-of-the-art performance but also exhibit competitive computational efficiency.
arXiv Detail & Related papers (2023-01-19T08:42:40Z) - A Novel Framework for Handling Sparse Data in Traffic Forecast [1.6549126148958608]
A set of sparse and time evolving traffic reports is generated for each road.
In this paper we present a deep learning framework that encodes the sparse recent traffic information and forecasts the future traffic condition.
arXiv Detail & Related papers (2023-01-12T21:07:04Z) - Multi-task Learning for Sparse Traffic Forecasting [13.359590890052454]
We propose a multi-task learning network that can simultaneously predict the congestion classes and the speed of each road segment.
Our method achieved excellent results on the dataset provided by the Traffic4cast Competition 2022, source code is available at https://github.com/OctopusLi/NeurIPS2022-traffic4cast.
arXiv Detail & Related papers (2022-11-18T02:10:40Z) - Correlating sparse sensing for large-scale traffic speed estimation: A
Laplacian-enhanced low-rank tensor kriging approach [76.45949280328838]
We propose a Laplacian enhanced low-rank tensor (LETC) framework featuring both lowrankness and multi-temporal correlations for large-scale traffic speed kriging.
We then design an efficient solution algorithm via several effective numeric techniques to scale up the proposed model to network-wide kriging.
arXiv Detail & Related papers (2022-10-21T07:25:57Z) - S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction
in Autonomous Driving [7.862992905548721]
We propose S2TNet, which models the spatial-temporal interactions by the S-temporal Transformer and deals with the temporel sequences by temporal Transformer.
The methods outperforms state-of-the-art methods on ApolloScape Trajectory dataset by more than 7% on both the weighted sum of Average and Final Displacement Error.
arXiv Detail & Related papers (2022-06-22T08:12:31Z) - Real-time Object Detection for Streaming Perception [84.2559631820007]
Streaming perception is proposed to jointly evaluate the latency and accuracy into a single metric for video online perception.
We build a simple and effective framework for streaming perception.
Our method achieves competitive performance on Argoverse-HD dataset and improves the AP by 4.9% compared to the strong baseline.
arXiv Detail & Related papers (2022-03-23T11:33:27Z) - Traffic4cast -- Large-scale Traffic Prediction using 3DResNet and
Sparse-UNet [2.568084386350801]
The aim is to build a machine learning model for predicting the normalized average traffic speed and flow of subregions of multiple-scale cities using historical data points.
We explore 3DRparseNetes and Sparse-UNet approaches for the tasks in this competition.
Our results show that both of the proposed models achieve much better performance than the baseline algorithms.
arXiv Detail & Related papers (2021-11-10T23:40:52Z) - Transformers Solve the Limited Receptive Field for Monocular Depth
Prediction [82.90445525977904]
We propose TransDepth, an architecture which benefits from both convolutional neural networks and transformers.
This is the first paper which applies transformers into pixel-wise prediction problems involving continuous labels.
arXiv Detail & Related papers (2021-03-22T18:00:13Z) - A Graph Convolutional Network with Signal Phasing Information for
Arterial Traffic Prediction [63.470149585093665]
arterial traffic prediction plays a crucial role in the development of modern intelligent transportation systems.
Many existing studies on arterial traffic prediction only consider temporal measurements of flow and occupancy from loop sensors and neglect the rich spatial relationships between upstream and downstream detectors.
We fill this gap by enhancing a deep learning approach, Diffusion Convolutional Recurrent Neural Network, with spatial information generated from signal timing plans at targeted intersections.
arXiv Detail & Related papers (2020-12-25T01:40:29Z) - DS-Net: Dynamic Spatiotemporal Network for Video Salient Object
Detection [78.04869214450963]
We propose a novel dynamic temporal-temporal network (DSNet) for more effective fusion of temporal and spatial information.
We show that the proposed method achieves superior performance than state-of-the-art algorithms.
arXiv Detail & Related papers (2020-12-09T06:42:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.