Related papers: Spatio-temporal dual-stage hypergraph MARL for human-centric multimodal corridor traffic signal control

URL: http://arxiv.org/abs/2602.17068v1
Date: Thu, 19 Feb 2026 04:18:50 GMT
Title: Spatio-temporal dual-stage hypergraph MARL for human-centric multimodal corridor traffic signal control
Authors: Xiaocai Zhang, Neema Nassir, Milad Haghani
Abstract summary: This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning). The proposed method captures dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority.
Score: 5.728450793445691
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Human-centric traffic signal control in corridor networks must increasingly account for multimodal travelers, particularly high-occupancy public transportation, rather than focusing solely on vehicle-centric performance. This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning), a scalable multi-agent deep reinforcement learning framework that follows a centralized training and decentralized execution paradigm. The proposed method captures spatio-temporal dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. In addition, a hybrid discrete action space is introduced to jointly determine the next signal phase configuration and its corresponding green duration, enabling more adaptive signal timing decisions. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority. Compared with state-of-the-art baseline methods, the proposed approach achieves superior overall performance. Further ablation studies confirm the contribution of each component of STDSH-MARL, with temporal hyperedges identified as the most influential factor driving the observed performance gains.
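
Example (illustrative sketch, not the authors' code): the abstract describes a hybrid discrete action space that jointly selects the next signal phase and its green duration. One possible way to read this is as a single flat set of discrete actions, each standing for a (phase, green duration) pair. The phase names, the duration values, and the function names below are hypothetical placeholders; the paper may well factor the action space differently.

```python
from itertools import product

# Hypothetical values for illustration only; not taken from the paper.
PHASES = ["NS_through", "NS_left", "EW_through", "EW_left"]
GREEN_DURATIONS_S = [10, 15, 20, 25]

# Every (phase, green duration) pair becomes one discrete action.
# product() varies the duration fastest and the phase slowest.
ACTIONS = list(product(PHASES, GREEN_DURATIONS_S))


def decode_action(index: int) -> tuple[str, int]:
    """Map a flat action index to a (phase, green duration in seconds) pair."""
    return ACTIONS[index]


def encode_action(phase: str, duration_s: int) -> int:
    """Map a (phase, green duration) pair back to its flat action index."""
    return (PHASES.index(phase) * len(GREEN_DURATIONS_S)
            + GREEN_DURATIONS_S.index(duration_s))


if __name__ == "__main__":
    print(len(ACTIONS))       # number of joint actions
    print(decode_action(5))   # the pair selected by flat index 5
```

With this flattening, an agent with a single discrete output head picks the phase and the duration in one step; the cost is that the number of actions grows as the product of the two sets.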

Related papers

Human-Centric Traffic Signal Control for Equity: A Multi-Agent Action Branching Deep Reinforcement Learning Approach [5.2437780355984165]
We propose MA2B-DDQN, a human-centric multi-agent action-branching double Deep Q-Network (DQN) framework. Our key contribution is an action-branching discrete control formulation that decomposes corridor control into local, per-intersection actions. We also design a human-centric reward that penalizes the number of delayed individuals in the corridor, accounting for pedestrians, vehicle occupants, and transit passengers.
arXiv Detail & Related papers (2026-02-03T00:56:03Z)
Spatiotemporal Decision Transformer for Traffic Coordination [1.2099551931618155]
MADT (Multi-Agent Decision Transformer) is a novel approach that reformulates multi-agent traffic signal control as a sequence modeling problem. Our approach enables offline learning from historical traffic data, with architecture design that facilitates potential online fine-tuning.
arXiv Detail & Related papers (2026-02-02T23:19:13Z)
Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation [51.00494428978262]
We leverage multi-modal spatial-temporal knowledge to characterize mobility dynamics for the location recommendation task. First, we construct a unified spatial-temporal relational graph (STRG) for multi-modal representation. Second, we design a gating mechanism to fuse spatial-temporal graph representations of different modalities.
arXiv Detail & Related papers (2025-12-27T14:23:04Z)
HyperD: Hybrid Periodicity Decoupling Framework for Traffic Forecasting [10.043485636925265]
HyperD is a novel framework that decouples traffic data into periodic and residual components. Experiments on four real-world traffic datasets demonstrate that HyperD achieves state-of-the-art prediction accuracy.
arXiv Detail & Related papers (2025-11-12T12:42:22Z)
STAR-RIS-assisted Collaborative Beamforming for Low-altitude Wireless Networks [58.13757830013997]
Wireless networks based on uncrewed aerial vehicles (UAVs) offer high mobility, flexibility, and coverage for urban communications. They face severe signal attenuation in dense environments due to obstructions. To address this critical issue, we consider introducing collaborative beam of UAVs and omni-directional re-altitude beamforming.
arXiv Detail & Related papers (2025-10-25T01:28:37Z)
WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training [64.0932926819307]
We present Warmup-Stable and Merge (WSM), a framework that establishes a formal connection between learning rate decay and model merging. WSM provides a unified theoretical foundation for emulating various decay strategies. Our framework consistently outperforms the widely-adopted Warmup-Stable-Decay (WSD) approach across multiple benchmarks.
arXiv Detail & Related papers (2025-07-23T16:02:06Z)
Forecasting at Full Spectrum: Holistic Multi-Granular Traffic Modeling under High-Throughput Inference Regimes [2.3759432635713895]
We propose MultiGranGranSTG-Fog, an efficient fog distributed inference system with a novel traffic forecasting model. The proposed algorithm employs multi-granular GA-Fog feature fusion on generated dynamic traffic graphs to fully capture traffic dynamics. Extensive experiments on real-world datasets demonstrate the superiority of the proposed method over selected GCN baselines.
arXiv Detail & Related papers (2025-05-02T13:55:22Z)
Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control [8.312659530314937]
Reinforcement learning (RL) emerges as a promising data-driven approach for adaptive traffic signal control. In this paper, we propose a novel Dynamic Reinforcement Update Strategy for Deep Q-Network (DQN-DPUS). We show that the proposed strategy can speed up the convergence rate without sacrificing optimal exploration.
arXiv Detail & Related papers (2025-02-23T15:29:12Z)
Heterogeneous Multi-Agent Reinforcement Learning for Distributed Channel Access in WLANs [47.600901884970845]
This paper investigates the use of multi-agent reinforcement learning (MARL) to address distributed channel access in wireless local area networks. In particular, we consider the challenging yet more practical case where the agents heterogeneously adopt value-based or policy-based reinforcement learning algorithms to train the model. We propose a heterogeneous MARL training framework, named QPMIX, which adopts a centralized training with distributed execution paradigm to enable heterogeneous agents to collaborate.
arXiv Detail & Related papers (2024-12-18T13:50:31Z)
Improving Traffic Flow Predictions with SGCN-LSTM: A Hybrid Model for Spatial and Temporal Dependencies [55.2480439325792]
This paper introduces the Signal-Enhanced Graph Convolutional Network Long Short Term Memory (SGCN-LSTM) model for predicting traffic speeds across road networks. Experiments on the PEMS-BAY road network traffic dataset demonstrate the SGCN-LSTM model's effectiveness.
arXiv Detail & Related papers (2024-11-01T00:37:00Z)
Adaptive Hierarchical SpatioTemporal Network for Traffic Forecasting [70.66710698485745]
We propose an Adaptive Hierarchical SpatioTemporal Network (AHSTN) to promote traffic forecasting. AHSTN exploits the spatial hierarchy and models multi-scale spatial correlations. Experiments on two real-world datasets show that AHSTN achieves better performance over several strong baselines.
arXiv Detail & Related papers (2023-06-15T14:50:27Z)
Safety-compliant Generative Adversarial Networks for Human Trajectory Forecasting [95.82600221180415]
Human forecasting in crowds presents the challenges of modelling social interactions and outputting a collision-free multimodal distribution. We introduce SGANv2, an improved safety-compliant SGAN architecture equipped with motion-temporal interaction modelling and a transformer-based discriminator design.
arXiv Detail & Related papers (2022-09-25T15:18:56Z)

This list is automatically generated from the titles and abstracts of the papers on this site.