Related papers: Meta-Learning Multi-armed Bandits for Beam Tracking in 5G and 6G Networks

Meta-Learning Multi-armed Bandits for Beam Tracking in 5G and 6G Networks

URL: http://arxiv.org/abs/2512.05680v1
Date: Fri, 05 Dec 2025 12:48:50 GMT
Title: Meta-Learning Multi-armed Bandits for Beam Tracking in 5G and 6G Networks
Authors: Alexander Mattick, George Yammine, Georgios Kontes, Setareh Maghsudi, Christopher Mutschler,
Abstract summary: We formulate the problem as a partially observable Markov decision process (POMDP) and model the environment as the codebook itself.<n>This frames the beam selection problem as an online search procedure that locates the moving optimal beam.<n>In contrast to previous work, our method handles new or unforeseen trajectories and changes in the physical environment, and outperforms previous work by orders of magnitude.
Score: 45.68033457046781
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Beamforming-capable antenna arrays with many elements enable higher data rates in next generation 5G and 6G networks. In current practice, analog beamforming uses a codebook of pre-configured beams with each of them radiating towards a specific direction, and a beam management function continuously selects \textit{optimal} beams for moving user equipments (UEs). However, large codebooks and effects caused by reflections or blockages of beams make an optimal beam selection challenging. In contrast to previous work and standardization efforts that opt for supervised learning to train classifiers to predict the next best beam based on previously selected beams we formulate the problem as a partially observable Markov decision process (POMDP) and model the environment as the codebook itself. At each time step, we select a candidate beam conditioned on the belief state of the unobservable optimal beam and previously probed beams. This frames the beam selection problem as an online search procedure that locates the moving optimal beam. In contrast to previous work, our method handles new or unforeseen trajectories and changes in the physical environment, and outperforms previous work by orders of magnitude.

Related papers

BERT4beam: Large AI Model Enabled Generalized Beamforming Optimization [77.17508487745026]
This paper investigates the large-scale AI model designed for beamforming optimization to adapt and generalize to diverse tasks defined by system utilities and scales.<n>We propose a novel framework based on bidirectional encoder representations from transformers (BERT), termed BERT4 encoder.<n>Based on the framework, we propose two BERT-based approaches for single-task and multi-task beamforming optimization, respectively.
arXiv Detail & Related papers (2025-09-14T02:49:29Z)
Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management [27.860170227028963]
Existing deep learning (DL)-based beam alignment methods often neglect the underlying causal relationships between inputs and outputs.<n>We propose a causally-aware DL framework that integrates causal discovery into beam management pipeline.
arXiv Detail & Related papers (2025-08-22T12:56:07Z)
Beam Training in mmWave Vehicular Systems: Machine Learning for Decoupling Beam Selection [28.79913643775395]
Location information coupled with machine learning (ML) beam recommendation is one way to reduce the overhead of beam pair selection. We develop ML-based location-aided approaches to decouple the beam selection between the user equipment (UE) and the base station (BS)
arXiv Detail & Related papers (2024-04-16T22:27:23Z)
Hierarchical ML Codebook Design for Extreme MIMO Beam Management [37.51593770637367]
Beam management is a strategy to unify beamforming and channel state information (CSI) acquisition with large antenna arrays in 5G. Codebooks serve multiple uses in beam management including beamforming reference signals, CSI reporting, and analog beam training. We propose and evaluate a machine learning-refined codebook design process for extremely large multiple-input multiple-output (X-MIMO) systems.
arXiv Detail & Related papers (2023-11-24T17:14:11Z)
Deep Learning and Image Super-Resolution-Guided Beam and Power Allocation for mmWave Networks [80.37827344656048]
We develop a deep learning (DL)-guided hybrid beam and power allocation approach for millimeter-wave (mmWave) networks. We exploit the synergy of supervised learning and super-resolution technology to enable low-overhead beam- and power allocation.
arXiv Detail & Related papers (2023-05-08T05:40:54Z)
Fast Beam Alignment via Pure Exploration in Multi-armed Bandits [91.11360914335384]
We develop a bandit-based fast BA algorithm to reduce BA latency for millimeter-wave (mmWave) communications. Our algorithm is named Two-Phase Heteroscedastic Track-and-Stop (2PHT&S)
arXiv Detail & Related papers (2022-10-23T05:57:39Z)
Efficient Beam Search for Initial Access Using Collaborative Filtering [1.496194593196997]
Beamforming-capable antenna arrays overcome the high free-space path loss at higher carrier frequencies. The beams must be properly aligned to ensure that the highest power is radiated towards (and received by) the user equipment (UE)
arXiv Detail & Related papers (2022-09-14T14:25:56Z)
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search [103.53249725360286]
Existing trackers usually select a location or proposal with the maximum score as tracking result for each frame. We propose a novel multi-agent reinforcement learning based beam search strategy (termed BeamTracking) to address this issue.
arXiv Detail & Related papers (2022-05-19T16:35:36Z)
End-To-End Optimization of LiDAR Beam Configuration for 3D Object Detection and Localization [87.56144220508587]
We take a new route to learn to optimize the LiDAR beam configuration for a given application. We propose a reinforcement learning-based learning-to-optimize framework to automatically optimize the beam configuration. Our method is especially useful when a low-resolution (low-cost) LiDAR is needed.
arXiv Detail & Related papers (2022-01-11T09:46:31Z)
Deep Learning Assisted Calibrated Beam Training for Millimeter-Wave Communication Systems [15.297530726877786]
Huge overhead of beam training imposes a significant challenge in millimeter-wave (mmWave) wireless communications. We propose a wide beam based training approach to calibrate the narrow beam direction according to the channel power leakage. To handle the complex nonlinear properties of the channel power leakage, deep learning is utilized to predict the optimal narrow beam directly.
arXiv Detail & Related papers (2021-01-08T04:02:34Z)

This list is automatically generated from the titles and abstracts of the papers in this site.