Efficient Online Learning for Cognitive Radar-Cellular Coexistence via
Contextual Thompson Sampling
- URL: http://arxiv.org/abs/2008.10149v1
- Date: Mon, 24 Aug 2020 01:20:58 GMT
- Title: Efficient Online Learning for Cognitive Radar-Cellular Coexistence via
Contextual Thompson Sampling
- Authors: Charles E. Thornton, R. Michael Buehrer, Anthony F. Martone
- Abstract summary: This paper describes a sequential, or online, learning scheme for adaptive radar transmissions.
A linear Contextual Bandit (CB) learning framework is applied to drive the radar's behavior.
We show that the proposed Thompson Sampling (TS) algorithm maintains competitive performance with a more complex Deep Q-Network (DQN)
- Score: 9.805913930878
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper describes a sequential, or online, learning scheme for adaptive
radar transmissions that facilitate spectrum sharing with a non-cooperative
cellular network. First, the interference channel between the radar and a
spatially distant cellular network is modeled. Then, a linear Contextual Bandit
(CB) learning framework is applied to drive the radar's behavior. The
fundamental trade-off between exploration and exploitation is balanced by a
proposed Thompson Sampling (TS) algorithm, a pseudo-Bayesian approach which
selects waveform parameters based on the posterior probability that a specific
waveform is optimal, given discounted channel information as context. It is
shown that the contextual TS approach converges more rapidly to behavior that
minimizes mutual interference and maximizes spectrum utilization than
comparable contextual bandit algorithms. Additionally, we show that the TS
learning scheme results in a favorable SINR distribution compared to other
online learning algorithms. Finally, the proposed TS algorithm is compared to a
deep reinforcement learning model. We show that the TS algorithm maintains
competitive performance with a more complex Deep Q-Network (DQN).
Related papers
- Online Network Source Optimization with Graph-Kernel MAB [62.6067511147939]
We propose Grab-UCB, a graph- kernel multi-arms bandit algorithm to learn online the optimal source placement in large scale networks.
We describe the network processes with an adaptive graph dictionary model, which typically leads to sparse spectral representations.
We derive the performance guarantees that depend on network parameters, which further influence the learning curve of the sequential decision strategy.
arXiv Detail & Related papers (2023-07-07T15:03:42Z) - Deep Learning-Based Synchronization for Uplink NB-IoT [72.86843435313048]
We propose a neural network (NN)-based algorithm for device detection and time of arrival (ToA) estimation for the narrowband physical random-access channel (NPRACH) of narrowband internet of things (NB-IoT)
The introduced NN architecture leverages residual convolutional networks as well as knowledge of the preamble structure of the 5G New Radio (5G NR) specifications.
arXiv Detail & Related papers (2022-05-22T12:16:43Z) - Learning Optimal Antenna Tilt Control Policies: A Contextual Linear
Bandit Approach [65.27783264330711]
Controlling antenna tilts in cellular networks is imperative to reach an efficient trade-off between network coverage and capacity.
We devise algorithms learning optimal tilt control policies from existing data.
We show that they can produce optimal tilt update policy using much fewer data samples than naive or existing rule-based learning algorithms.
arXiv Detail & Related papers (2022-01-06T18:24:30Z) - Reinforcement Learning Algorithm for Traffic Steering in Heterogeneous
Network [0.609170287691728]
This paper proposes a novel traffic steering algorithm for usage in HetNets.
The novel algorithm was compared with two reference algorithms using network simulation results.
arXiv Detail & Related papers (2021-11-29T23:56:18Z) - Learning to Estimate RIS-Aided mmWave Channels [50.15279409856091]
We focus on uplink cascaded channel estimation, where known and fixed base station combining and RIS phase control matrices are considered for collecting observations.
To boost the estimation performance and reduce the training overhead, the inherent channel sparsity of mmWave channels is leveraged in the deep unfolding method.
It is verified that the proposed deep unfolding network architecture can outperform the least squares (LS) method with a relatively smaller training overhead and online computational complexity.
arXiv Detail & Related papers (2021-07-27T06:57:56Z) - Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement
Learning [1.2330326247154968]
Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks.
In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs.
We propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio mapping to solve the problem.
arXiv Detail & Related papers (2021-04-09T16:52:33Z) - Fast Convergence Algorithm for Analog Federated Learning [30.399830943617772]
We propose an AirComp-based FedSplit algorithm for efficient analog federated learning over wireless channels.
We prove that the proposed algorithm linearly converges to the optimal solutions under the assumption that the objective function is strongly convex and smooth.
Our algorithm is theoretically and experimentally verified to be much more robust to the ill-conditioned problems with faster convergence compared with other benchmark FL algorithms.
arXiv Detail & Related papers (2020-10-30T10:59:49Z) - AIN: Fast and Accurate Sequence Labeling with Approximate Inference
Network [75.44925576268052]
The linear-chain Conditional Random Field (CRF) model is one of the most widely-used neural sequence labeling approaches.
Exact probabilistic inference algorithms are typically applied in training and prediction stages of the CRF model.
We propose to employ a parallelizable approximate variational inference algorithm for the CRF model.
arXiv Detail & Related papers (2020-09-17T12:18:43Z) - Deep Reinforcement Learning Control for Radar Detection and Tracking in
Congested Spectral Environments [8.103366584285645]
A radar learns to vary the bandwidth and center frequency of its linear frequency modulated (LFM) waveforms to mitigate mutual interference with other systems.
We extend the DQL-based approach to incorporate Double Q-learning and a recurrent neural network to form a Double Deep Recurrent Q-Network.
Our experimental results indicate that the proposed Deep RL approach significantly improves radar detection performance in congested spectral environments.
arXiv Detail & Related papers (2020-06-23T17:21:28Z) - Communication-Efficient Distributed Stochastic AUC Maximization with
Deep Neural Networks [50.42141893913188]
We study a distributed variable for large-scale AUC for a neural network as with a deep neural network.
Our model requires a much less number of communication rounds and still a number of communication rounds in theory.
Our experiments on several datasets show the effectiveness of our theory and also confirm our theory.
arXiv Detail & Related papers (2020-05-05T18:08:23Z) - Deep Learning for Frame Error Prediction using a DARPA Spectrum
Collaboration Challenge (SC2) Dataset [4.855663359344748]
We demonstrate a first example for employing deep learning in predicting frame errors for a Collaborative Intelligent Radio Network (CIRN) using a dataset collected during participation in the final scrimmages of the SC2 challenge.
Four scenarios are considered based on randomizing or fixing the strategy for bandwidth and channel allocation, and either training and testing with different links or using a pilot phase for each link to train the deep neural network.
The obtained insights open the door for implementing a deep-learning-based strategy that is scalable to large heterogeneous networks, generalizable to diverse wireless environments, and suitable for predicting frame error instances and rates within a congested shared
arXiv Detail & Related papers (2020-03-22T04:53:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.