A Reinforcement Learning based approach for Multi-target Detection in
Massive MIMO radar
- URL: http://arxiv.org/abs/2005.04708v4
- Date: Tue, 2 Mar 2021 11:35:32 GMT
- Title: A Reinforcement Learning based approach for Multi-target Detection in
Massive MIMO radar
- Authors: Aya Mostafa Ahmed, Alaa Alameer Ahmad, Stefano Fortunati, Aydin
Sezgin, Maria S. Greco, Fulvio Gini
- Abstract summary: This paper considers the problem of multi-target detection for massive multiple input multiple output (MMIMO) cognitive radar (CR)
We propose a reinforcement learning (RL) based algorithm for cognitive multi-target detection in the presence of unknown disturbance statistics.
Numerical simulations are performed to assess the performance of the proposed RL-based algorithm in both stationary and dynamic environments.
- Score: 12.982044791524494
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper considers the problem of multi-target detection for massive
multiple input multiple output (MMIMO) cognitive radar (CR). The concept of CR
is based on the perception-action cycle that senses and intelligently adapts to
the dynamic environment in order to optimally satisfy a specific mission.
However, this usually requires a priori knowledge of the environmental model,
which is not available in most cases. We propose a reinforcement learning (RL)
based algorithm for cognitive multi-target detection in the presence of unknown
disturbance statistics. The radar acts as an agent that continuously senses the
unknown environment (i.e., targets and disturbance) and consequently optimizes
transmitted waveforms in order to maximize the probability of detection
($P_\mathsf{D}$) by focusing the energy in specific range-angle cells (i.e.,
beamforming). Furthermore, we propose a solution to the beamforming
optimization problem with less complexity than the existing methods. Numerical
simulations are performed to assess the performance of the proposed RL-based
algorithm in both stationary and dynamic environments. The RL based beamforming
is compared to the conventional omnidirectional approach with equal power
allocation and to adaptive beamforming with no RL. As highlighted by the
proposed numerical results, our RL-based beamformer outperforms both approaches
in terms of target detection performance. The performance improvement is even
particularly remarkable under environmentally harsh conditions such as low SNR,
heavy-tailed disturbance and rapidly changing scenarios.
Related papers
- A DRL-based Reflection Enhancement Method for RIS-assisted
Multi-receiver Communications [4.598835930908191]
Superposition of multiple single-reflection profiles enables multi-reflection for distributed users.
The combination of periodical single-reflection profiles leads to amplitude/phase counteractions, affecting the performance of each reflection beam.
This paper focuses on a dual-reflection optimization scenario and investigates the far-field performance deterioration caused by the misalignment of overlapped profiles.
arXiv Detail & Related papers (2023-09-11T09:43:59Z) - Provably Efficient Algorithm for Nonstationary Low-Rank MDPs [48.92657638730582]
We make the first effort to investigate nonstationary RL under episodic low-rank MDPs, where both transition kernels and rewards may vary over time.
We propose a parameter-dependent policy optimization algorithm called PORTAL, and further improve PORTAL to its parameter-free version of Ada-PORTAL.
For both algorithms, we provide upper bounds on the average dynamic suboptimality gap, which show that as long as the nonstationarity is not significantly large, PORTAL and Ada-PORTAL are sample-efficient and can achieve arbitrarily small average dynamic suboptimality gap with sample complexity.
arXiv Detail & Related papers (2023-08-10T09:52:44Z) - Lyapunov-Driven Deep Reinforcement Learning for Edge Inference Empowered
by Reconfigurable Intelligent Surfaces [30.1512069754603]
We propose a novel algorithm for energy-efficient, low-latency, accurate inference at the wireless edge.
We consider a scenario where new data are continuously generated/collected by a set of devices and are handled through a dynamic queueing system.
arXiv Detail & Related papers (2023-05-18T12:46:42Z) - False Correlation Reduction for Offline Reinforcement Learning [115.11954432080749]
We propose falSe COrrelation REduction (SCORE) for offline RL, a practically effective and theoretically provable algorithm.
We empirically show that SCORE achieves the SoTA performance with 3.1x acceleration on various tasks in a standard benchmark (D4RL)
arXiv Detail & Related papers (2021-10-24T15:34:03Z) - Fast Distributionally Robust Learning with Variance Reduced Min-Max
Optimization [85.84019017587477]
Distributionally robust supervised learning is emerging as a key paradigm for building reliable machine learning systems for real-world applications.
Existing algorithms for solving Wasserstein DRSL involve solving complex subproblems or fail to make use of gradients.
We revisit Wasserstein DRSL through the lens of min-max optimization and derive scalable and efficiently implementable extra-gradient algorithms.
arXiv Detail & Related papers (2021-04-27T16:56:09Z) - Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
Drone Networks [151.27147513363502]
This paper studies the problem of the trajectory design for a group of energyconstrained drones operating in dynamic wireless network environments.
A value based reinforcement learning (VDRL) solution and a metatraining mechanism is proposed.
arXiv Detail & Related papers (2020-12-06T01:30:12Z) - Optimization-driven Machine Learning for Intelligent Reflecting Surfaces
Assisted Wireless Networks [82.33619654835348]
Intelligent surface (IRS) has been employed to reshape the wireless channels by controlling individual scattering elements' phase shifts.
Due to the large size of scattering elements, the passive beamforming is typically challenged by the high computational complexity.
In this article, we focus on machine learning (ML) approaches for performance in IRS-assisted wireless networks.
arXiv Detail & Related papers (2020-08-29T08:39:43Z) - Deep Reinforcement Learning Control for Radar Detection and Tracking in
Congested Spectral Environments [8.103366584285645]
A radar learns to vary the bandwidth and center frequency of its linear frequency modulated (LFM) waveforms to mitigate mutual interference with other systems.
We extend the DQL-based approach to incorporate Double Q-learning and a recurrent neural network to form a Double Deep Recurrent Q-Network.
Our experimental results indicate that the proposed Deep RL approach significantly improves radar detection performance in congested spectral environments.
arXiv Detail & Related papers (2020-06-23T17:21:28Z) - Optimization-driven Deep Reinforcement Learning for Robust Beamforming
in IRS-assisted Wireless Communications [54.610318402371185]
Intelligent reflecting surface (IRS) is a promising technology to assist downlink information transmissions from a multi-antenna access point (AP) to a receiver.
We minimize the AP's transmit power by a joint optimization of the AP's active beamforming and the IRS's passive beamforming.
We propose a deep reinforcement learning (DRL) approach that can adapt the beamforming strategies from past experiences.
arXiv Detail & Related papers (2020-05-25T01:42:55Z) - Reconfigurable Intelligent Surface Assisted Multiuser MISO Systems
Exploiting Deep Reinforcement Learning [21.770491711632832]
The reconfigurable intelligent surface (RIS) has been speculated as one of the key enabling technologies for the future six generation (6G) wireless communication systems.
In this paper, we investigate the joint design of transmit beamforming matrix at the base station and the phase shift matrix at the RIS, by leveraging recent advances in deep reinforcement learning (DRL)
The proposed algorithm is not only able to learn from the environment and gradually improve its behavior, but also obtains the comparable performance compared with two state-of-the-art benchmarks.
arXiv Detail & Related papers (2020-02-24T04:28:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.