DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection
- URL: http://arxiv.org/abs/2108.13157v1
- Date: Tue, 24 Aug 2021 19:15:57 GMT
- Title: DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection
- Authors: Zohreh Hajiakhondi-Meybodi, Arash Mohammadi, Ming Hou, Konstantinos N.
Plataniotis
- Abstract summary: We introduce an efficient node selection framework to enhance the location accuracy without using complex Non Line of Sight (NLoS) mitigation methods.
A mobile user is autonomously trained to determine the optimal pair of UWB beacons to be localized based on the 2-D Time Difference of Arrival (TDoA) framework.
The effectiveness of the proposed DQLEL framework is evaluated in terms of the link condition, the deviation of the remaining battery life of UWB beacons, location error, and cumulative rewards.
- Score: 39.51375711937119
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advancements in Internet of Things (IoTs) have brought about a surge
of interest in indoor positioning for the purpose of providing reliable,
accurate, and energy-efficient indoor navigation/localization systems. Ultra
Wide Band (UWB) technology has been emerged as a potential candidate to satisfy
the aforementioned requirements. Although UWB technology can enhance the
accuracy of indoor positioning due to the use of a wide-frequency spectrum,
there are key challenges ahead for its efficient implementation. On the one
hand, achieving high precision in positioning relies on the
identification/mitigation Non Line of Sight (NLoS) links, leading to a
significant increase in the complexity of the localization framework. On the
other hand, UWB beacons have a limited battery life, which is especially
problematic in practical circumstances with certain beacons located in
strategic positions. To address these challenges, we introduce an efficient
node selection framework to enhance the location accuracy without using complex
NLoS mitigation methods, while maintaining a balance between the remaining
battery life of UWB beacons. Referred to as the Deep Q-Learning
Energy-optimized LoS/NLoS (DQLEL) UWB node selection framework, the mobile user
is autonomously trained to determine the optimal pair of UWB beacons to be
localized based on the 2-D Time Difference of Arrival (TDoA) framework. The
effectiveness of the proposed DQLEL framework is evaluated in terms of the link
condition, the deviation of the remaining battery life of UWB beacons, location
error, and cumulative rewards. Based on the simulation results, the proposed
DQLEL framework significantly outperformed its counterparts across the
aforementioned aspects.
Related papers
- Adaptive Resource Allocation for Virtualized Base Stations in O-RAN with
Online Learning [60.17407932691429]
Open Radio Access Network systems, with their base stations (vBSs), offer operators the benefits of increased flexibility, reduced costs, vendor diversity, and interoperability.
We propose an online learning algorithm that balances the effective throughput and vBS energy consumption, even under unforeseeable and "challenging'' environments.
We prove the proposed solutions achieve sub-linear regret, providing zero average optimality gap even in challenging environments.
arXiv Detail & Related papers (2023-09-04T17:30:21Z) - On the Effective Usage of Priors in RSS-based Localization [56.68864078417909]
We propose a Received Signal Strength (RSS) fingerprint and convolutional neural network-based algorithm, LocUNet.
In this paper, we study the localization problem in dense urban settings.
We first recognize LocUNet's ability to learn the underlying prior distribution of the Rx position or Rx and transmitter (Tx) association preferences from the training data, and attribute its high performance to these.
arXiv Detail & Related papers (2022-11-28T00:31:02Z) - Deep reinforcement learning for automatic run-time adaptation of UWB PHY
radio settings [3.7885834570803842]
We propose a deep Q-learning approach for enabling reliable UWB communication, maximizing packet reception rate (PRR) and minimizing energy consumption.
We found that deep Q-learning achieves a higher average PRR and reduces the ranging error while using only 14% of the energy compared to a fixed PHY setting in a dynamic office environment.
arXiv Detail & Related papers (2022-10-13T13:47:12Z) - Deep Reinforcement Learning in mmW-NOMA: Joint Power Allocation and
Hybrid Beamforming [0.0]
High demand of data rate could be ensured by Non-Orthogonal Multiple Access (NOMA) approach in the millimetre-wave (mmW) frequency band.
Joint power allocation and hybrid beamforming of mmW-NOMA systems is brought up via recent advances in machine learning and control theory approaches.
arXiv Detail & Related papers (2022-05-13T07:55:48Z) - JUNO: Jump-Start Reinforcement Learning-based Node Selection for UWB
Indoor Localization [16.633804827001285]
Performance of UWB-based localization systems can significantly degrade because of Non Line of Sight (NLoS) connections between a mobile user and UWB beacons.
We propose a Jump-start RL-based UWB NOde selection framework, which performs real-time location predictions without relying on complex NLoS identification/mitigation methods.
The effectiveness of the proposed JUNO framework is evaluated in term of the location error, where the mobile user moves randomly through an ultra-dense indoor environment with a high chance of establishing NLoS connections.
arXiv Detail & Related papers (2022-05-06T23:48:49Z) - Federated Learning for Energy-limited Wireless Networks: A Partial Model
Aggregation Approach [79.59560136273917]
limited communication resources, bandwidth and energy, and data heterogeneity across devices are main bottlenecks for federated learning (FL)
We first devise a novel FL framework with partial model aggregation (PMA)
The proposed PMA-FL improves 2.72% and 11.6% accuracy on two typical heterogeneous datasets.
arXiv Detail & Related papers (2022-04-20T19:09:52Z) - Anomaly Detection Based on Generalized Gaussian Distribution approach
for Ultra-Wideband (UWB) Indoor Positioning System [1.8619310493118186]
This paper focuses on employing an anomaly detection approach based on Gaussian Distribution (GD) and Generalized Gaussian Distribution (GGD) algorithms to detect and identify the Non-Line-of-Sight (NLoS) components.
The simulation results indicate that the proposed approach can provide a robust NLoS component identification which improves the NLoS signal classification accuracy.
arXiv Detail & Related papers (2021-08-09T11:54:11Z) - JUMBO: Scalable Multi-task Bayesian Optimization using Offline Data [86.8949732640035]
We propose JUMBO, an MBO algorithm that sidesteps limitations by querying additional data.
We show that it achieves no-regret under conditions analogous to GP-UCB.
Empirically, we demonstrate significant performance improvements over existing approaches on two real-world optimization problems.
arXiv Detail & Related papers (2021-06-02T05:03:38Z) - Learning-based Bias Correction for Time Difference of Arrival
Ultra-wideband Localization of Resource-constrained Mobile Robots [8.016760287602084]
Ultra-wideband (UWB) time difference of arrival (TDOA)-based localization is a promising lightweight, low-cost solution that can scale to a large number of devices.
However, the localization accuracy of standard, commercially available UWB radios is often insufficient due to significant measurement bias and outliers.
We propose a robust UWB TDOA localization framework comprising of (i) learning-based bias correction and (ii) M-estimation-based robust filtering to handle outliers.
arXiv Detail & Related papers (2021-03-02T17:31:32Z) - RIS Enhanced Massive Non-orthogonal Multiple Access Networks: Deployment
and Passive Beamforming Design [116.88396201197533]
A novel framework is proposed for the deployment and passive beamforming design of a reconfigurable intelligent surface (RIS)
The problem of joint deployment, phase shift design, as well as power allocation is formulated for maximizing the energy efficiency.
A novel long short-term memory (LSTM) based echo state network (ESN) algorithm is proposed to predict users' tele-traffic demand by leveraging a real dataset.
A decaying double deep Q-network (D3QN) based position-acquisition and phase-control algorithm is proposed to solve the joint problem of deployment and design of the RIS.
arXiv Detail & Related papers (2020-01-28T14:37:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.