JUNO: Jump-Start Reinforcement Learning-based Node Selection for UWB
Indoor Localization
- URL: http://arxiv.org/abs/2205.08422v1
- Date: Fri, 6 May 2022 23:48:49 GMT
- Title: JUNO: Jump-Start Reinforcement Learning-based Node Selection for UWB
Indoor Localization
- Authors: Zohreh Hajiakhondi-Meybodi, Ming Hou, Arash Mohammadi
- Abstract summary: Performance of UWB-based localization systems can significantly degrade because of Non Line of Sight (NLoS) connections between a mobile user and UWB beacons.
We propose a Jump-start RL-based UWB NOde selection framework, which performs real-time location predictions without relying on complex NLoS identification/mitigation methods.
The effectiveness of the proposed JUNO framework is evaluated in term of the location error, where the mobile user moves randomly through an ultra-dense indoor environment with a high chance of establishing NLoS connections.
- Score: 16.633804827001285
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Ultra-Wideband (UWB) is one of the key technologies empowering the Internet
of Thing (IoT) concept to perform reliable, energy-efficient, and highly
accurate monitoring, screening, and localization in indoor environments.
Performance of UWB-based localization systems, however, can significantly
degrade because of Non Line of Sight (NLoS) connections between a mobile user
and UWB beacons. To mitigate the destructive effects of NLoS connections, we
target development of a Reinforcement Learning (RL) anchor selection framework
that can efficiently cope with the dynamic nature of indoor environments.
Existing RL models in this context, however, lack the ability to generalize
well to be used in a new setting. Moreover, it takes a long time for the
conventional RL models to reach the optimal policy. To tackle these challenges,
we propose the Jump-start RL-based Uwb NOde selection (JUNO) framework, which
performs real-time location predictions without relying on complex NLoS
identification/mitigation methods. The effectiveness of the proposed JUNO
framework is evaluated in term of the location error, where the mobile user
moves randomly through an ultra-dense indoor environment with a high chance of
establishing NLoS connections. Simulation results corroborate the effectiveness
of the proposed framework in comparison to its state-of-the-art counterparts.
Related papers
- Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning [67.95280175998792]
A novel adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) association ins.
We employ inverse RL (IRL) to automatically learn reward functions without manual tuning.
We show that the proposed MA-AL method outperforms traditional RL approaches, achieving a $14.6%$ improvement in convergence and reward value.
arXiv Detail & Related papers (2024-09-27T13:05:02Z) - HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection [66.42229859018775]
We introduce a unified, high-capacity weakly supervised object detection (WSOD) network called HUWSOD.
HUWSOD incorporates a self-supervised proposal generator and an autoencoder proposal generator with a multi-rate re-supervised pyramid to replace traditional object proposals.
Our findings indicate that randomly boxes, although significantly different from well-designed offline object proposals, are effective for WSOD training.
arXiv Detail & Related papers (2024-06-27T17:59:49Z) - Design Optimization of NOMA Aided Multi-STAR-RIS for Indoor Environments: A Convex Approximation Imitated Reinforcement Learning Approach [51.63921041249406]
Non-orthogonal multiple access (NOMA) enables multiple users to share the same frequency band, and simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)
deploying STAR-RIS indoors presents challenges in interference mitigation, power consumption, and real-time configuration.
A novel network architecture utilizing multiple access points (APs), STAR-RISs, and NOMA is proposed for indoor communication.
arXiv Detail & Related papers (2024-06-19T07:17:04Z) - Stragglers-Aware Low-Latency Synchronous Federated Learning via Layer-Wise Model Updates [71.81037644563217]
Synchronous federated learning (FL) is a popular paradigm for collaborative edge learning.
As some of the devices may have limited computational resources and varying availability, FL latency is highly sensitive to stragglers.
We propose straggler-aware layer-wise federated learning (SALF) that leverages the optimization procedure of NNs via backpropagation to update the global model in a layer-wise fashion.
arXiv Detail & Related papers (2024-03-27T09:14:36Z) - Federated Learning Under Restricted User Availability [3.0846824529023387]
Non-uniform availability or participation of users is unavoidable due to an adverse or environment.
We propose a new formulation of the FL problem which effectively captures and mitigates limited participation of data originating from infrequent, or restricted users.
Our experiments on synthetic and benchmark datasets show that the proposed approach significantly improved performance as compared with standard FL.
arXiv Detail & Related papers (2023-09-25T14:40:27Z) - FedSpeed: Larger Local Interval, Less Communication Round, and Higher
Generalization Accuracy [84.45004766136663]
Federated learning is an emerging distributed machine learning framework.
It suffers from the non-vanishing biases introduced by the local inconsistent optimal and the rugged client-drifts by the local over-fitting.
We propose a novel and practical method, FedSpeed, to alleviate the negative impacts posed by these problems.
arXiv Detail & Related papers (2023-02-21T03:55:29Z) - On the Effective Usage of Priors in RSS-based Localization [56.68864078417909]
We propose a Received Signal Strength (RSS) fingerprint and convolutional neural network-based algorithm, LocUNet.
In this paper, we study the localization problem in dense urban settings.
We first recognize LocUNet's ability to learn the underlying prior distribution of the Rx position or Rx and transmitter (Tx) association preferences from the training data, and attribute its high performance to these.
arXiv Detail & Related papers (2022-11-28T00:31:02Z) - Hybrid Indoor Localization via Reinforcement Learning-based Information
Fusion [17.079430640475962]
The paper is motivated by the importance of the Smart Cities (SC) concept for future management of global urbanization.
Among all Internet of Things (IoT)-based communication technologies, Bluetooth Low Energy (BLE) plays a vital role in city-wide decision making and services.
arXiv Detail & Related papers (2022-10-27T02:38:25Z) - Ensembles of Compact, Region-specific & Regularized Spiking Neural
Networks for Scalable Place Recognition [25.0834855255728]
Spiking neural networks have significant potential in robotics due to their high energy efficiency on specialized hardware.
This paper introduces a novel modular ensemble network approach, where compact, localized spiking networks each learn and are solely responsible for recognizing places in a local region only.
It comes with a high-performance cost where a lack of global regularization at deployment time leads to hyperactive neurons that erroneously respond to places outside their learned region.
We evaluate this new scalable modular system on benchmark localization datasets Nordland and Oxford RobotCar, with comparisons to standard techniques NetVLAD, DenseVLAD, and SAD, and a previous spiking
arXiv Detail & Related papers (2022-09-19T02:47:48Z) - DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection [39.51375711937119]
We introduce an efficient node selection framework to enhance the location accuracy without using complex Non Line of Sight (NLoS) mitigation methods.
A mobile user is autonomously trained to determine the optimal pair of UWB beacons to be localized based on the 2-D Time Difference of Arrival (TDoA) framework.
The effectiveness of the proposed DQLEL framework is evaluated in terms of the link condition, the deviation of the remaining battery life of UWB beacons, location error, and cumulative rewards.
arXiv Detail & Related papers (2021-08-24T19:15:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.