Sampling, Communication, and Prediction Co-Design for Synchronizing the
Real-World Device and Digital Model in Metaverse
- URL: http://arxiv.org/abs/2208.04233v1
- Date: Sun, 31 Jul 2022 20:17:31 GMT
- Title: Sampling, Communication, and Prediction Co-Design for Synchronizing the
Real-World Device and Digital Model in Metaverse
- Authors: Zhen Meng, Changyang She, Guodong Zhao, and Daniele De Martini
- Abstract summary: We develop a constrained Deep Reinforcement Learning (DRL) algorithm, named Knowledge-assisted Constrained Twin-Delayed Deep Deterministic (KC-TD3) policy gradient algorithm.
We validate our framework on a prototype composed of a real-world robotic arm and its digital model.
- Score: 14.326344469446434
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The metaverse has the potential to revolutionize the next generation of the
Internet by supporting highly interactive services with the help of Mixed
Reality (MR) technologies; still, to provide a satisfactory experience for
users, the synchronization between the physical world and its digital models is
crucial. This work proposes a sampling, communication and prediction co-design
framework to minimize the communication load subject to a constraint on
tracking the Mean Squared Error (MSE) between a real-world device and its
digital model in the metaverse. To optimize the sampling rate and the
prediction horizon, we exploit expert knowledge and develop a constrained Deep
Reinforcement Learning (DRL) algorithm, named Knowledge-assisted Constrained
Twin-Delayed Deep Deterministic (KC-TD3) policy gradient algorithm. We validate
our framework on a prototype composed of a real-world robotic arm and its
digital model. Compared with existing approaches: (1) When the tracking error
constraint is stringent (MSE=0.002 degrees), our policy degenerates into the
policy in the sampling-communication co-design framework. (2) When the tracking
error constraint is mild (MSE=0.007 degrees), our policy degenerates into the
policy in the prediction-communication co-design framework. (3) Our framework
achieves a better trade-off between the average MSE and the average
communication load compared with a communication system without sampling and
prediction. For example, the average communication load can be reduced up to
87% when the track error constraint is 0.002 degrees. (4) Our policy
outperforms the benchmark with the static sampling rate and prediction horizon
optimized by exhaustive search, in terms of the tail probability of the
tracking error. Furthermore, with the assistance of expert knowledge, the
proposed algorithm KC-TD3 achieves better convergence time, stability, and
final policy performance.
Related papers
- Smoothed Online Optimization for Target Tracking: Robust and Learning-Augmented Algorithms [14.970453355735225]
We introduce a new framework that integrates three key objectives in online decision-making under uncertainty.<n>We first present BEST, a robust algorithm with provable competitive guarantees for SOOTT.<n>To enhance practical performance, we introduce CoRT, a learning-augmented variant that incorporates untrusted black-box predictions.
arXiv Detail & Related papers (2025-09-07T05:22:41Z) - Temporal Conformal Prediction (TCP): A Distribution-Free Statistical and Machine Learning Framework for Adaptive Risk Forecasting [0.0]
Temporal Conformal Prediction ( TCP) is a principled framework for constructing well-calibrated prediction intervals for non-stationary time series.<n> TCP integrates a machine learning-based quantile forecaster with an online conformal calibration layer.
arXiv Detail & Related papers (2025-07-07T20:44:31Z) - ZIA: A Theoretical Framework for Zero-Input AI [0.0]
Zero-Input AI (ZIA) introduces a novel framework for human-computer interaction by enabling proactive intent prediction without explicit user commands.
It integrates gaze tracking, bio-signals (EEG, heart rate), and contextual data (time, location, usage history) into a multi-modal model for real-time inference.
ZIA provides a scalable, privacy-preserving framework for accessibility, healthcare, and consumer applications, advancing AI toward anticipatory intelligence.
arXiv Detail & Related papers (2025-02-22T07:42:05Z) - Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks [94.2860766709971]
We address the challenge of sampling and remote estimation for autoregressive Markovian processes in a wireless network with statistically-identical agents.
Our goal is to minimize time-average estimation error and/or age of information with decentralized scalable sampling and transmission policies.
arXiv Detail & Related papers (2024-04-04T06:24:11Z) - Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints [9.514904359788156]
We study semantic-aware communication for remote estimation of Markov sources over a lossy and rate-constrained channel.
We find an optimal scheduling policy that minimizes the long-term state-dependent costs of estimation errors under a transmission frequency constraint.
arXiv Detail & Related papers (2024-03-25T15:18:23Z) - Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
Error Quantification Framework [8.572441599469597]
We study high-confidence off-policy evaluation in the context of infinite-horizon Markov decision processes.
The objective is to establish a confidence interval (CI) for the target policy value using only offline data pre-collected from unknown behavior policies.
We show that our algorithm is sample-efficient, error-robust, and provably convergent even in non-linear function approximation settings.
arXiv Detail & Related papers (2023-09-23T06:35:44Z) - Communication-Efficient Zeroth-Order Distributed Online Optimization:
Algorithm, Theory, and Applications [9.045332526072828]
This paper focuses on a multi-agent zeroth-order online optimization problem in a federated learning setting for target tracking.
The proposed solution is further analyzed in terms of errors and errors in two relevant applications.
arXiv Detail & Related papers (2023-06-09T03:51:45Z) - Task-Oriented Prediction and Communication Co-Design for Haptic
Communications [20.20316445430853]
We propose a task-oriented prediction and communication co-design framework for haptic communications.
The proposed framework can reduce wireless resource consumption up to 77.80% compared with a task-agnostic benchmark.
arXiv Detail & Related papers (2023-02-21T23:39:37Z) - When Demonstrations Meet Generative World Models: A Maximum Likelihood
Framework for Offline Inverse Reinforcement Learning [62.00672284480755]
This paper aims to recover the structure of rewards and environment dynamics that underlie observed actions in a fixed, finite set of demonstrations from an expert agent.
Accurate models of expertise in executing a task has applications in safety-sensitive applications such as clinical decision making and autonomous driving.
arXiv Detail & Related papers (2023-02-15T04:14:20Z) - Collaborative Uncertainty Benefits Multi-Agent Multi-Modal Trajectory Forecasting [61.02295959343446]
This work first proposes a novel concept, collaborative uncertainty (CU), which models the uncertainty resulting from interaction modules.
We build a general CU-aware regression framework with an original permutation-equivariant uncertainty estimator to do both tasks of regression and uncertainty estimation.
We apply the proposed framework to current SOTA multi-agent trajectory forecasting systems as a plugin module.
arXiv Detail & Related papers (2022-07-11T21:17:41Z) - On Efficient Uncertainty Estimation for Resource-Constrained Mobile
Applications [0.0]
Predictive uncertainty supplements model predictions and enables improved functionality of downstream tasks.
We tackle this problem by building upon Monte Carlo Dropout (MCDO) models using the Axolotl framework.
We conduct experiments on (1) a multi-class classification task using the CIFAR10 dataset, and (2) a more complex human body segmentation task.
arXiv Detail & Related papers (2021-11-11T22:24:15Z) - Accelerating Federated Edge Learning via Optimized Probabilistic Device
Scheduling [57.271494741212166]
This paper formulates and solves the communication time minimization problem.
It is found that the optimized policy gradually turns its priority from suppressing the remaining communication rounds to reducing per-round latency as the training process evolves.
The effectiveness of the proposed scheme is demonstrated via a use case on collaborative 3D objective detection in autonomous driving.
arXiv Detail & Related papers (2021-07-24T11:39:17Z) - Congestion-aware Multi-agent Trajectory Prediction for Collision
Avoidance [110.63037190641414]
We propose to learn congestion patterns explicitly and devise a novel "Sense--Learn--Reason--Predict" framework.
By decomposing the learning phases into two stages, a "student" can learn contextual cues from a "teacher" while generating collision-free trajectories.
In experiments, we demonstrate that the proposed model is able to generate collision-free trajectory predictions in a synthetic dataset.
arXiv Detail & Related papers (2021-03-26T02:42:33Z) - Reconfigurable Intelligent Surface Assisted Mobile Edge Computing with
Heterogeneous Learning Tasks [53.1636151439562]
Mobile edge computing (MEC) provides a natural platform for AI applications.
We present an infrastructure to perform machine learning tasks at an MEC with the assistance of a reconfigurable intelligent surface (RIS)
Specifically, we minimize the learning error of all participating users by jointly optimizing transmit power of mobile users, beamforming vectors of the base station, and the phase-shift matrix of the RIS.
arXiv Detail & Related papers (2020-12-25T07:08:50Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.