Dissecting Quantum Reinforcement Learning: A Systematic Evaluation of Key Components
- URL: http://arxiv.org/abs/2511.17112v1
- Date: Fri, 21 Nov 2025 10:21:39 GMT
- Title: Dissecting Quantum Reinforcement Learning: A Systematic Evaluation of Key Components
- Authors: Javier Lazaro, Juan-Ignacio Vazquez, Pablo Garcia-Bringas,
- Abstract summary: Quantum Reinforcement Learning (QRL) has emerged as a promising paradigm at the intersection of quantum computing and reinforcement learning.<n>By design, PQCs create hybrid quantum-classical models, but their practical applicability remains uncertain.
- Score: 0.08921166277011346
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Parameterised quantum circuit (PQC) based Quantum Reinforcement Learning (QRL) has emerged as a promising paradigm at the intersection of quantum computing and reinforcement learning (RL). By design, PQCs create hybrid quantum-classical models, but their practical applicability remains uncertain due to training instabilities, barren plateaus (BPs), and the difficulty of isolating the contribution of individual pipeline components. In this work, we dissect PQC based QRL architectures through a systematic experimental evaluation of three aspects recurrently identified as critical: (i) data embedding strategies, with Data Reuploading (DR) as an advanced approach; (ii) ansatz design, particularly the role of entanglement; and (iii) post-processing blocks after quantum measurement, with a focus on the underexplored Output Reuse (OR) technique. Using a unified PPO-CartPole framework, we perform controlled comparisons between hybrid and classical agents under identical conditions. Our results show that OR, though purely classical, exhibits distinct behaviour in hybrid pipelines, that DR improves trainability and stability, and that stronger entanglement can degrade optimisation, offsetting classical gains. Together, these findings provide controlled empirical evidence of the interplay between quantum and classical contributions, and establish a reproducible framework for systematic benchmarking and component-wise analysis in QRL.
Related papers
- Quantum LEGO Learning: A Modular Design Principle for Hybrid Artificial Intelligence [63.39968536637762]
We introduce Quantum LEGO Learning, a learning framework that treats classical and quantum components as reusable, composable learning blocks.<n>Within this framework, a pre-trained classical neural network serves as a frozen feature block, while a VQC acts as a trainable adaptive module.<n>We develop a block-wise generalization theory that decomposes learning error into approximation and estimation components.
arXiv Detail & Related papers (2026-01-29T14:29:21Z) - Continual Quantum Architecture Search with Tensor-Train Encoding: Theory and Applications to Signal Processing [68.35481158940401]
CL-QAS is a continual quantum architecture search framework.<n>It mitigates challenges of costly encoding amplitude and forgetting in variational quantum circuits.<n>It achieves controllable robustness expressivity, sample-efficient generalization, and smooth convergence without barren plateaus.
arXiv Detail & Related papers (2026-01-10T02:36:03Z) - Benchmarking Quantum Data Center Architectures: A Performance and Scalability Perspective [13.628992375229247]
We study the impact of four representative quantum data-center architectures on distributed quantum circuit execution latency, resource contention, and scalability.<n>Our results show that distributed quantum performance is jointly shaped by topology, scheduling policies, and physical-layer parameters.
arXiv Detail & Related papers (2026-01-04T03:48:02Z) - Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures [2.005299372367689]
We introduce a quantum reinforcement learning (QRL) controller that dynamically adjusts CFG at each denoising step.<n>The controller adopts a hybrid quantum--classical actor--critic architecture.<n> Experiments on CIFAR-10 demonstrate that our QRL policy improves perceptual quality.
arXiv Detail & Related papers (2025-09-17T16:47:04Z) - Reinforcement Learning for Quantum Network Control with Application-Driven Objectives [53.03367590211247]
Dynamic programming and reinforcement learning offer promising tools for optimizing control strategies.<n>We propose a novel RL framework that directly optimize non-linear, differentiable objective functions.<n>Our work comprises the first step towards non-linear objective function optimization in quantum networks with RL, opening a path towards more advanced use cases.
arXiv Detail & Related papers (2025-09-12T18:41:10Z) - Parametrized Quantum Circuit Learning for Quantum Chemical Applications [6.0891078426115826]
Parametrized quantum circuits (PQCs) provide a promising hybrid framework for tackling complex machine learning problems.<n>In this study, we investigate the potential benefits and limitations of PQCs on two chemically meaningful datasets.<n>We construct a comprehensive set of 168 PQCs by combining 14 data encoding strategies with 12 variational ans"atze, and evaluate their performance on circuits with 5 and 16 qubits.
arXiv Detail & Related papers (2025-07-10T21:35:33Z) - VQC-MLPNet: An Unconventional Hybrid Quantum-Classical Architecture for Scalable and Robust Quantum Machine Learning [50.95799256262098]
Variational quantum circuits (VQCs) hold promise for quantum machine learning but face challenges in expressivity, trainability, and noise resilience.<n>We propose VQC-MLPNet, a hybrid architecture where a VQC generates the first-layer weights of a classical multilayer perceptron during training, while inference is performed entirely classically.
arXiv Detail & Related papers (2025-06-12T01:38:15Z) - Bayesian Quantum Amplitude Estimation [46.03321798937855]
We present BAE, a problem-tailored and noise-aware Bayesian algorithm for quantum amplitude estimation.<n>In a fault tolerant scenario, BAE is capable of saturating the Heisenberg limit; if device noise is present, BAE can dynamically characterize it and self-adapt.<n>We propose a benchmark for amplitude estimation algorithms and use it to test BAE against other approaches.
arXiv Detail & Related papers (2024-12-05T18:09:41Z) - Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning [3.6881738506505988]
We propose differentiable quantum architecture search (DiffQAS) to enable trainable circuit parameters and structure weights.
We show that our proposed DiffQAS-QRL approach achieves performance comparable to manually-crafted circuit architectures.
arXiv Detail & Related papers (2024-07-25T17:11:00Z) - Pre-training Tensor-Train Networks Facilitates Machine Learning with Variational Quantum Circuits [70.97518416003358]
Variational quantum circuits (VQCs) hold promise for quantum machine learning on noisy intermediate-scale quantum (NISQ) devices.
While tensor-train networks (TTNs) can enhance VQC representation and generalization, the resulting hybrid model, TTN-VQC, faces optimization challenges due to the Polyak-Lojasiewicz (PL) condition.
To mitigate this challenge, we introduce Pre+TTN-VQC, a pre-trained TTN model combined with a VQC.
arXiv Detail & Related papers (2023-05-18T03:08:18Z) - Benchmarking the Reliability of Post-training Quantization: a Particular
Focus on Worst-case Performance [53.45700148820669]
Post-training quantization (PTQ) is a popular method for compressing deep neural networks (DNNs) without modifying their original architecture or training procedures.
Despite its effectiveness and convenience, the reliability of PTQ methods in the presence of some extrem cases such as distribution shift and data noise remains largely unexplored.
This paper first investigates this problem on various commonly-used PTQ methods.
arXiv Detail & Related papers (2023-03-23T02:55:50Z) - Evolutionary Quantum Architecture Search for Parametrized Quantum
Circuits [7.298440208725654]
We introduce EQAS-PQC, an evolutionary quantum architecture search framework for PQC-based models.
We show that our method can significantly improve the performance of hybrid quantum-classical models.
arXiv Detail & Related papers (2022-08-23T19:47:37Z) - When BERT Meets Quantum Temporal Convolution Learning for Text
Classification in Heterogeneous Computing [75.75419308975746]
This work proposes a vertical federated learning architecture based on variational quantum circuits to demonstrate the competitive performance of a quantum-enhanced pre-trained BERT model for text classification.
Our experiments on intent classification show that our proposed BERT-QTC model attains competitive experimental results in the Snips and ATIS spoken language datasets.
arXiv Detail & Related papers (2022-02-17T09:55:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.