Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning
- URL: http://arxiv.org/abs/2602.08965v2
- Date: Wed, 11 Feb 2026 13:26:34 GMT
- Title: Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning
- Authors: John Gardiner, Orlando Romero, Brendan Tivnan, Nicolò Dal Fabbro, George J. Pappas,
- Abstract summary: This work introduces the first framework for training MARL agents to exploit shared quantum entanglement as a coordination resource.<n>We show that our machinery can learn policies with quantum advantage in an illustrative multi-agent sequential decision-making problem.
- Score: 18.2643909564315
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The inability to communicate poses a major challenge to coordination in multi-agent reinforcement learning (MARL). Prior work has explored correlating local policies via shared randomness, sometimes in the form of a correlation device, as a mechanism to assist in decentralized decision-making. In contrast, this work introduces the first framework for training MARL agents to exploit shared quantum entanglement as a coordination resource, which permits a larger class of communication-free correlated policies than shared randomness alone. This is motivated by well-known results in quantum physics which posit that, for certain single-round cooperative games with no communication, shared quantum entanglement enables strategies that outperform those that only use shared randomness. In such cases, we say that there is quantum advantage. Our framework is based on a novel differentiable policy parameterization that enables optimization over quantum measurements, together with a novel policy architecture that decomposes joint policies into a quantum coordinator and decentralized local actors. To illustrate the effectiveness of our proposed method, we first show that we can learn, purely from experience, strategies that attain quantum advantage in single-round games that are treated as black box oracles. We then demonstrate how our machinery can learn policies with quantum advantage in an illustrative multi-agent sequential decision-making problem formulated as a decentralized partially observable Markov decision process (Dec-POMDP).
Related papers
- Challenge-Response Quantum Reinforcement Learning with Application to Quantum-Assisted Authentication [0.0]
We propose a quantum reinforcement learning environment formulated as a challenge-response task with hidden information.<n>We consider three agents: a purely classical agent, a lightweight hybrid agent and a deep hybrid agent.<n>Our results show that the lightweight hybrid agent achieves reliable inference using as few as two quantum state copies.
arXiv Detail & Related papers (2026-02-12T22:44:51Z) - Reinforcement Learning for Quantum Network Control with Application-Driven Objectives [53.03367590211247]
Dynamic programming and reinforcement learning offer promising tools for optimizing control strategies.<n>We propose a novel RL framework that directly optimize non-linear, differentiable objective functions.<n>Our work comprises the first step towards non-linear objective function optimization in quantum networks with RL, opening a path towards more advanced use cases.
arXiv Detail & Related papers (2025-09-12T18:41:10Z) - eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels [98.314893665023]
Quantum computing has sparked a potential synergy between quantum entanglement and cooperation in multi-agent environments.<n>Current state-of-the-art quantum MARL (QMARL) implementations rely on classical information sharing.<n>eQMARL is a distributed actor-critic framework that facilitates cooperation over a quantum channel.
arXiv Detail & Related papers (2024-05-24T18:43:05Z) - Separable Power of Classical and Quantum Learning Protocols Through the Lens of No-Free-Lunch Theorem [70.42372213666553]
The No-Free-Lunch (NFL) theorem quantifies problem- and data-independent generalization errors regardless of the optimization process.
We categorize a diverse array of quantum learning algorithms into three learning protocols designed for learning quantum dynamics under a specified observable.
Our derived NFL theorems demonstrate quadratic reductions in sample complexity across CLC-LPs, ReQu-LPs, and Qu-LPs.
We attribute this performance discrepancy to the unique capacity of quantum-related learning protocols to indirectly utilize information concerning the global phases of non-orthogonal quantum states.
arXiv Detail & Related papers (2024-05-12T09:05:13Z) - The Quantum Advantage in Binary Teams and the Coordination Dilemma: Part
I [10.312968200748116]
We show that entanglement assisted strategies allow access to strategic measures beyond the classically correlated measures accessible through common randomness.
In this part, identify the only problem classes that benefit from quantum strategies.
We find that these cost structures admit a special decision-theoretic feature -- the coordination dilemma'
arXiv Detail & Related papers (2023-07-04T15:05:07Z) - Classical Verification of Quantum Learning [42.362388367152256]
We develop a framework for classical verification of quantum learning.
We propose a new quantum data access model that we call "mixture-of-superpositions" quantum examples.
Our results demonstrate that the potential power of quantum data for learning tasks, while not unlimited, can be utilized by classical agents.
arXiv Detail & Related papers (2023-06-08T00:31:27Z) - Experimental violations of Leggett-Garg's inequalities on a quantum
computer [77.34726150561087]
We experimentally observe the violations of Leggett-Garg-Bell's inequalities on single and multi-qubit systems.
Our analysis highlights the limits of nowadays quantum platforms, showing that the above-mentioned correlation functions deviate from theoretical prediction as the number of qubits and the depth of the circuit grow.
arXiv Detail & Related papers (2021-09-06T14:35:15Z) - Quantum communication complexity beyond Bell nonlocality [87.70068711362255]
Efficient distributed computing offers a scalable strategy for solving resource-demanding tasks.
Quantum resources are well-suited to this task, offering clear strategies that can outperform classical counterparts.
We prove that a new class of communication complexity tasks can be associated to Bell-like inequalities.
arXiv Detail & Related papers (2021-06-11T18:00:09Z) - Converting coherence based on positive-operator-valued measures into
entanglement [2.624902795082451]
Quantum coherence is the basic ingredient in many quantum information tasks.
We show that the block coherence can be transformed into entanglement via a block incoherent operation.
arXiv Detail & Related papers (2020-10-31T09:16:26Z) - Policies for elementary links in a quantum network [0.0]
An important problem, especially for near-term quantum networks, is to develop optimal entanglement distribution protocols.
We address this problem by initiating the study of quantum network protocols for entanglement distribution using the theory of decision processes.
We show that the previously-studied memory-cutoff protocol can be phrased as a policy within our decision process framework.
arXiv Detail & Related papers (2020-07-07T04:10:41Z) - Genuine quantum networks: superposed tasks and addressing [68.8204255655161]
We show how to make quantum networks, both standard and entanglement-based, genuine quantum.
We provide them with the possibility of handling superposed tasks and superposed addressing.
arXiv Detail & Related papers (2020-04-30T18:00:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.