Generative and discriminative training of Boltzmann machine through
Quantum annealing
- URL: http://arxiv.org/abs/2002.00792v3
- Date: Tue, 19 Jul 2022 18:51:32 GMT
- Title: Generative and discriminative training of Boltzmann machine through
Quantum annealing
- Authors: Siddhartha Srivastava, Veera Sundararaghavan
- Abstract summary: A hybrid quantum-classical method for learning Boltzmann machines (BM) is presented.
The cost function for learning BM is defined as a weighted sum of Kullback-Leibler (KL) divergence and Negative conditional Log-Likelihood (NCLL)
A Newton-Raphson optimization scheme is presented.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A hybrid quantum-classical method for learning Boltzmann machines (BM) for a
generative and discriminative task is presented. Boltzmann machines are
undirected graphs with a network of visible and hidden nodes where the former
is used as the reading site while the latter is used to manipulate visible
states' probability. In Generative BM, the samples of visible data imitate the
probability distribution of a given data set. In contrast, the visible sites of
discriminative BM are treated as Input/Output (I/O) reading sites where the
conditional probability of output state is optimized for a given set of input
states. The cost function for learning BM is defined as a weighted sum of
Kullback-Leibler (KL) divergence and Negative conditional Log-Likelihood
(NCLL), adjusted using a hyperparamter. Here, the KL Divergence is the cost for
generative learning, and NCLL is the cost for discriminative learning. A
Stochastic Newton-Raphson optimization scheme is presented. The gradients and
the Hessians are approximated using direct samples of BM obtained through
Quantum annealing (QA). Quantum annealers are hardware representing the physics
of the Ising model that operates on low but finite temperature. This
temperature affects the probability distribution of the BM; however, its value
is unknown. Previous efforts have focused on estimating this unknown
temperature through regression of theoretical Boltzmann energies of sampled
states with the probability of states sampled by the actual hardware. This
assumes that the control parameter change does not affect the system
temperature, however, this is not usually the case. Instead, an approach that
works on the probability distribution of samples, instead of the energies, is
proposed to estimate the optimal parameter set. This ensures that the optimal
set can be obtained from a single run.
Related papers
- Iterated Denoising Energy Matching for Sampling from Boltzmann Densities [109.23137009609519]
Iterated Denoising Energy Matching (iDEM)
iDEM alternates between (I) sampling regions of high model density from a diffusion-based sampler and (II) using these samples in our matching objective.
We show that the proposed approach achieves state-of-the-art performance on all metrics and trains $2-5times$ faster.
arXiv Detail & Related papers (2024-02-09T01:11:23Z) - Efficient quantum loading of probability distributions through Feynman
propagators [2.56711111236449]
We present quantum algorithms for the loading of probability distributions using Hamiltonian simulation for one dimensional Hamiltonians of the form $hat H= Delta + V(x) mathbbI$.
We consider the potentials $V(x)$ for which the Feynman propagator is known to have an analytically closed form and utilize these Hamiltonians to load probability distributions into quantum states.
arXiv Detail & Related papers (2023-11-22T21:41:58Z) - Gaussian boson sampling validation via detector binning [0.0]
We propose binned-detector probability distributions as a suitable quantity to statistically validate GBS experiments.
We show how to compute such distributions by leveraging their connection with their respective characteristic function.
We also illustrate how binned-detector probability distributions behave when Haar-averaged over all possible interferometric networks.
arXiv Detail & Related papers (2023-10-27T12:55:52Z) - Boltzmann sampling with quantum annealers via fast Stein correction [1.37736442859694]
A fast and approximate method is developed to compute the sample weights, and used to correct the samples generated by D-Wave quantum annealers.
In benchmarking problems, it is observed that the residual error of thermal average calculations is reduced significantly.
arXiv Detail & Related papers (2023-09-08T04:47:10Z) - Deterministic and Bayesian Characterization of Quantum Computing Devices [0.4194295877935867]
This paper presents a data-driven characterization approach for estimating transition frequencies and decay times in a superconducting quantum device.
The data includes parity events in the transition frequency between the first and second excited states.
A simple but effective mathematical model, based upon averaging solutions of two Lindbladian models, is demonstrated to accurately capture the experimental observations.
arXiv Detail & Related papers (2023-06-23T19:11:41Z) - User-defined Event Sampling and Uncertainty Quantification in Diffusion
Models for Physical Dynamical Systems [49.75149094527068]
We show that diffusion models can be adapted to make predictions and provide uncertainty quantification for chaotic dynamical systems.
We develop a probabilistic approximation scheme for the conditional score function which converges to the true distribution as the noise level decreases.
We are able to sample conditionally on nonlinear userdefined events at inference time, and matches data statistics even when sampling from the tails of the distribution.
arXiv Detail & Related papers (2023-06-13T03:42:03Z) - A hybrid quantum-classical approach for inference on restricted
Boltzmann machines [1.0928470926399563]
A Boltzmann machine is a powerful machine learning model with many real-world applications.
Statistical inference on a Boltzmann machine can be carried out by sampling from its posterior distribution.
Quantum computers have the promise of solving some non-trivial problems in an efficient manner.
arXiv Detail & Related papers (2023-03-31T11:10:31Z) - Importance sampling for stochastic quantum simulations [68.8204255655161]
We introduce the qDrift protocol, which builds random product formulas by sampling from the Hamiltonian according to the coefficients.
We show that the simulation cost can be reduced while achieving the same accuracy, by considering the individual simulation cost during the sampling stage.
Results are confirmed by numerical simulations performed on a lattice nuclear effective field theory.
arXiv Detail & Related papers (2022-12-12T15:06:32Z) - Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target [56.99669411766284]
As an autonomous system performs a task, it should maintain a calibrated estimate of the probability that it will achieve the user's goal.
This paper considers settings where the user's goal is specified as a target interval for a real-valued performance summary.
We compute the probability estimates by inverting conformal prediction.
arXiv Detail & Related papers (2022-11-29T18:41:20Z) - Bosonic field digitization for quantum computers [62.997667081978825]
We address the representation of lattice bosonic fields in a discretized field amplitude basis.
We develop methods to predict error scaling and present efficient qubit implementation strategies.
arXiv Detail & Related papers (2021-08-24T15:30:04Z) - Exposing the Implicit Energy Networks behind Masked Language Models via
Metropolis--Hastings [57.133639209759615]
We interpret sequences as energy-based sequence models and propose two energy parametrizations derivable from traineds.
We develop a tractable emph scheme based on the Metropolis-Hastings Monte Carlo algorithm.
We validate the effectiveness of the proposed parametrizations by exploring the quality of samples drawn from these energy-based models.
arXiv Detail & Related papers (2021-06-04T22:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.