Polynomial-Time Exact MAP Inference on Discrete Models with Global
Dependencies
- URL: http://arxiv.org/abs/1912.12090v3
- Date: Wed, 9 Feb 2022 09:14:39 GMT
- Title: Polynomial-Time Exact MAP Inference on Discrete Models with Global
Dependencies
- Authors: Alexander Bauer, Shinichi Nakajima
- Abstract summary: junction tree algorithm is the most general solution for exact MAP inference with run-time guarantees.
We propose a new graph transformation technique via node cloning which ensures a run-time for solving our target problem independently of the form of a corresponding clique tree.
- Score: 83.05591911173332
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Considering the worst-case scenario, junction tree algorithm remains the most
general solution for exact MAP inference with polynomial run-time guarantees.
Unfortunately, its main tractability assumption requires the treewidth of a
corresponding MRF to be bounded strongly limiting the range of admissible
applications. In fact, many practical problems in the area of structured
prediction require modelling of global dependencies by either directly
introducing global factors or enforcing global constraints on the prediction
variables. That, however, always results in a fully-connected graph making
exact inference by means of this algorithm intractable. Previous work [1]-[4]
focusing on the problem of loss-augmented inference has demonstrated how
efficient inference can be performed on models with specific global factors
representing non-decomposable loss functions within the training regime of
SSVMs. In this paper, we extend the framework for an efficient exact inference
proposed in in [3] by allowing much finer interactions between the energy of
the core model and the sufficient statistics of the global terms with no
additional computation costs. We demonstrate the usefulness of our method in
several use cases, including one that cannot be handled by any of the previous
approaches. Finally, we propose a new graph transformation technique via node
cloning which ensures a polynomial run-time for solving our target problem
independently of the form of a corresponding clique tree. This is important for
the efficiency of the main algorithm and greatly improves upon the theoretical
guarantees of the previous works.
Related papers
- Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z) - Joint Graph Learning and Model Fitting in Laplacian Regularized
Stratified Models [5.933030735757292]
Laplacian regularized stratified models (LRSM) are models that utilize the explicit or implicit network structure of the sub-problems.
This paper shows the importance and sensitivity of graph weights in LRSM, and provably show that the sensitivity can be arbitrarily large.
We propose a generic approach to jointly learn the graph while fitting the model parameters by solving a single optimization problem.
arXiv Detail & Related papers (2023-05-04T06:06:29Z) - Federated Learning as Variational Inference: A Scalable Expectation
Propagation Approach [66.9033666087719]
This paper extends the inference view and describes a variational inference formulation of federated learning.
We apply FedEP on standard federated learning benchmarks and find that it outperforms strong baselines in terms of both convergence speed and accuracy.
arXiv Detail & Related papers (2023-02-08T17:58:11Z) - Training and Inference on Any-Order Autoregressive Models the Right Way [97.39464776373902]
A family of Any-Order Autoregressive Models (AO-ARMs) has shown breakthrough performance in arbitrary conditional tasks.
We identify significant improvements to be made to previous formulations of AO-ARMs.
Our method leads to improved performance with no compromises on tractability.
arXiv Detail & Related papers (2022-05-26T18:00:02Z) - Domain Generalization via Domain-based Covariance Minimization [4.414778226415752]
We propose a novel variance measurement for multiple domains so as to minimize the difference between conditional distributions across domains.
We show that for small-scale datasets, we are able to achieve better quantitative results indicating better generalization performance over unseen test datasets.
arXiv Detail & Related papers (2021-10-12T19:30:15Z) - Efficient semidefinite-programming-based inference for binary and
multi-class MRFs [83.09715052229782]
We propose an efficient method for computing the partition function or MAP estimate in a pairwise MRF.
We extend semidefinite relaxations from the typical binary MRF to the full multi-class setting, and develop a compact semidefinite relaxation that can again be solved efficiently using the solver.
arXiv Detail & Related papers (2020-12-04T15:36:29Z) - Efficient Consensus Model based on Proximal Gradient Method applied to
Convolutional Sparse Problems [2.335152769484957]
We derive and detail a theoretical analysis of an efficient consensus algorithm based on gradient proximal (PG) approach.
The proposed algorithm is also applied to another particular convolutional problem for the anomaly detection task.
arXiv Detail & Related papers (2020-11-19T20:52:48Z) - Slice Sampling for General Completely Random Measures [74.24975039689893]
We present a novel Markov chain Monte Carlo algorithm for posterior inference that adaptively sets the truncation level using auxiliary slice variables.
The efficacy of the proposed algorithm is evaluated on several popular nonparametric models.
arXiv Detail & Related papers (2020-06-24T17:53:53Z) - Consistent Second-Order Conic Integer Programming for Learning Bayesian
Networks [2.7473982588529653]
We study the problem of learning the sparse DAG structure of a BN from continuous observational data.
The optimal solution to this mathematical program is known to have desirable statistical properties under certain conditions.
We propose a concrete early stopping criterion to terminate the branch-and-bound process in order to obtain a near-optimal solution.
arXiv Detail & Related papers (2020-05-29T00:13:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.