Symbolic Querying of Vector Spaces: Probabilistic Databases Meets
Relational Embeddings
- URL: http://arxiv.org/abs/2002.10029v2
- Date: Sat, 27 Jun 2020 20:01:49 GMT
- Title: Symbolic Querying of Vector Spaces: Probabilistic Databases Meets
Relational Embeddings
- Authors: Tal Friedman and Guy Van den Broeck
- Abstract summary: We formalize a probabilistic database model with respect to which all queries are done.
The lack of a well-defined joint probability distribution causes simple query problems to become provably hard.
We introduce TO, a relational embedding model designed to be a tractable probabilistic database.
- Score: 35.877591735510734
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose unifying techniques from probabilistic databases and relational
embedding models with the goal of performing complex queries on incomplete and
uncertain data. We formalize a probabilistic database model with respect to
which all queries are done. This allows us to leverage the rich literature of
theory and algorithms from probabilistic databases for solving problems. While
this formalization can be used with any relational embedding model, the lack of
a well-defined joint probability distribution causes simple query problems to
become provably hard. With this in mind, we introduce \TO, a relational
embedding model designed to be a tractable probabilistic database, by
exploiting typical embedding assumptions within the probabilistic framework.
Using a principled, efficient inference algorithm that can be derived from its
definition, we empirically demonstrate that \TOs is an effective and general
model for these querying tasks.
Related papers
- Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models [3.877001015064152]
Probabilistic relational models provide a well-established formalism to combine first-order logic and probabilistic models.
The field of artificial intelligence requires increasingly large amounts of relational training data for various machine learning tasks.
Collecting real-world data is often challenging due to privacy concerns, data protection regulations, high costs, and so on.
arXiv Detail & Related papers (2024-09-06T11:24:25Z) - Estimating Causal Effects from Learned Causal Networks [56.14597641617531]
We propose an alternative paradigm for answering causal-effect queries over discrete observable variables.
We learn the causal Bayesian network and its confounding latent variables directly from the observational data.
We show that this emphmodel completion learning approach can be more effective than estimand approaches.
arXiv Detail & Related papers (2024-08-26T08:39:09Z) - Compositional Probabilistic and Causal Inference using Tractable Circuit
Models [20.07977560803858]
We introduce md-vtrees, a novel structural formulation of (marginal) determinism in structured decomposable PCs.
We derive the first polytime algorithms for causal inference queries such as backdoor adjustment on PCs.
arXiv Detail & Related papers (2023-04-17T13:48:16Z) - Validation Diagnostics for SBI algorithms based on Normalizing Flows [55.41644538483948]
This work proposes easy to interpret validation diagnostics for multi-dimensional conditional (posterior) density estimators based on NF.
It also offers theoretical guarantees based on results of local consistency.
This work should help the design of better specified models or drive the development of novel SBI-algorithms.
arXiv Detail & Related papers (2022-11-17T15:48:06Z) - An epistemic approach to model uncertainty in data-graphs [2.1261712640167847]
Graph databases can suffer from errors and discrepancies with respect to real-world data they intend to represent.
In this work we explore the notion of probabilistic unclean graph databases, previously proposed for relational databases.
We define two computational problems: data cleaning and probabilistic query answering and study for both of them their corresponding complexity.
arXiv Detail & Related papers (2021-09-29T00:08:27Z) - Structural Learning of Probabilistic Sentential Decision Diagrams under
Partial Closed-World Assumption [127.439030701253]
Probabilistic sentential decision diagrams are a class of structured-decomposable circuits.
We propose a new scheme based on a partial closed-world assumption: data implicitly provide the logical base of the circuit.
Preliminary experiments show that the proposed approach might properly fit training data, and generalize well to test data, provided that these remain consistent with the underlying logical base.
arXiv Detail & Related papers (2021-07-26T12:01:56Z) - Complex Coordinate-Based Meta-Analysis with Probabilistic Programming [0.0]
Coordinate-based meta-analysis (CBMA) databases are built by automatically extracting both coordinates of reported peak activations and term associations.
We show how recent lifted query processing algorithms make it possible to scale to the size of large neuroimaging data.
We demonstrate results for two-term conjunctive queries, both on simulated meta-analysis databases and on the widely-used Neurosynth database.
arXiv Detail & Related papers (2020-12-02T16:16:26Z) - Probabilistic Case-based Reasoning for Open-World Knowledge Graph
Completion [59.549664231655726]
A case-based reasoning (CBR) system solves a new problem by retrieving cases' that are similar to the given problem.
In this paper, we demonstrate that such a system is achievable for reasoning in knowledge-bases (KBs)
Our approach predicts attributes for an entity by gathering reasoning paths from similar entities in the KB.
arXiv Detail & Related papers (2020-10-07T17:48:12Z) - Accounting for Unobserved Confounding in Domain Generalization [107.0464488046289]
This paper investigates the problem of learning robust, generalizable prediction models from a combination of datasets.
Part of the challenge of learning robust models lies in the influence of unobserved confounders.
We demonstrate the empirical performance of our approach on healthcare data from different modalities.
arXiv Detail & Related papers (2020-07-21T08:18:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.