A tetrachotomy of ontology-mediated queries with a covering axiom
- URL: http://arxiv.org/abs/2006.04167v4
- Date: Thu, 5 May 2022 10:22:56 GMT
- Title: A tetrachotomy of ontology-mediated queries with a covering axiom
- Authors: Olga Gerasimova, Stanislav Kikot, Agi Kurucz, Vladimir Podolskii,
Michael Zakharyaschev
- Abstract summary: Our concern is the problem of efficiently determining the data complexity of answering queries mediated by description and their optimal rewritings to standard database queries.
We focus on Boolean conjunctive-mediated queries called disjunctive sirups (or d-sirups)
Some d-sirups only have exponential-size resolution features, some only double-exponential-size positive existential existential-rewritings and single-exprecursive datalog rewritings.
- Score: 1.749935196721634
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Our concern is the problem of efficiently determining the data complexity of
answering queries mediated by description logic ontologies and constructing
their optimal rewritings to standard database queries. Originated in
ontology-based data access and datalog optimisation, this problem is known to
be computationally very complex in general, with no explicit syntactic
characterisations available. In this article, aiming to understand the
fundamental roots of this difficulty, we strip the problem to the bare bones
and focus on Boolean conjunctive queries mediated by a simple covering axiom
stating that one class is covered by the union of two other classes. We show
that, on the one hand, these rudimentary ontology-mediated queries, called
disjunctive sirups (or d-sirups), capture many features and difficulties of the
general case. For example, answering d-sirups is Pi^p_2-complete for combined
complexity and can be in AC0 or LogSpace-, NL-, P-, or coNP-complete for data
complexity (with the problem of recognising FO-rewritability of d-sirups being
2ExpTime-hard); some d-sirups only have exponential-size resolution proofs,
some only double-exponential-size positive existential FO-rewritings and
single-exponential-size nonrecursive datalog rewritings. On the other hand, we
prove a few partial sufficient and necessary conditions of FO- and
(symmetric/linear-) datalog rewritability of d-sirups. Our main technical
result is a complete and transparent syntactic AC0/NL/P/coNP tetrachotomy of
d-sirups with disjoint covering classes and a path-shaped Boolean conjunctive
query. To obtain this tetrachotomy, we develop new techniques for establishing
P- and coNP-hardness of answering non-Horn ontology-mediated queries as well as
showing that they can be answered in NL.
Related papers
- MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs [80.96119560172224]
Large language models (LLMs) can solve arithmetic word problems with high accuracy, but little is known about how well they generalize to problems that are more complex than the ones on which they have been trained.
We present a framework for evaluating LLMs on problems with arbitrarily complex arithmetic proofs, called MathGAP.
arXiv Detail & Related papers (2024-10-17T12:48:14Z) - The Computational Complexity of Circuit Discovery for Inner Interpretability [0.30723404270319693]
We study circuit discovery with classical and parameterized computational complexity theory.
Our findings reveal a challenging complexity landscape.
This framework allows us to understand the scope and limits of interpretability queries.
arXiv Detail & Related papers (2024-10-10T15:22:48Z) - The Computational Complexity of Concise Hypersphere Classification [49.57441416941195]
This paper is the first complexity-theoretic study of the hypersphere classification problem for binary data.
We use the fine-grained parameterized complexity paradigm to analyze the impact of structural properties that may be present in the input data.
arXiv Detail & Related papers (2023-12-12T09:33:03Z) - Optimal Multi-Distribution Learning [88.3008613028333]
Multi-distribution learning seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions.
We propose a novel algorithm that yields an varepsilon-optimal randomized hypothesis with a sample complexity on the order of (d+k)/varepsilon2.
arXiv Detail & Related papers (2023-12-08T16:06:29Z) - Counting Solutions to Conjunctive Queries: Structural and Hybrid Tractability [8.618430843854048]
Counting the number of answers to conjunctive queries is a fundamental problem in databases that, under standard assumptions, does not have an efficient solution.
We pinpoint tractable classes by examining the structural properties of intrinsic instances and introducing the novel concept of #-hypertree decomposition.
arXiv Detail & Related papers (2023-11-24T16:09:12Z) - Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors [58.340159346749964]
We propose a new neural-symbolic method to support end-to-end learning using complex queries with provable reasoning capability.
We develop a new dataset containing ten new types of queries with features that have never been considered.
Our method outperforms previous methods significantly in the new dataset and also surpasses previous methods in the existing dataset at the same time.
arXiv Detail & Related papers (2023-04-14T11:35:35Z) - Complexity-Theoretic Limitations on Quantum Algorithms for Topological
Data Analysis [59.545114016224254]
Quantum algorithms for topological data analysis seem to provide an exponential advantage over the best classical approach.
We show that the central task of TDA -- estimating Betti numbers -- is intractable even for quantum computers.
We argue that an exponential quantum advantage can be recovered if the input data is given as a specification of simplices.
arXiv Detail & Related papers (2022-09-28T17:53:25Z) - Partial Counterfactual Identification from Observational and
Experimental Data [83.798237968683]
We develop effective Monte Carlo algorithms to approximate the optimal bounds from an arbitrary combination of observational and experimental data.
Our algorithms are validated extensively on synthetic and real-world datasets.
arXiv Detail & Related papers (2021-10-12T02:21:30Z) - How to Approximate Ontology-Mediated Queries [22.99749220980996]
We introduce and study several notions of approximation for ontology-mediated queries based on the description logics ALC and ALCI.
Our approximations are of two kinds: we may (1) replace the ontology with one formulated in a tractable language such as ELI or certain TGDs and (2) replace the database with one from a tractable class such as the class of databases whose treewidth is bounded by a constant.
arXiv Detail & Related papers (2021-07-12T12:29:50Z) - When is Ontology-Mediated Querying Efficient? [10.971122842236024]
We study the evaluation of ontology-mediated queries over relational databases.
We provide a characterization of the classes of OMQs that are tractable in combined complexity.
We also study the complexity of deciding whether a given OMQ is equivalent to an OMQ of bounded tree width.
arXiv Detail & Related papers (2020-03-17T16:32:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.