Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain
Natural Language Database Interface
- URL: http://arxiv.org/abs/2106.04559v1
- Date: Tue, 8 Jun 2021 17:46:20 GMT
- Title: Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain
Natural Language Database Interface
- Authors: Peng Xu, Wenjie Zi, Hamidreza Shahidi, \'Akos K\'ad\'ar, Keyi Tang,
Wei Yang, Jawad Ateeq, Harsh Barot, Meidan Alon, Yanshuai Cao
- Abstract summary: Natural language database interface (NLDB) can democratize data-driven insights for non-technical users.
This work presents Turing, a NLDB system toward bridging this gap.
The cross-domain semantic validation method of Turing achieves $751%$ execution accuracy, and $78.3%$ top-5 beam execution accuracy on the Spider set.
- Score: 11.782395912109324
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: A natural language database interface (NLDB) can democratize data-driven
insights for non-technical users. However, existing Text-to-SQL semantic
parsers cannot achieve high enough accuracy in the cross-database setting to
allow good usability in practice. This work presents Turing, a NLDB system
toward bridging this gap. The cross-domain semantic parser of Turing with our
novel value prediction method achieves $75.1\%$ execution accuracy, and
$78.3\%$ top-5 beam execution accuracy on the Spider validation set. To benefit
from the higher beam accuracy, we design an interactive system where the SQL
hypotheses in the beam are explained step-by-step in natural language, with
their differences highlighted. The user can then compare and judge the
hypotheses to select which one reflects their intention if any. The English
explanations of SQL queries in Turing are produced by our high-precision
natural language generation system based on synchronous grammars.
Related papers
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation [72.72040379293718]
We introduce a UNIfied benchmark for Text-to-domain systems.
It is composed of publicly available text-to-domain datasets and 29K databases.
Compared to the widely used Spider benchmark, we introduce a threefold increase in SQL patterns.
arXiv Detail & Related papers (2023-05-25T17:19:52Z) - Error Detection for Text-to-SQL Semantic Parsing [18.068244400731366]
Modern text-to- semantics are often over-confident, casting doubt on their trustworthiness when deployed for real use.
We propose a-independent error detection model for text-to- semantic parsing.
arXiv Detail & Related papers (2023-05-23T04:44:22Z) - Improving Text-to-SQL Semantic Parsing with Fine-grained Query
Understanding [84.04706075621013]
We present a general-purpose, modular neural semantic parsing framework based on token-level fine-grained query understanding.
Our framework consists of three modules: named entity recognizer (NER), neural entity linker (NEL) and neural entity linker (NSP)
arXiv Detail & Related papers (2022-09-28T21:00:30Z) - A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future
Directions [102.8606542189429]
The goal of text-to-corpora parsing is to convert a natural language (NL) question to its corresponding structured query language () based on the evidences provided by databases.
Deep neural networks have significantly advanced this task by neural generation models, which automatically learn a mapping function from an input NL question to an output query.
arXiv Detail & Related papers (2022-08-29T14:24:13Z) - S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder
for Text-to-SQL Parsers [66.78665327694625]
We propose S$2$, injecting Syntax to question- encoder graph for Text-to- relational parsing.
We also employ the decoupling constraint to induce diverse edge embedding, which further improves the network's performance.
Experiments on the Spider and robustness setting Spider-Syn demonstrate that the proposed approach outperforms all existing methods when pre-training models are used.
arXiv Detail & Related papers (2022-03-14T09:49:15Z) - Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural
Language Question [18.40290951253122]
Speech-based inputs have been gaining significant momentum with the popularity of smartphones and tablets.
This paper works towards designing more effective speech interfaces to query the structured data databases.
We propose a novel end-to-end neural architecture named SpeechNet to directly translate human speech into queries.
arXiv Detail & Related papers (2022-01-04T15:38:36Z) - "What Do You Mean by That?" A Parser-Independent Interactive Approach
for Enhancing Text-to-SQL [49.85635994436742]
We include human in the loop and present a novel-independent interactive approach (PIIA) that interacts with users using multi-choice questions.
PIIA is capable of enhancing the text-to-domain performance with limited interaction turns by using both simulation and human evaluation.
arXiv Detail & Related papers (2020-11-09T02:14:33Z) - Photon: A Robust Cross-Domain Text-to-SQL System [189.1405317853752]
We present Photon, a robust, modular, cross-domain NLIDB that can flag natural language input to which a mapping cannot be immediately determined.
The proposed method effectively improves the robustness of text-to-native system against untranslatable user input.
arXiv Detail & Related papers (2020-07-30T07:44:48Z) - Speak to your Parser: Interactive Text-to-SQL with Natural Language
Feedback [39.45695779589969]
We study the task of semantic parse correction with natural language feedback.
In this paper, we investigate a more interactive scenario where humans can further interact with the system.
arXiv Detail & Related papers (2020-05-05T23:58:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.