ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser
- URL: http://arxiv.org/abs/2104.04689v2
- Date: Wed, 14 Apr 2021 07:06:55 GMT
- Title: ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser
- Authors: Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu and Kai
Yu
- Abstract summary: We propose a new architecture, ShadowGNN, which processes schemas at abstract and semantic levels.
On the challenging Text-to-Spider benchmark, empirical results show that ShadowGNN outperforms state-of-the-art models.
- Score: 36.12921337235763
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Given a database schema, Text-to-SQL aims to translate a natural language
question into the corresponding SQL query. Under the setup of cross-domain,
traditional semantic parsing models struggle to adapt to unseen database
schemas. To improve the model generalization capability for rare and unseen
schemas, we propose a new architecture, ShadowGNN, which processes schemas at
abstract and semantic levels. By ignoring names of semantic items in databases,
abstract schemas are exploited in a well-designed graph projection neural
network to obtain delexicalized representation of question and schema. Based on
the domain-independent representations, a relation-aware transformer is
utilized to further extract logical linking between question and schema.
Finally, a SQL decoder with context-free grammar is applied. On the challenging
Text-to-SQL benchmark Spider, empirical results show that ShadowGNN outperforms
state-of-the-art models. When the annotated data is extremely limited (only
10\% training set), ShadowGNN gets over absolute 5\% performance gain, which
shows its powerful generalization ability. Our implementation will be
open-sourced at \url{https://github.com/WowCZ/shadowgnn}.
Related papers
- Improving Text-to-SQL Semantic Parsing with Fine-grained Query
Understanding [84.04706075621013]
We present a general-purpose, modular neural semantic parsing framework based on token-level fine-grained query understanding.
Our framework consists of three modules: named entity recognizer (NER), neural entity linker (NEL) and neural entity linker (NSP)
arXiv Detail & Related papers (2022-09-28T21:00:30Z) - Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema
Linking Graph [6.13728903057727]
The generalizability to new databases is of vital importance to Text-to- systems which aim to parse human utterances intosql statements.
In this paper, we propose a framework named IS ESL to iteratively build a enhanced semantic schema-linking graph between question tokens and database schemas.
Extensive experiments on three benchmarks demonstrate that IS ESL could consistently outperform the baselines and further investigations show its generalizability and robustness.
arXiv Detail & Related papers (2022-08-08T03:59:33Z) - Proton: Probing Schema Linking Information from Pre-trained Language
Models for Text-to-SQL Parsing [66.55478402233399]
We propose a framework to elicit relational structures via a probing procedure based on Poincar'e distance metric.
Compared with commonly-used rule-based methods for schema linking, we found that probing relations can robustly capture semantic correspondences.
Our framework sets new state-of-the-art performance on three benchmarks.
arXiv Detail & Related papers (2022-06-28T14:05:25Z) - S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder
for Text-to-SQL Parsers [66.78665327694625]
We propose S$2$, injecting Syntax to question- encoder graph for Text-to- relational parsing.
We also employ the decoupling constraint to induce diverse edge embedding, which further improves the network's performance.
Experiments on the Spider and robustness setting Spider-Syn demonstrate that the proposed approach outperforms all existing methods when pre-training models are used.
arXiv Detail & Related papers (2022-03-14T09:49:15Z) - SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL [29.328698264910596]
One of the most challenging problems of Text-to-Graph is how to generalize the trained model to the unseen database schemas.
We propose a Structure-Aware Dual Graph Aggregation Network (SADGA) for cross-domain Text-to-Graph.
We achieve 3rd place on the challenging Text-to-Graph benchmark Spider at the time of writing.
arXiv Detail & Related papers (2021-11-01T01:50:28Z) - Relation Aware Semi-autoregressive Semantic Parsing for NL2SQL [17.605904256822786]
We present a Relation aware Semi-autogressive Semantic Parsing (MODN) framework, which is more adaptable for NL2 backbone.
From empirical results and case study, our model shows its effectiveness in learning better word representation in NL2.
arXiv Detail & Related papers (2021-08-02T12:21:08Z) - Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic
Parsing [110.97778888305506]
BRIDGE represents the question and DB schema in a tagged sequence where a subset of the fields are augmented with cell values mentioned in the question.
BRIDGE attained state-of-the-art performance on popular cross-DB text-to- relational benchmarks.
Our analysis shows that BRIDGE effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks.
arXiv Detail & Related papers (2020-12-23T12:33:52Z) - IGSQL: Database Schema Interaction Graph Based Neural Model for
Context-Dependent Text-to-SQL Generation [61.09660709356527]
We propose a database schema interaction graph encoder to utilize historicalal information of database schema items.
We evaluate our model on the benchmark SParC and Co datasets.
arXiv Detail & Related papers (2020-11-11T12:56:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.