Enhancing Text-to-SQL Translation for Financial System Design
- URL: http://arxiv.org/abs/2312.14725v2
- Date: Tue, 9 Jan 2024 00:26:50 GMT
- Title: Enhancing Text-to-SQL Translation for Financial System Design
- Authors: Yewei Song, Saad Ezzini, Xunzhu Tang, Cedric Lothritz, Jacques Klein,
Tegawend\'e Bissyand\'e, Andrey Boytsov, Ulrick Ble, Anne Goujon
- Abstract summary: We consider Large Language Models (LLMs), which have achieved state of the art for various NLP tasks.
We propose two novel metrics that were designed to adequately measure the similarity between relational queries.
- Score: 5.248014305403357
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Text-to-SQL, the task of translating natural language questions into SQL
queries, is part of various business processes. Its automation, which is an
emerging challenge, will empower software practitioners to seamlessly interact
with relational databases using natural language, thereby bridging the gap
between business needs and software capabilities. In this paper, we consider
Large Language Models (LLMs), which have achieved state of the art for various
NLP tasks. Specifically, we benchmark Text-to-SQL performance, the evaluation
methodologies, as well as input optimization (e.g., prompting). In light of the
empirical observations that we have made, we propose two novel metrics that
were designed to adequately measure the similarity between SQL queries.
Overall, we share with the community various findings, notably on how to select
the right LLM on Text-to-SQL tasks. We further demonstrate that a tree-based
edit distance constitutes a reliable metric for assessing the similarity
between generated SQL queries and the oracle for benchmarking Text2SQL
approaches. This metric is important as it relieves researchers from the need
to perform computationally expensive experiments such as executing generated
queries as done in prior works. Our work implements financial domain use cases
and, therefore contributes to the advancement of Text2SQL systems and their
practical adoption in this domain.
Related papers
- Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement [1.392448435105643]
Text-to-s enables non-expert users to effortlessly retrieve desired information from databases using natural language queries.
Current state-of-the-art (SOTA) models like GPT4 and T5 have shown impressive performance on large-scale benchmarks like BIRD.
This paper proposed a novel approach that only needs SQL Quality to enhance Text-to-s performance.
arXiv Detail & Related papers (2024-10-02T17:21:51Z) - PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL [54.304872649870575]
Large Language Models (LLMs) have emerged as powerful tools for Text-to-sense tasks.
In this study, we propose that employing query group partitioning allows LLMs to focus on learning the thought processes specific to a single problem type.
arXiv Detail & Related papers (2024-09-21T09:33:14Z) - A Survey on Employing Large Language Models for Text-to-SQL Tasks [9.527891544418805]
The increasing volume of data in relational databases pose challenges for users to access and analyze data.
Text-to-sql (Text2) solves the issues by utilizing natural language processing (NLP) techniques to convert natural language intosql queries.
With the development of Large Language Models (LLMs), a range of LLM-based Text2 methods have emerged.
arXiv Detail & Related papers (2024-07-21T14:48:23Z) - Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL [15.75829309721909]
Generating accuratesql from natural language questions (text-to-) is a long-standing challenge.
PLMs have been developed and utilized for text-to- tasks, achieving promising performance.
Recently, large language models (LLMs) have demonstrated significant capabilities in natural language understanding.
arXiv Detail & Related papers (2024-06-12T17:13:17Z) - Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation [76.76046657162306]
Large language models (LLMs) have emerged as a new paradigm for Text-to- task.
Large language models (LLMs) have emerged as a new paradigm for Text-to- task.
arXiv Detail & Related papers (2023-08-29T14:59:54Z) - SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL (extended) [53.95151604061761]
This paper introduces the framework for enhancing Text-to- filtering using large language models (LLMs)
With few-shot prompting, we explore the effectiveness of consistency decoding with execution-based error analyses.
With instruction fine-tuning, we delve deep in understanding the critical paradigms that influence the performance of tuned LLMs.
arXiv Detail & Related papers (2023-05-26T21:39:05Z) - UNITE: A Unified Benchmark for Text-to-SQL Evaluation [72.72040379293718]
We introduce a UNIfied benchmark for Text-to-domain systems.
It is composed of publicly available text-to-domain datasets and 29K databases.
Compared to the widely used Spider benchmark, we introduce a threefold increase in SQL patterns.
arXiv Detail & Related papers (2023-05-25T17:19:52Z) - A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future
Directions [102.8606542189429]
The goal of text-to-corpora parsing is to convert a natural language (NL) question to its corresponding structured query language () based on the evidences provided by databases.
Deep neural networks have significantly advanced this task by neural generation models, which automatically learn a mapping function from an input NL question to an output query.
arXiv Detail & Related papers (2022-08-29T14:24:13Z) - "What Do You Mean by That?" A Parser-Independent Interactive Approach
for Enhancing Text-to-SQL [49.85635994436742]
We include human in the loop and present a novel-independent interactive approach (PIIA) that interacts with users using multi-choice questions.
PIIA is capable of enhancing the text-to-domain performance with limited interaction turns by using both simulation and human evaluation.
arXiv Detail & Related papers (2020-11-09T02:14:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.