Fugu-MT 論文翻訳(概要): FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

論文の概要: FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

arxiv url: http://arxiv.org/abs/2605.02815v1
Date: Mon, 04 May 2026 16:51:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.419333
Title: FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents
Title（参考訳）: FlexSQL: 柔軟な探索と実行により、テキストからSQLへのエージェントが改善される
Authors: Quang Hieu Pham, Yang He, Ping Nie, Canwen Xu, Davood Rafiei, Yuepeng Wang, Xi Ye, Jocelyn Qiaochu Chen,
Abstract要約: 私たちはFlex Codeを紹介します。Flex Codeは、柔軟性のあるデータベースインタラクションを基本設計するテキスト・ツー・エージェントです。この手法はSpider2-Snowに対して10%以上の相対的な改善をもたらす。
参考スコア（独自算出の注目度）: 36.84900072417659
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-SQL over large analytical databases requires navigating complex schemas, resolving ambiguous queries, and grounding decisions in actual data. Most current systems follow a fixed pipeline where schema elements are retrieved once upfront and the database is only revisited for post-hoc repair, limiting recovery from early mistakes. We present FlexSQL, a text-to-SQL agent whose core design principle is flexible database interaction: the agent can explore schema structure, inspect data values, and run verification queries at any point during reasoning. FlexSQL generates diverse execution plans to cover multiple query interpretations, implements each plan in either SQL or Python depending on the task, and uses a two-tiered repair mechanism that can backtrack from code-level errors to plan-level revisions. On Spider2-Snow, using gpt-oss-120b, FlexSQL achieves a 65.4\% score, outperforming strong open-source baselines that use stronger, larger models such as gpt-o3 and DeepSeek-R1. When integrated into a general-purpose coding agent (as skills in Claude Code), our approach yields over 10\% relative improvement on Spider2-Snow. Further analysis shows that flexible exploration and flexible execution jointly contribute to the effectiveness of our approach, highlighting flexibility as a key design principle. Our code is available at: https://github.com/StringNLPLAB/FlexSQL
Abstract（参考訳）: 大規模な分析データベース上のテキストからSQLは、複雑なスキーマをナビゲートし、あいまいなクエリを解決し、実際のデータで決定を下す必要がある。現在のほとんどのシステムは固定パイプラインに従っており、スキーマ要素は一度前もって検索され、データベースは保留後の修復のためにのみ更新され、初期のミスからの回復が制限される。エージェントはスキーマ構造を探索し、データ値を検査し、推論中の任意の時点で検証クエリを実行できます。 FlexSQLは、複数のクエリ解釈をカバーするさまざまな実行プランを生成し、タスクに応じてSQLまたはPythonで各プランを実装し、コードレベルのエラーからプランレベルのリビジョンまで追跡可能な2段階の修復メカニズムを使用している。 Spider2-Snowでは、gpt-oss-120bを使用して、FlexSQLは65.4\%のスコアを獲得し、gpt-o3やDeepSeek-R1のようなより強力なモデルを使用する強力なオープンソースベースラインを上回っている。汎用コーディングエージェント(Claude Codeのスキルとして)に統合されると、我々のアプローチはSpider2-Snowに対して10倍以上の相対的な改善をもたらす。さらなる分析により、フレキシブルな探索とフレキシブルな実行が、我々のアプローチの有効性に共同で寄与することを示し、フレキシブルを重要な設計原則として強調する。私たちのコードは、https://github.com/StringNLPLAB/FlexSQLで利用可能です。

論文の概要: FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

関連論文リスト