Fugu-MT Paper Translation (Abstract): SQLyzr: A Comprehensive Benchmark and Evaluation Platform for Text-to-SQL

Paper summary: SQLyzr: A Comprehensive Benchmark and Evaluation Platform for Text-to-SQL

arxiv url: http://arxiv.org/abs/2604.21214v2
Date: Mon, 27 Apr 2026 17:45:13 GMT
Status: translation complete
Last updated in system: 2026-04-28 17:12:06.924809
Title: SQLyzr: A Comprehensive Benchmark and Evaluation Platform for Text-to-SQL
Authors: Sepideh Abedini, M. Tamer Özsu
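Abstract summary: SQLyzr is a comprehensive benchmark and evaluation platform for text-to-SQL models. It incorporates a diverse set of evaluation metrics that capture multiple aspects of generated queries. It supports fine-grained query classification, error analysis, and workload augmentation, allowing users to better diagnose and improve text-to-SQL models.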
Reference score (independently computed attention measure): 6.156269073168807
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-SQL models have significantly improved with the adoption of Large Language Models (LLMs), leading to their increasing use in real-world applications. Although many benchmarks exist for evaluating the performance of text-to-SQL models, they often rely on a single aggregate score, lack evaluation under realistic settings, and provide limited insight into model behaviour across different query types. In this work, we present SQLyzr, a comprehensive benchmark and evaluation platform for text-to-SQL models. SQLyzr incorporates a diverse set of evaluation metrics that capture multiple aspects of generated queries, while enabling more realistic evaluation through workload alignment with real-world SQL usage patterns and database scaling. It further supports fine-grained query classification, error analysis, and workload augmentation, allowing users to better diagnose and improve text-to-SQL models. This demonstration showcases these capabilities through an interactive experience. Through SQLyzr's graphical interface, users can customize evaluation settings, analyze fine-grained reports, and explore additional features of the platform. We envision that SQLyzr facilitates the evaluation and iterative improvement of text-to-SQL models by addressing key limitations of existing benchmarks. The source code of SQLyzr is available at https://github.com/sepideh-abedini/SQLyzr.

Related papers