Fugu-MT 論文翻訳(概要): Automating Database-Native Function Code Synthesis with LLMs

論文の概要: Automating Database-Native Function Code Synthesis with LLMs

arxiv url: http://arxiv.org/abs/2604.06231v1
Date: Thu, 02 Apr 2026 02:56:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.09492
Title: Automating Database-Native Function Code Synthesis with LLMs
Title（参考訳）: LLMを用いたデータベースネイティブ関数コードの自動生成
Authors: Wei Zhou, Xuanhe Zhou, Qikang He, Guoliang Li, Bingsheng He, Quanqing Xu, Fan Wu,
Abstract要約: データベースネイティブ関数を自動実装するLLMベースのDBCookerを提案する。まず、関数キャラクタリゼーションモジュールは、複数のソース宣言を集約し、特別なコーディングを必要とする関数ユニットを特定し、ユニット間の依存関係をトレースする。第二に,1)再利用可能な参照関数などの重要な要素を識別し,擬似コードに基づく符号化シーケンスを設計すること,2)確率的先行とコンポーネント認識によって導かれるハイブリッドフィリング・ザ・ブランクモデルにより,コアロジックと再利用可能なルーチンを統合すること,である。
参考スコア（独自算出の注目度）: 45.585082035125886
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Database systems incorporate an ever-growing number of functions in their kernels (a.k.a., database native functions) for scenarios like new application support and business migration. This growth causes an urgent demand for automatic database native function synthesis. While recent advances in LLM-based code generation (e.g., Claude Code) show promise, they are too generic for database-specific development. They often hallucinate or overlook critical context because database function synthesis is inherently complex and error-prone, where synthesizing a single function may involve registering multiple function units, linking internal references, and implementing logic correctly. To this end, we propose DBCooker, an LLM-based system for automatically synthesizing database native functions. It consists of three components. First, the function characterization module aggregates multi-source declarations, identifies function units that require specialized coding, and traces cross-unit dependencies. Second, we design operations to address the main synthesis challenges: (1) a pseudo-code-based coding plan generator that constructs structured implementation skeletons by identifying key elements such as reusable referenced functions; (2) a hybrid fill-in-the-blank model guided by probabilistic priors and component awareness to integrate core logic with reusable routines; and (3) three-level progressive validation, including syntax checking, standards compliance, and LLM-guided semantic verification. Finally, an adaptive orchestration strategy unifies these operations with existing tools and dynamically sequences them via the orchestration history of similar functions. Results show that DBCooker outperforms other methods on SQLite, PostgreSQL, and DuckDB (34.55% higher accuracy on average), and can synthesize new functions absent in the latest SQLite (v3.50).
Abstract（参考訳）: データベースシステムは、新しいアプリケーションのサポートやビジネス移行のようなシナリオのために、カーネル(すなわち、データベースネイティブ関数)に増え続ける関数を組み込んでいる。この成長は、自動データベースネイティブ関数合成の急激な需要を引き起こす。 LLMベースのコード生成(例えばClaude Code)の最近の進歩は、将来性を示しているが、データベース固有の開発には汎用的すぎる。データベース関数合成は本質的に複雑でエラーを起こしやすいため、複数の関数ユニットを登録し、内部参照をリンクし、論理を正しく実装する。そこで本研究では,データベースネイティブ関数の自動合成システムであるDBCookerを提案する。 3つの構成要素から構成される。まず、関数キャラクタリゼーションモジュールは、複数のソース宣言を集約し、特別なコーディングを必要とする関数ユニットを特定し、ユニット間の依存関係をトレースする。第2に,1)再利用可能な参照関数などの重要な要素を識別し,構造化された実装スケルトンを構築する擬似コードベースのコーディング計画生成装置,(2)確率的事前とコンポーネント認識に導かれるハイブリッド・フィリング・ザ・ブランクモデル,(3)構文チェック,標準コンプライアンス,LLM誘導セマンティック検証を含む3段階のプログレッシブ・バリデーション・バリデーションを設計する。最後に、適応的なオーケストレーション戦略は、これらの操作を既存のツールと統合し、同様の関数のオーケストレーション履歴を通じて動的にシーケンスする。結果は、DBCookerがSQLite、PostgreSQL、DuckDBの他のメソッド(平均で34.55%高い精度)より優れており、最新のSQLite(v3.50)に欠けている新機能を合成できることを示している。

論文の概要: Automating Database-Native Function Code Synthesis with LLMs

関連論文リスト