Fugu-MT 論文翻訳(概要): Cortex AISQL: A Production SQL Engine for Unstructured Data

論文の概要: Cortex AISQL: A Production SQL Engine for Unstructured Data

arxiv url: http://arxiv.org/abs/2511.07663v2
Date: Wed, 19 Nov 2025 13:22:56 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-20 13:41:21.05239
Title: Cortex AISQL: A Production SQL Engine for Unstructured Data
Title（参考訳）: Cortex AISQL: 構造化されていないデータのためのプロダクションSQLエンジン
Authors: Paweł Liskowski, Benjamin Han, Paritosh Aggarwal, Bowei Chen, Boxin Jiang, Nitish Jindal, Zihan Li, Aaron Lin, Kyle Schmaus, Jay Tayade, Weicheng Zhao, Anupam Datta, Nathan Wiegand, Dimitris Tsirogiannis,
Abstract要約: AIはSnowflakeで本番環境にデプロイされ、分析、検索、コンテンツ理解にまたがるさまざまな顧客のワークロードに電力を供給する。本稿では、AI対応クエリ最適化が、AI推論コストを第一級最適化目標として扱う方法を示す。第二に、適応モデルカスケードは、高速プロキシモデルを通じてほとんどの行をルーティングすることで、推論コストを削減する。第三に、セマンティックなジョインクエリの書き換えは、ジョイン操作の二次的な時間の複雑さを線形に下げる。
参考スコア（独自算出の注目度）: 11.480345698642006
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Snowflake's Cortex AISQL is a production SQL engine that integrates native semantic operations directly into SQL. This integration allows users to write declarative queries that combine relational operations with semantic reasoning, enabling them to query both structured and unstructured data effortlessly. However, making semantic operations efficient at production scale poses fundamental challenges. Semantic operations are more expensive than traditional SQL operations, possess distinct latency and throughput characteristics, and their cost and selectivity are unknown during query compilation. Furthermore, existing query engines are not designed to optimize semantic operations. The AISQL query execution engine addresses these challenges through three novel techniques informed by production deployment data from Snowflake customers. First, AI-aware query optimization treats AI inference cost as a first-class optimization objective, reasoning about large language model (LLM) cost directly during query planning to achieve 2-8$\times$ speedups. Second, adaptive model cascades reduce inference costs by routing most rows through a fast proxy model while escalating uncertain cases to a powerful oracle model, achieving 2-6$\times$ speedups while maintaining 90-95% of oracle model quality. Third, semantic join query rewriting lowers the quadratic time complexity of join operations to linear through reformulation as multi-label classification tasks, achieving 15-70$\times$ speedups with often improved prediction quality. AISQL is deployed in production at Snowflake, where it powers diverse customer workloads across analytics, search, and content understanding.
Abstract（参考訳）: SnowflakeのCortex AISQLは、SQLに直接ネイティブセマンティック操作を統合する、プロダクションSQLエンジンである。この統合により、ユーザーは、リレーショナル操作とセマンティック推論を組み合わせた宣言型クエリを書くことができ、構造化データと非構造化データの両方を懸命にクエリすることができる。しかし、本番環境でのセマンティック操作を効率化することは、根本的な課題となる。セマンティック操作は従来のSQL操作よりも高価で、レイテンシとスループットの特徴が異なる。さらに、既存のクエリエンジンはセマンティック操作を最適化するために設計されていない。 AISQLクエリ実行エンジンは、Snowflakeユーザの運用デプロイメントデータから通知される3つの新しいテクニックを通じて、これらの課題に対処する。まず、AI対応クエリ最適化は、AI推論コストを第一級最適化目標として扱い、2-8$\times$スピードアップを達成するためのクエリ計画中に、大きな言語モデル(LLM)コストを直接考慮する。第二に、適応モデルカスケードは、高速なプロキシモデルを通してほとんどの行をルーティングし、不確実なケースを強力なオラクルモデルにエスカレートし、90-95%のオラクルモデル品質を維持しながら2-6$\times$スピードアップを達成することで、推論コストを削減する。第3に、セマンティックなジョインクエリの書き換えにより、ジョイン操作の二次的な時間的複雑さを減らし、複数ラベルの分類タスクとして再編成し、15-70$\times$のスピードアップを実現し、予測品質が向上する。 AISQLはSnowflakeで本番環境にデプロイされ、分析、検索、コンテンツ理解にまたがるさまざまな顧客のワークロードに電力を供給する。

論文の概要: Cortex AISQL: A Production SQL Engine for Unstructured Data

関連論文リスト