Fugu-MT 論文翻訳(概要): Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

論文の概要: Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

arxiv url: http://arxiv.org/abs/2604.15233v1
Date: Thu, 16 Apr 2026 17:10:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:32.017246
Title: Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications
Title（参考訳）: Blue Data Intelligence Layer:マルチソースマルチモーダルデータ中心アプリケーションのためのストリーミングデータとエージェント
Authors: Moin Aminnaseri, Farima Fatahi Bayat, Nikita Bhutani, Jean-Flavien Bussotti, Kevin Chan, Rafael Li Chen, Yanlin Feng, Jackson Hassell, Estevam Hruschka, Eser Kandogan, Hannah Kim, James Levine, Seiji Maekawa, Jalal Mahmud, Kushan Mitra, Naoki Otani, Pouya Pezeshkpour, Nima Shahbazi, Chen Shen, Dan Zhang,
Abstract要約: NL2システムは、データとの自然言語インタラクションの必要性の高まりに対処することを目的としている。我々は、マルチソース、マルチモーダル、データ中心のアプリケーションをサポートするように設計されたBlueのデータインテリジェンス層(DIL)を提示する。
参考スコア（独自算出の注目度）: 26.71604287422676
License: http://creativecommons.org/licenses/by/4.0/
Abstract: NL2SQL systems aim to address the growing need for natural language interaction with data. However, real-world information rarely maps to a single SQL query because (1) users express queries iteratively (2) questions often span multiple data sources beyond the closed-world assumption of a single database, and (3) queries frequently rely on commonsense or external knowledge. Consequently, satisfying realistic data needs require integrating heterogeneous sources, modalities, and contextual data. In this paper, we present Blue's Data Intelligence Layer (DIL) designed to support multi-source, multi-modal, and data-centric applications. Blue is a compound AI system that orchestrates agents and data for enterprise settings. DIL serves as the data intelligence layer for agentic data processing, to bridge the semantic gap between user intent and available information by unifying structured enterprise data, world knowledge accessible through LLMs, and personal context obtained through interaction. At the core of DIL is a data registry that stores metadata for diverse data sources and modalities to enable both native and natural language queries. DIL treats LLMs, the Web, and the User as source 'databases', each with their own query interface, elevating them to first-class data sources. DIL relies on data planners to transform user queries into executable query plans. These plans are declarative abstractions that unify relational operators with other operators spanning multiple modalities. DIL planners support decomposition of complex requests into subqueries, retrieval from diverse sources, and finally reasoning and integration to produce final results. We demonstrate DIL through two interactive scenarios in which user queries dynamically trigger multi-source retrieval, cross-modal reasoning, and result synthesis, illustrating how compound AI systems can move beyond single database NL2SQL.
Abstract（参考訳）: NL2SQLシステムは、データとの自然言語インタラクションの必要性の高まりに対処することを目的としている。しかし,(1) ユーザがクエリを反復的に表現すること,(2) 質問は単一のデータベースのクローズドワールド仮定を超えた複数のデータソースにまたがることが多く,(3) クエリはコモンセンスや外部知識に依存することが多いため,現実の情報は単一のSQLクエリにマップされることは滅多にない。したがって、現実的なデータを満たすためには、異質なソース、モダリティ、コンテキストデータを統合する必要がある。本稿では,マルチソース,マルチモーダル,データ中心のアプリケーションをサポートするように設計されたBlue's Data Intelligence Layer(DIL)を提案する。 Blueは、企業設定のためのエージェントとデータをオーケストレーションする複合AIシステムである。 DILはエージェントデータ処理のためのデータインテリジェンスレイヤとして機能し、構造化されたエンタープライズデータ、LLMを通じてアクセス可能な世界知識、対話を通じて得られる個人的コンテキストを統一することで、ユーザ意図と利用可能な情報のセマンティックギャップを埋める。 DILの中核にあるデータレジストリは、さまざまなデータソースとモダリティのメタデータを格納し、ネイティブおよび自然言語クエリの両方を可能にする。 DIL は LLM 、 Web 、および User をソースとして扱い、それぞれに独自のクエリインターフェースを持ち、それらをファーストクラスのデータソースに昇格させる。 DILは、ユーザクエリを実行可能なクエリプランに変換するために、データプランナに依存しています。これらの計画は、複数のモダリティにまたがる他の演算子と関係演算子を統合する宣言的抽象化である。 DILプランナは、複雑なリクエストをサブクエリに分解し、さまざまなソースから検索し、最終的な結果を生成するための推論と統合をサポートする。ユーザクエリがマルチソース検索、クロスモーダル推論、結果合成を動的にトリガする2つの対話的なシナリオを通じて、DILを実証し、複合AIシステムが単一のデータベースNL2SQLを超えてどのように動くかを説明する。

論文の概要: Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

関連論文リスト