Fugu-MT Paper Translation (Abstract): OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper overview: OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

arxiv url: http://arxiv.org/abs/2605.29250v1
Date: Thu, 28 May 2026 02:10:35 GMT
Status: Translation complete
System update date: 2026-05-30 02:45:55.591049
Title: OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
Title (reference translation): OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
Authors: Jinheon Baek, Soyeong Jeong, Sangwoo Park, Woongyeong Yeo, Minki Kang, Patara Trirat, Heejun Lee, Sung Ju Hwang
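Abstract summary: Existing retrievers operate over one source at a time under a fixed query language. OmniRetrieval is a framework that takes a natural-language query and identifies the appropriate knowledge sources.
Reference score (independently computed attention score): 67.62754856088591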
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Real-world information needs require access to structurally diverse knowledge sources, from unstructured text and relational tables to knowledge graphs and property graphs. Existing retrievers, however, operate over one source at a time under a fixed query language, leaving the broader landscape of available knowledge fragmented behind incompatible interfaces. A natural attempt at unification would collapse these sources into a shared space, but this erases the structural affordances (such as schemas, ontologies, compositional operators) that give each source its expressive power. Effective retrieval over diverse knowledge, therefore, requires not homogenization but an overarching layer that meets each source on its own terms. To achieve this, we present OmniRetrieval, a framework that takes any natural-language query, identifies appropriate knowledge sources, and dispatches source-native queries to their native execution engines. Across an extensive benchmark spanning 13 datasets and 309 distinct knowledge bases over text, relational, and graph-structured sources, OmniRetrieval exceeds single-source baselines, demonstrating that it can serve as a general-purpose interface to the heterogeneous sources while preserving the structural distinctions that make each source valuable.
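Abstract (reference translation): Real-world information needs require access to structurally diverse knowledge sources, ranging from unstructured text and relational tables to knowledge graphs and property graphs. Existing retrievers, however, operate over one source at a time under a fixed query language, so the broader landscape of available knowledge stays fragmented behind incompatible interfaces. A natural attempt at unification would collapse these sources into a shared space, but doing so erases the structural affordances (such as schemas, ontologies, and compositional operators) that give each source its expressive power. Effective retrieval over diverse knowledge therefore requires not homogenization but an overarching layer that meets each source on its own terms. To achieve this, the authors propose OmniRetrieval, a framework that takes a natural-language query, identifies appropriate knowledge sources, and dispatches source-native queries to their native execution engines. On an extensive benchmark spanning 13 datasets and 309 distinct knowledge bases over text, relational, and graph-structured sources, OmniRetrieval exceeds single-source baselines, demonstrating that it can serve as a general-purpose interface to heterogeneous sources while preserving the structural distinctions that make each source valuable.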
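
The route-then-dispatch idea described in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the `Source` class, the keyword-based `route` function, and the hard-coded SQL builder are all hypothetical stand-ins chosen for illustration. The abstract does not say how sources are identified or how native queries are generated, so a real system would replace both stubs. The sketch only shows the shape of the interface: one natural-language question in, one source selected, a source-native query executed on that source's own engine.

```python
import sqlite3
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class Source:
    """A knowledge source with its own query builder and execution engine (hypothetical)."""
    name: str
    keywords: List[str]                  # toy routing cues; an assumption, not from the paper
    build_query: Callable[[str], Any]    # natural language -> source-native query
    execute: Callable[[Any], List[Any]]  # runs the native query on the native engine


def make_sql_source() -> Source:
    # A tiny in-memory relational source queried with real SQL.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE datasets (name TEXT, kind TEXT)")
    conn.executemany(
        "INSERT INTO datasets VALUES (?, ?)",
        [("A", "text"), ("B", "relational"), ("C", "graph")],
    )

    def build(question: str) -> str:
        # Stub: a real system would generate SQL from the question and the schema.
        return "SELECT COUNT(*) FROM datasets"

    def execute(sql: str) -> List[Any]:
        return conn.execute(sql).fetchall()

    return Source("relational", ["how many", "count"], build, execute)


def make_text_source() -> Source:
    docs = [
        "Property graphs attach key-value properties to nodes and edges.",
        "Relational tables organize data into rows and columns.",
    ]

    def build(question: str) -> List[str]:
        # The native "query" of this toy engine is a list of lowercase terms.
        return [w.strip("?.,").lower() for w in question.split() if len(w) > 3]

    def execute(terms: List[str]) -> List[Any]:
        return [d for d in docs if any(t in d.lower() for t in terms)]

    return Source("text", [], build, execute)


def route(question: str, sources: List[Source]) -> Source:
    """Pick the first source whose cue appears in the question, else the last source."""
    q = question.lower()
    for s in sources:
        if any(k in q for k in s.keywords):
            return s
    return sources[-1]


def retrieve(question: str, sources: List[Source]) -> Dict[str, Any]:
    """Route the question, build a source-native query, and run it on that source."""
    source = route(question, sources)
    native_query = source.build_query(question)
    return {"source": source.name, "query": native_query,
            "results": source.execute(native_query)}


SOURCES = [make_sql_source(), make_text_source()]
```

Calling `retrieve("How many datasets are there?", SOURCES)` sends a SQL query to the relational source, while `retrieve("What are property graphs?", SOURCES)` falls through to the text source. Each source keeps its own query form (a SQL string or a term list) instead of being flattened into a shared representation, which is the design point the abstract emphasizes.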


Related papers