Fugu-MT 論文翻訳(概要): Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

論文の概要: Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

arxiv url: http://arxiv.org/abs/2505.19288v1
Date: Sun, 25 May 2025 19:42:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-27 16:58:43.033086
Title: Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering
Title（参考訳）: Hypercube-RAG : ドメイン内科学的質問応答のためのハイパーキューブを用いた検索生成
Authors: Jimeng Shi, Sizhe Zhou, Bowen Jin, Wei Hu, Shaowen Wang, Giri Narasimhan, Jiawei Han,
Abstract要約: 大きな言語モデル(LLM)は、しばしばテーマ固有の問題を解決するために外部知識を組み込む必要がある。本稿では,高精度かつ効率的な検索のための新しいRAGフレームワークであるHypercube-RAGを紹介する。提案手法は精度を3.7%向上し,検索効率を81.2%向上させる。
参考スコア（独自算出の注目度）: 28.453408262591434
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) often need to incorporate external knowledge to solve theme-specific problems. Retrieval-augmented generation (RAG), which empowers LLMs to generate more qualified responses with retrieved external data and knowledge, has shown its high promise. However, traditional semantic similarity-based RAGs struggle to return concise yet highly relevant information for domain knowledge-intensive tasks, such as scientific question-answering (QA). Built on a multi-dimensional (cube) structure called Hypercube, which can index documents in an application-driven, human-defined, multi-dimensional space, we introduce the Hypercube-RAG, a novel RAG framework for precise and efficient retrieval. Given a query, Hypercube-RAG first decomposes it based on its entities and topics and then retrieves relevant documents from cubes by aligning these decomposed components with hypercube dimensions. Experiments on three in-domain scientific QA datasets demonstrate that our method improves accuracy by 3.7% and boosts retrieval efficiency by 81.2%, measured as relative gains over the strongest RAG baseline. More importantly, our Hypercube-RAG inherently offers explainability by revealing the underlying predefined hypercube dimensions used for retrieval. The code and data sets are available at https://github.com/JimengShi/Hypercube-RAG.
Abstract（参考訳）: 大きな言語モデル(LLM)は、しばしばテーマ固有の問題を解決するために外部知識を組み込む必要がある。検索された外部データと知識により、LLMがより適格な応答を生成できるようにする検索拡張生成(RAG)は、その高い可能性を示している。しかしながら、伝統的な意味的類似性に基づくRAGは、科学的質問回答(QA)のようなドメイン知識集約的なタスクに対して簡潔だが非常に関連性の高い情報を返すのに苦労している。アプリケーション駆動型,人間定義型,多次元空間における文書のインデックス化が可能な,Hypercubeと呼ばれる多次元(キューブ)構造上に構築されたHypercube-RAGは,高精度かつ効率的な検索を行うための新しいRAGフレームワークである。クエリが与えられたら、Hypercube-RAGはまずエンティティとトピックに基づいてそれを分解し、次に、これらの分解されたコンポーネントをハイパーキューブ次元に整列させることで、キューブから関連ドキュメントを取得する。 3つの領域内科学的なQAデータセットの実験により、我々の手法は精度を3.7%向上し、検索効率を81.2%向上させることを示した。さらに重要なことは、Hypercube-RAGは本質的に、検索に使用される基礎となる定義されたハイパーキューブ次元を明らかにすることによって、説明可能性を提供します。コードとデータセットはhttps://github.com/JimengShi/Hypercube-RAG.comで公開されている。

論文の概要: Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

関連論文リスト