Fugu-MT 論文翻訳(概要): DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph

論文の概要: DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph

arxiv url: http://arxiv.org/abs/2509.10467v1
Date: Fri, 22 Aug 2025 14:24:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-21 06:05:45.73514
Title: DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph
Title（参考訳）: DSRAG: ドキュメント由来のマルチモーダル知識グラフに基づくドメイン特化検索フレームワーク
Authors: Mengzheng Yang, Yanfei Ren, David Osei Opoku, Ruochang Li, Peng Ren, Chunxiao Xing,
Abstract要約: この研究は、生成プロセスにおける知識グラフ品質の重要な役割を強調したグラフベースのRAGフレームワークに焦点を当てている。ドメイン固有アプリケーション向けに設計されたマルチモーダルな知識グラフ駆動検索拡張生成フレームワークであるDSRAGを提案する。
参考スコア（独自算出の注目度）: 4.951890767337337
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Current general-purpose large language models (LLMs) commonly exhibit knowledge hallucination and insufficient domain-specific adaptability in domain-specific tasks, limiting their effectiveness in specialized question answering scenarios. Retrieval-augmented generation (RAG) effectively tackles these challenges by integrating external knowledge to enhance accuracy and relevance. However, traditional RAG still faces limitations in domain knowledge accuracy and context modeling.To enhance domain-specific question answering performance, this work focuses on a graph-based RAG framework, emphasizing the critical role of knowledge graph quality during the generation process. We propose DSRAG (Domain-Specific RAG), a multimodal knowledge graph-driven retrieval-augmented generation framework designed for domain-specific applications. Our approach leverages domain-specific documents as the primary knowledge source, integrating heterogeneous information such as text, images, and tables to construct a multimodal knowledge graph covering both conceptual and instance layers. Building on this foundation, we introduce semantic pruning and structured subgraph retrieval mechanisms, combining knowledge graph context and vector retrieval results to guide the language model towards producing more reliable responses. Evaluations using the Langfuse multidimensional scoring mechanism show that our method excels in domain-specific question answering, validating the efficacy of integrating multimodal knowledge graphs with retrieval-augmented generation.
Abstract（参考訳）: 現在の汎用大規模言語モデル(LLM)は、知識幻覚とドメイン固有のタスクにおけるドメイン固有の適応性に乏しく、特殊な質問応答シナリオにおける有効性を制限している。 Retrieval-augmented Generation (RAG) は、精度と妥当性を高めるために外部知識を統合することで、これらの課題に効果的に取り組む。しかし、従来のRAGは、ドメイン固有の質問応答性能を高めるために、生成プロセスにおける知識グラフ品質の重要な役割を強調し、グラフベースのRAGフレームワークに焦点を当てている。 DSRAG(Domain-Specific RAG)は,ドメイン固有アプリケーション向けに設計された多モーダル知識グラフ駆動検索拡張生成フレームワークである。提案手法では,テキストや画像,テーブルなどの異種情報を統合して,概念層とインスタンス層の両方をカバーするマルチモーダルな知識グラフを構築する。この基盤を基盤として,知識グラフコンテキストとベクトル検索結果を組み合わせたセマンティックプルーニングと構造化サブグラフ検索機構を導入し,より信頼性の高い応答を生成するための言語モデルを導出する。 Langfuseの多次元スコアリング機構を用いて評価した結果,本手法はドメイン固有の質問応答に優れており,マルチモーダル知識グラフと検索拡張生成の統合の有効性が検証されている。

論文の概要: DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph

関連論文リスト