Fugu-MT 論文翻訳(概要): A Survey on Retrieval And Structuring Augmented Generation with Large Language Models

論文の概要: A Survey on Retrieval And Structuring Augmented Generation with Large Language Models

arxiv url: http://arxiv.org/abs/2509.10697v1
Date: Fri, 12 Sep 2025 21:25:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:22.736691
Title: A Survey on Retrieval And Structuring Augmented Generation with Large Language Models
Title（参考訳）: 大規模言語モデルを用いた検索・構造化世代に関する調査研究
Authors: Pengcheng Jiang, Siru Ouyang, Yizhu Jiao, Ming Zhong, Runchu Tian, Jiawei Han,
Abstract要約: 大規模言語モデル(LLM)は、テキスト生成と推論において顕著な能力を持つ自然言語処理に革命をもたらした。しかし、これらのモデルは、時代遅れの知識や限られたドメインの専門知識を含む、現実世界のアプリケーションにデプロイする際の重要な課題に直面します。 Retrieval And Structuring (RAS) Augmented Generationは、動的情報検索と構造化知識表現を統合することで、これらの制限に対処する。
参考スコア（独自算出の注目度）: 29.707181003761004
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have revolutionized natural language processing with their remarkable capabilities in text generation and reasoning. However, these models face critical challenges when deployed in real-world applications, including hallucination generation, outdated knowledge, and limited domain expertise. Retrieval And Structuring (RAS) Augmented Generation addresses these limitations by integrating dynamic information retrieval with structured knowledge representations. This survey (1) examines retrieval mechanisms including sparse, dense, and hybrid approaches for accessing external knowledge; (2) explore text structuring techniques such as taxonomy construction, hierarchical classification, and information extraction that transform unstructured text into organized representations; and (3) investigate how these structured representations integrate with LLMs through prompt-based methods, reasoning frameworks, and knowledge embedding techniques. It also identifies technical challenges in retrieval efficiency, structure quality, and knowledge integration, while highlighting research opportunities in multimodal retrieval, cross-lingual structures, and interactive systems. This comprehensive overview provides researchers and practitioners with insights into RAS methods, applications, and future directions.
Abstract（参考訳）: 大規模言語モデル(LLM)は、テキスト生成と推論において顕著な能力を持つ自然言語処理に革命をもたらした。しかし、これらのモデルは、幻覚生成、時代遅れの知識、限られたドメインの専門知識など、現実世界のアプリケーションにデプロイするとき、重大な課題に直面します。 Retrieval And Structuring (RAS) Augmented Generationは、動的情報検索と構造化知識表現を統合することで、これらの制限に対処する。本研究では,(1)外的知識にアクセスするための疎密でハイブリッドなアプローチ,(2)非構造的テキストを組織化された表現に変換する分類構築,階層分類,情報抽出などのテキスト構造化手法,(3)これらの構造化された表現が,素早い手法,推論フレームワーク,知識埋め込み技術を通じてLLMとどのように統合されるかを検討する。また、検索効率、構造品質、知識統合における技術的課題を特定し、マルチモーダル検索、言語間構造、インタラクティブシステムの研究機会を強調している。この包括的概要は、RASメソッド、アプリケーション、将来の方向性に関する洞察を研究者や実践者に提供する。

論文の概要: A Survey on Retrieval And Structuring Augmented Generation with Large Language Models

関連論文リスト