Fugu-MT 論文翻訳(概要): Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

論文の概要: Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

arxiv url: http://arxiv.org/abs/2604.23430v1
Date: Sat, 25 Apr 2026 19:52:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.332178
Title: Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models
Title（参考訳）: 大規模言語モデルにおける文脈学習とPrompt-Chainingによる科学テキストの自動分類
Authors: Gautam Kishore Shahi, Oliver Hummel,
Abstract要約: 本研究は,科学的テキストの分析において,既成の大規模言語モデル(LLM)の性能を体系的に評価する。 In-Context Learning (ICL) と Prompt Chaining の先進的なエンジニアリング戦略の有効性を検討した。実験の結果, プロンプト連鎖は純粋なICLに比べ, 分類精度が優れていることがわかった。
参考スコア（独自算出の注目度）: 4.1824815480811806
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The relentless expansion of scientific literature presents significant challenges for navigation and knowledge discovery. Within Research Information Retrieval, established tasks such as text summarization and classification remain crucial for enabling researchers and practitioners to effectively navigate this vast landscape, so that efforts have increasingly been focused on developing advanced research information systems. These systems aim not only to provide standard keyword-based search functionalities but also to incorporate capabilities for automatic content categorization within knowledge-intensive organizations across academia and industry. This study systematically evaluates the performance of off-the-shelf Large Language Models (LLMs) in analyzing scientific texts according to a given classification scheme. We utilized the hierarchical ORKG taxonomy as a classification framework, employing the FORC dataset as ground truth. We investigated the effectiveness of advanced prompt engineering strategies, namely In-Context Learning (ICL) and Prompt Chaining, and experimentally explored the influence of the LLMs' temperature hyperparameter on classification accuracy. Our experiments demonstrate that Prompt Chaining yields superior classification accuracy compared to pure ICL, particularly when applied to the nested structure of the ORKG taxonomy. LLMs with prompt chaining outperform the state-of-the-art models for domain (1st level) prediction and show even better performance for subject (2nd level) prediction compared to the older BERT model. However, LLMs are not yet able to perform well in classifying the topic (3rd level) of research areas based on this specific hierarchical taxonomy, as they only reach about 50% accuracy even with prompt chaining.
Abstract（参考訳）: 科学文献の絶え間ない拡大は、ナビゲーションと知識発見に重大な課題をもたらす。研究情報検索の中では、研究者や実践者がこの広大な景観を効果的にナビゲートするためには、テキスト要約や分類などの確立されたタスクが不可欠であり、高度な研究情報システムの開発に力を入れている。これらのシステムは,標準的なキーワードベースの検索機能の提供だけでなく,学術・産業の知識集約型組織におけるコンテンツの自動分類機能の導入も目指している。本研究は, 既成の大規模言語モデル(LLM)の性能を, 与えられた分類体系に従って分析する手法として, 系統的に評価する。我々は、階層的なORKG分類を分類の枠組みとして利用し、FORCデータセットを基礎的真理として利用した。本研究では,ICL(In-Context Learning)とPrompt Chaining(Prompt Chaining)の高度な技術戦略の有効性について検討し,LLMの温度ハイパーパラメータが分類精度に与える影響を実験的に検討した。以上の結果から,Pmpt Chainingは純粋なICLよりも高い分類精度を示し,特にORKG分類のネスト構造に適用した場合に有効であることがわかった。即時連鎖によるLLMは、ドメイン(第1レベル)予測のための最先端モデルよりも優れており、古いBERTモデルよりも被写体(第2レベル)予測の方が優れた性能を示している。しかし、LLMは、この特定の階層的な分類に基づいて研究領域のトピック(第3レベル)を分類する上で、まだうまく機能していない。

論文の概要: Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

関連論文リスト