Fugu-MT 論文翻訳(概要): Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?

論文の概要: Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?

arxiv url: http://arxiv.org/abs/2603.09981v1
Date: Mon, 09 Feb 2026 17:02:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-15 16:38:22.520105
Title: Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?
Title（参考訳）: 大規模言語モデルと書籍要約 - 読み書きか覚えか - どちらがよいか?
Authors: Tairan Fu, Javier Conde, Pedro Reviriego, Javier Coronado-Blázquez, Nina Melero, Elena Merino-Gómez,
Abstract要約: モデルの内部知識と本の全文のみを用いて作成した要約を比較した。以上の結果から,全文は概してより詳細な要約を提供するが,内部知識要約のスコアが向上する本もある。これは、長文の要約を行うモデルの能力に疑問を呈する。
参考スコア（独自算出の注目度）: 3.791270012079502
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Summarization is a core task in Natural Language Processing (NLP). Recent advances in Large Language Models (LLMs) and the introduction of large context windows reaching millions of tokens make it possible to process entire books in a single prompt. At the same time, for well-known books, LLMs can generate summaries based only on internal knowledge acquired during training. This raises several important questions: How do summaries generated from internal memory compare to those derived from the full text? Does prior knowledge influence summaries even when the model is given the book as input? In this work, we conduct an experimental evaluation of book summarization with state-of-the-art LLMs. We compare summaries of well-known books produced using (i) only the internal knowledge of the model and (ii) the full text of the book. The results show that having the full text provides more detailed summaries in general, but some books have better scores for the internal knowledge summaries. This puts into question the capabilities of models to perform summarization of long texts, as information learned during training can outperform summarization of the full text in some cases.
Abstract（参考訳）: 要約は自然言語処理(NLP)のコアタスクである。大規模言語モデル(LLM)の最近の進歩と数百万のトークンに到達した大きなコンテキストウィンドウの導入により、本全体をひとつのプロンプトで処理することが可能になった。同時に、有名な書籍では、LLMはトレーニング中に得られた内部知識に基づいて要約を生成することができる。内部メモリから生成されたサマリは、フルテキストから生成されたサマリとどのように比較されますか? モデルが入力として与えられた場合でも,事前知識が要約に影響を与えるか? 本研究では,本書の要約を最先端のLCMを用いて実験的に評価する。私たちはよく知られた書籍の要約を比較する一モデルの内部知識及び内部知識のみ (二)本の全文以上の結果から,全文は概してより詳細な要約を提供するが,内部知識要約のスコアが向上する本もある。このことは、長いテキストの要約を行うモデルの能力に疑問を呈し、トレーニング中に学んだ情報は、場合によっては全テキストの要約よりも優れている。

論文の概要: Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?

関連論文リスト