Fugu-MT 論文翻訳(概要): Large Language Models and Scientific Discourse: Where's the Intelligence?

論文の概要: Large Language Models and Scientific Discourse: Where's the Intelligence?

arxiv url: http://arxiv.org/abs/2603.23543v1
Date: Fri, 13 Mar 2026 14:23:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:13.027043
Title: Large Language Models and Scientific Discourse: Where's the Intelligence?
Title（参考訳）: 大規模言語モデルと科学談話 : インテリジェンスはどこにあるのか?
Authors: Harry Collins, Simon Thorne,
Abstract要約: 科学的知識の作り方について検討し,それを大規模言語モデルと比較する。 LLMはそのような言説にはアクセスできない、あるいはアクセスできない、と論じられている。次に、新しいモンティホールプロンプトを発明し、LCMのパネルと人間のパネルの反応を比較します。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: We explore the capabilities of Large Language Models (LLMs) by comparing the way they gather data with the way humans build knowledge. Here we examine how scientific knowledge is made and compare it with LLMs. The argument is structured by reference to two figures, one representing scientific knowledge and the other LLMs. In a 2014 study, scientists explain how they choose to ignore a 'fringe science' paper in the domain of gravitational wave physics: the decisions are made largely as a result of tacit knowledge built up in social discourse, most spoken discourse, within closed groups of experts. It is argued that LLMs cannot or do not currently access such discourse, but it is typical of the early formation of scientific knowledge. LLMs 'understanding' builds on written literatures and is therefore insecure in the case of the initial stages of knowledge building. We refer to Colin Fraser's 'Dumb Monty Hall problem' where in 2023 ChatGPT failed though a year later or so later LLMs were succeeding. We argue that this is not a matter of improvement in LLMs ability to reason but in the change in the body of human written discourse on which they can draw (or changes being put in by humans 'by hand'). We then invent a new Monty Hall prompt and compare the responses of a panel of LLMs and a panel of humans: they are starkly different but we explain that the previous mechanisms will soon allow the LLMs to align themselves to humans once more. Finally, we look at 'overshadowing' where a settled body of discourse becomes so dominant that LLMs fail to respond to small variations in prompts which render the old answers nonsensical. The 'intelligence' we argue is in the humans not the LLMs
Abstract（参考訳）: 大規模言語モデル(LLM)の能力について、人間の知識構築方法とデータ収集方法を比較して検討する。ここでは、科学的知識の作り方について検討し、LLMと比較する。この議論は、科学的知識を表す2つの図と、他の LLM を参照して構成されている。 2014年の研究では、重力波物理学の分野における「フリンジ・サイエンス」の論文を無視する方法が説明されている。 LLMはそのような言説に現在アクセスできない、あるいはアクセスしていないと論じられているが、これは科学知識の初期形成の典型である。 LLMは文献に基づいて構築されており、知識構築の初期段階では安全ではない。コリン・フレーザーの「ダムブ・モンティホール問題」に言及し、2023年にはChatGPTが失敗したが、その1年後にはLSMが成功した。我々は、LLMの推論能力の改善の問題ではなく、人間の記述した言説の身体の変化(あるいは人間によって「手で」入れられること)の問題であると主張している。私たちは新しいモンティホールのプロンプトを発明し、LLMのパネルと人間のパネルの反応を比較しました。最後に、落ち着いた談話の体が支配的になるような「オーバーシャドウイング」に注目し、LLMは古い答えを無意味なものにするプロンプトの小さなバリエーションに反応しない。我々が主張する「知性」はLLMではなく人間にある

論文の概要: Large Language Models and Scientific Discourse: Where's the Intelligence?

関連論文リスト