Fugu-MT paper translation (abstract): Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

Paper abstract: Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

arxiv url: http://arxiv.org/abs/2604.27911v1
Date: Thu, 30 Apr 2026 14:18:56 GMT
Status: translation complete
Last updated in system: 2026-05-01 16:31:54.131727
Title: Physical Foundation Models: Fixed hardware implementations of large-scale neural networks
Title (reference translation): Physical Foundation Models: hardware implementations of large-scale neural networks
Authors: Logan G Wright, Tianyu Wang, Tatsuhiro Onodera, Peter L. McMahon
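Abstract summary: Foundation models are deep neural networks trained on large datasets that can perform diverse downstream tasks. We argue that the rise of foundation models presents an opportunity for hardware engineers. We advocate a more radical re-thinking: hardware in which the neural network is realized directly at the level of the physical design.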
Reference score (independently computed attention level): 6.1610941441344815
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Foundation models are deep neural networks (such as GPT-5, Gemini~3, and Opus~4) trained on large datasets that can perform diverse downstream tasks -- text and code generation, question answering, summarization, image classification, and so on. The philosophy of foundation models is to put effort into a single, large (${\sim}10^{12}$-parameter) general-purpose model that can be adapted to many downstream tasks with no or minimal additional training. We argue that the rise of foundation models presents an opportunity for hardware engineers: in contrast to when different models were used for different tasks, it now makes sense to build special-purpose, fixed hardware implementations of neural networks, manufactured and released at the roughly 1-year cadence of major new foundation-model versions. Beyond conventional digital-electronic inference hardware with read-only weight memory, we advocate a more radical re-thinking: hardware in which the neural network is realized directly at the level of the physical design and operates via the hardware's natural physical dynamics -- \textit{Physical Foundation Models} (PFMs). PFMs could enable orders-of-magnitude advantages in energy efficiency, speed, and parameter density. For ${\sim}10^{12}$-parameter models, this would both reduce the high energy burden of AI in datacenters and enable AI in edge devices that today are power-constrained to far smaller models. PFMs could also enable inference hardware for models much larger than current ones: $10^{15}$- or even $10^{18}$-parameter PFMs seem plausible by some measures. We present back-of-the-envelope calculations illustrating PFM scaling using an optical example -- a 3D nanostructured glass medium -- and discuss prospects in nanoelectronics and other physical platforms. We conclude with the major research challenges that must be resolved for trillion-parameter PFMs and beyond to become reality.
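Abstract (reference translation): Foundation models are deep neural networks (such as GPT-5, Gemini~3, and Opus~4) trained on large datasets that can perform diverse downstream tasks, including text and code generation, question answering, summarization, and image classification. The philosophy of foundation models is to put effort into a single, large (${\sim}10^{12}$-parameter) general-purpose model. In contrast to when different models were used for different tasks, it now makes sense to build special-purpose, fixed hardware implementations of neural networks. Beyond conventional digital-electronic inference hardware with read-only weight memory, we advocate a more radical re-thinking: hardware in which the neural network is realized directly at the level of the physical design and operates via the hardware's natural physical dynamics, namely \textit{Physical Foundation Models} (PFMs). PFMs could enable orders-of-magnitude advantages in energy efficiency, speed, and parameter density. For ${\sim}10^{12}$-parameter models, this would both reduce the high energy burden of AI in datacenters and enable AI in edge devices that today are power-constrained to far smaller models. $10^{15}$- or even $10^{18}$-parameter PFMs seem plausible by some measures. We present back-of-the-envelope calculations illustrating PFM scaling using an optical example, a 3D nanostructured glass medium, and discuss prospects in nanoelectronics and other physical platforms. We conclude with the major research challenges that must be resolved for trillion-parameter PFMs to become reality.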


Related papers