Fugu-MT 論文翻訳(概要): An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process

論文の概要: An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process

arxiv url: http://arxiv.org/abs/2603.13584v1
Date: Fri, 13 Mar 2026 20:49:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.28468
Title: An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process
Title（参考訳）: 科学的プロセスにおける事前学習深層学習モデルの再利用に関する実証的研究
Authors: Nicholas M. Synovic, Karolina Ryzka, Alessandra V. Vellucci Solari, Kenny Lyons, James C. Davis, George K. Thiruvathukal,
Abstract要約: 自然科学におけるPTMの再利用パターンに関する最初の実証的研究について述べる。我々は、17,511個のピアレビュー、オープンアクセス論文を分析し、科学分野によるPTMの再利用、関連する再利用パターン、および科学的プロセスへのPTMの統合の影響を明らかにする。
参考スコア（独自算出の注目度）: 40.399530303181265
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deep learning has achieved recognition for its impact within natural sciences, however scientists are inhibited by the prohibitive technical cost and computational complexity of training project specific models from scratch. Following software engineering community guidance, natural scientists are reusing pre-trained deep learning models (PTMs) to amortize these costs. While prior works recommend PTM reuse patterns, to our knowledge, little work has been done to empirically evaluate their usage and impact within the natural sciences. We present the first empirical study of PTM reuse patterns in the natural sciences, quantifying the utilization and impact of conceptual, adaptation, and deployment reuse within the scientific process. Leveraging an automated large language model driven pipeline, we analyze 17,511 peer reviewed, open access papers to identify PTM reuse by scientific field, associated reuse patterns, and the impact of PTM integration into the scientific process from January 1st, 2000 to December 10th, 2025. Our results show that "Biochemistry, Genetics and Molecular Biology" has outpaced other natural scientific fields in PTM reuse, "adaptation" reuse is the most prevalent PTM reuse pattern identified across all natural science fields, and the "Test" stage of the scientific process has been most impacted by PTM integration. This aligns with the growing interest of leveraging computational methods to conduct high throughput, data driven scientific research. Our work characterizes and identifies current PTM reuse practices within the natural sciences, evaluates their impact on the scientific process, and establishes a foundation for future work into the implementation and broader scientific implications of PTM reuse.
Abstract（参考訳）: ディープラーニングは、自然科学におけるその影響について認識されているが、科学者は、プロジェクト固有のモデルをゼロからトレーニングすることによる、技術的コストと計算の複雑さによって妨げられている。ソフトウェアエンジニアリングコミュニティのガイダンスに従って、自然科学者はこれらのコストを償却するために事前訓練されたディープラーニングモデル(PTM)を再利用している。従来の研究では、PTMの再利用パターンを推奨していましたが、私たちの知る限り、自然科学におけるその使用と影響を実証的に評価する作業はほとんど行われていません。本研究は, 自然科学におけるPTM再利用パターンの実証的研究であり, 科学プロセスにおける概念的, 適応的, 展開的再利用の活用と影響を定量化するものである。 2000年1月1日から2025年12月10日まで,大規模言語モデル駆動パイプラインの自動化を活用して17,511個のピアレビュー,オープンアクセス論文を分析し,科学分野によるPTM再利用の特定,関連する再利用パターン,および科学的プロセスへのPTM統合の影響について検討した。以上の結果から, 生物化学, 遺伝学, 分子生物学は, PTM の再利用における他の自然科学分野よりも大きくなり, 適応的再利用はすべての自然科学分野において最も広く認識されている PTM の再利用パターンであり, 科学的プロセスの「テスト」段階は PTM 統合によって最も影響を受けていることが明らかとなった。これは、高いスループット、データ駆動科学研究を実行するために計算手法を活用することへの関心の高まりと一致している。本研究は, 自然科学における現在のPTM再利用の実践を特徴づけ, 同定し, 科学的プロセスへの影響を評価し, PTM再利用の実践と幅広い科学的影響に関する今後の研究の基盤を確立するものである。

論文の概要: An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process

関連論文リスト