Fugu-MT 論文翻訳(概要): How Do Electrocardiogram Models Scale?

論文の概要: How Do Electrocardiogram Models Scale?

arxiv url: http://arxiv.org/abs/2605.17276v1
Date: Sun, 17 May 2026 05:53:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:47.823464
Title: How Do Electrocardiogram Models Scale?
Title（参考訳）: 心電図モデルはどうスケールするか?
Authors: Jiawei Li, Fabio Bonassi, Ming Jin, Stefan Gustafsson, Johan Sundström, Thomas B. Schön, Antônio H. Ribeiro,
Abstract要約: 心電図領域における神経と損失-損失-損失スケーリング法則を系統的に検討した。モデルアーキテクチャ(ResNet vs. Transformer)と事前学習パラダイム、すなわち教師付き学習(SL)と自己教師付き学習(SSL)の効果を分離する。この結果から,効率的なECG基盤モデルへの道筋は,ブルートフォーススケーリングではなく,アーキテクチャとパラダイムの戦略的アライメントにあることが示唆された。
参考スコア（独自算出の注目度）: 19.332418587904375
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While scaling laws have established a fundamental framework for foundation models in natural language processing, their applicability to electrocardiogram (ECG) models remains poorly characterized. Indeed, recent studies do not always yield consistent downstream gains as one increases the model size or pre-training dataset size of ECG models, leaving the exact roles of architectural inductive biases, pre-training paradigms, and expected improvements with size largely unanswered. In this work, we systematically investigate neural and loss-to-loss scaling laws within the ECG domain. By pre-training over $120$ models (ranging from $20$K to $200$M parameters) on the large-scale CODE dataset ($2.3$M records), we decouple the effects of model architecture (ResNet vs. Transformer) and pre-training paradigm, namely supervised learning (SL) versus self-supervised learning (SSL). We found that (i) SL models are data-bottlenecked in-distribution, whereas SSL models scale robustly across both model and data sizes; (ii) for out-of-distribution (OOD) generalization, ResNets are $1.3$ to $2.5$ times more parameter-efficient than Transformers, while SSL is up to $16$ times more data-efficient and achieves up to $7.6$ times higher transfer efficiency than SL on unseen clinical tasks; (iii) across the observed scales, ResNet-based models generally achieve the lowest OOD loss, with SSL dominating on unseen clinical tasks and self-supervised Transformers overtaking at very large model sizes. Our results suggest that the path to effective ECG foundation models lies in the strategic alignment of architecture and paradigm rather than brute-force scaling.
Abstract（参考訳）: スケーリング法則は自然言語処理の基礎モデルの基礎的枠組みを確立してきたが、心電図(ECG)モデルの適用性は依然として不十分である。実際、最近の研究では、ECGモデルのモデルサイズや事前学習データセットサイズが増加し、アーキテクチャ上の帰納的バイアス、事前学習パラダイム、サイズによる期待される改善といった正確な役割を残しているため、常に下流での利得が得られるとは限らない。本研究では,ECG領域におけるニューラルおよびロス・ツー・ロスのスケーリング法則を系統的に検討する。大規模なCODEデータセット(2.3$Mレコード)上で120ドル以上のモデル(20ドルから200ドルまで)を事前トレーニングすることにより、モデルアーキテクチャ(ResNet vs. Transformer)と事前トレーニングパラダイム、すなわち教師付き学習(SL)と自己教師付き学習(SSL)の効果を分離する。私たちはそれを発見しました (i)SLモデルはデータ・ブートネック・イン・ディストリビューションであり、SSLモデルはモデルとデータサイズの両方にわたって堅牢にスケールする。 (ii)アウト・オブ・ディストリビューション(OOD)の一般化では、ResNetsはTransformersよりもパラメータ効率が1.3ドルから2.5ドル高い。 3) ResNet をベースとしたモデルでは,一般的に OOD の損失が最少であり,SSL は未確認の臨床的タスクに支配的であり,トランスフォーマーは極めて大きなモデルサイズを超越している。この結果から,効率的なECG基盤モデルへの道筋は,ブルートフォーススケーリングではなく,アーキテクチャとパラダイムの戦略的アライメントにあることが示唆された。

論文の概要: How Do Electrocardiogram Models Scale?

関連論文リスト