Fugu-MT 論文翻訳(概要): Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

論文の概要: Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

arxiv url: http://arxiv.org/abs/2606.18420v1
Date: Tue, 16 Jun 2026 19:15:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-18 17:16:50.861242
Title: Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction
Title（参考訳）: 生体医学的予測における非線形モデルの非線形モデルに対する利点を計測ノイズが制限する
Authors: Marc-Andre Schulz, Kerstin Ritter,
Abstract要約: 生体医学データでは、フレキシブルモデルが線形回帰とロジスティック回帰によって繰り返し一致または打ち負かされる。これらの修正は、バインディング制限がモデルではなく測定値である場合に役に立ちません。 140の英国バイオバンクタスクで、フレキシブルモデルとリニアモデルの間には、予測されたノイズシグネチャがある。
参考スコア（独自算出の注目度）: 0.3093890460224435
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: On biomedical tabular data, flexible models such as deep networks, gradient-boosted trees, and kernel methods are repeatedly matched or beaten by linear and logistic regression given the same features. The usual reaction is to treat this as a model-side shortfall, to be fixed with more data, a better architecture, or tuning, on the assumption that the nonlinear structure is there and the model has failed to capture it. We argue that these fixes cannot help when the binding limit is the measurement rather than the model, as it frequently is in biomedicine. Additive noise blurs the population-optimal predictor, and because blurring removes a function's fine, rapidly varying detail before its broad shape, it erases nonlinear structure faster than linear structure. A degree-$k$ interaction is attenuated by the $k$-th power of feature reliability, while the linear part is attenuated only once. At the reliabilities typical of biomedical measurement, the nonlinear advantage can vanish even when the underlying biology is strongly nonlinear, and what the noise removes cannot be recovered by a larger cohort or a more flexible model, only by better measurement. The nonlinearity is hidden, not absent, and a tie between linear and flexible models is not by itself a verdict on the biology. These pieces are classical, drawn from measurement-error statistics, psychometrics, and Gaussian analysis, and we assemble them into an exact excess-risk identity. Measurement reliability is one of three conditions, alongside sample size and feature representation, that must align for a flexible model to help, and together they leave only a narrow window that most biomedical tasks fall outside. Across 140 UK Biobank tasks, the gap between flexible and linear models, where it exists, carries the predicted noise signature, and the three conditions can be separated by intervention but not by a benchmark alone.
Abstract（参考訳）: 生体図表データでは, 深層ネットワーク, 勾配木, カーネルメソッドなどの柔軟なモデルが, 同じ特徴から線形回帰やロジスティック回帰によって繰り返し一致するか, 打ち負かされる。通常の反応は、これをモデル側の欠点として扱い、より多くのデータで固定され、より優れたアーキテクチャやチューニングが、非線形構造が存在し、モデルがそれをキャプチャできなかったという仮定に基づいて行われる。これらの修正は、しばしば生体医学において、結合限界がモデルではなく測定値である場合に役に立たないと論じる。付加雑音は集団最適予測器を曖昧にし、ぼかしによって関数の細かな細部が広くなる前に急速に変化するため、線形構造よりも高速に非線形構造を消去する。次数-$k$相互作用は、機能信頼性の$k$-第3のパワーによって減衰されるが、線形部分は1度だけ減衰される。バイオメディカル測定の典型的な信頼性において、基礎となる生物学が強く非線形である場合でも非線形の利点は消え、より大きなコホートやより柔軟なモデルによって除去されるノイズは、より良い測定によってのみ回収できない。非線形性は隠され、欠落せず、線形モデルとフレキシブルモデルの間の結びつきは、それ自体生物学の判断ではない。これらの楽曲は古典的であり、測定エラー統計、サイコメトリック、ガウス分析から抽出され、我々はそれらを正確に過度なリスクのアイデンティティに組み立てる。測定信頼性は、サンプルサイズと特徴表現の3つの条件のうちの1つであり、柔軟性のあるモデルが役に立つように調整する必要がある。 140の英国バイオバンクタスク全体では、フレキシブルモデルと線形モデルの間のギャップは予測されたノイズシグネチャを持ち、これら3つの条件は介入によって分離できるが、ベンチマークだけでは分離できない。

論文の概要: Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

関連論文リスト