Fugu-MT 論文翻訳(概要): Resolving gradient pathology in physics-informed epidemiological models

論文の概要: Resolving gradient pathology in physics-informed epidemiological models

arxiv url: http://arxiv.org/abs/2603.23799v1
Date: Wed, 25 Mar 2026 00:19:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.065789
Title: Resolving gradient pathology in physics-informed epidemiological models
Title（参考訳）: 物理学インフォームド疫学モデルにおける解決的勾配病理学
Authors: Nickson Golooba, Woldegebriel Assefa Woldegerima,
Abstract要約: 疫学モデリングのための物理インフォームドニューラルネットワークにおける勾配競合に対処する新しい手法であるコンフリクトゲート勾配スケーリング(CGGS)を提案する。正規化のみを行うスキームとは異なり、CGGSはコンフリクトが高い場合の物理的な制約を抑え、データの忠実度を優先することができる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Physics-informed neural networks (PINNs) are increasingly used in mathematical epidemiology to bridge the gap between noisy clinical data and compartmental models, such as the susceptible-exposed-infected-removed (SEIR) model. However, training these hybrid networks is often unstable due to competing optimization objectives. As established in recent literature on ``gradient pathology," the gradient vectors derived from the data loss and the physical residual often point in conflicting directions, leading to slow convergence or optimization deadlock. While existing methods attempt to resolve this by balancing gradient magnitudes or projecting conflicting vectors, we propose a novel method, conflict-gated gradient scaling (CGGS), to address gradient conflicts in physics-informed neural networks for epidemiological modelling, ensuring stable and efficient training and a computationally efficient alternative. This method utilizes the cosine similarity between the data and physics gradients to dynamically modulate the penalty weight. Unlike standard annealing schemes that only normalize scales, CGGS acts as a geometric gate: it suppresses the physical constraint when directional conflict is high, allowing the optimizer to prioritize data fidelity, and restores the constraint when gradients align. We prove that this gating mechanism preserves the standard $O(1/T)$ convergence rate for smooth non-convex objectives, a guarantee that fails under fixed-weight or magnitude-balanced training when gradients conflict. We demonstrate that this mechanism autonomously induces a curriculum learning effect, improving parameter estimation in stiff epidemiological systems compared to magnitude-based baselines. Our empirical results show improved peak recovery and convergence over magnitude-based methods.
Abstract（参考訳）: 物理インフォームドニューラルネットワーク(PINN)は、ノイズのある臨床データと包括的モデルのギャップを埋めるために数学疫学でますます使われてきている。しかし、これらのハイブリッドネットワークのトレーニングは、しばしば競合する最適化の目的のために不安定である。最近の『段階的病理学』で確立されているように、データ損失と物理的残留から導かれる勾配ベクトルは、しばしば矛盾する方向を指し、収束が遅くなり、デッドロックが最適化される。既存の手法では、勾配の等級のバランスをとることや、矛盾するベクトルを投影することによる解決を試みているが、我々は、物理インフォームドニューラルネットワークによる疫学モデリングのための勾配の競合に対処し、安定かつ効率的なトレーニングを確実にし、計算効率の良い代替手段であるCGGS(コンフリクトゲート勾配スケーリング)を提案する。この方法は、データと物理勾配のコサイン類似性を利用して、ペナルティウェイトを動的に変調する。スケールのみを正規化する標準的なアニール方式とは異なり、CGGSは幾何学的なゲートとして機能し、方向の衝突が大きいときの物理的制約を抑え、最適化器がデータの忠実度を優先順位付けし、勾配が整ったときの制約を復元する。このゲーティング機構は, 勾配が矛盾する場合, 一定の重み付けや等級バランスのトレーニングで失敗するスムーズな非凸目標に対して, 標準の$O(1/T)$収束率を保っていることを証明した。本研究では,このメカニズムがカリキュラム学習効果を自律的に引き起こし,等級ベースラインと比較して硬質疫学システムにおけるパラメータ推定を改善することを実証する。実験の結果, ピーク回復と収束性は, 等級法よりも向上した。

論文の概要: Resolving gradient pathology in physics-informed epidemiological models

関連論文リスト