Fugu-MT 論文翻訳(概要): A Principled Framework for Safe Algorithm Updates in Automated Insulin Delivery Systems

論文の概要: A Principled Framework for Safe Algorithm Updates in Automated Insulin Delivery Systems

arxiv url: http://arxiv.org/abs/2606.13882v1
Date: Thu, 11 Jun 2026 20:18:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.63622
Title: A Principled Framework for Safe Algorithm Updates in Automated Insulin Delivery Systems
Title（参考訳）: 自動インスリンデリバリーシステムにおける安全アルゴリズム更新のための原則的フレームワーク
Authors: Thomas Screven, Ziqiang "Joe" Zhu, Deniz Cengiz, Rayhan A. Lal, Korey K. Hood, Samuel T. King,
Abstract要約: 我々のフレームワークはバグを分類し、AIDシステムソフトウェア更新の臨床的等価性を評価する。システムに依存しず、広く使われているすべてのOS-AIDシステムに適用できる。
参考スコア（独自算出の注目度）: 0.1915630210833957
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Background: AID algorithms require ongoing software updates and bug fixes. In co-adapted systems, where users tune settings around existing algorithmic behavior, bug fixes can paradoxically disrupt glycemic control. No principled framework evaluates the safety of AID algorithm updates. Methods: Our two-part framework classifies bugs and evaluates the clinical equivalence of AID system software updates. Bugs are classified as factual, heuristic, or computational, each with distinct management strategies. Classifications were validated from porting Trio's oref algorithm from Javascript to a bug-fixed Swift implementation. We compared implementations using shadow execution on 736,480 invocations from eight Trio users. The second component assesses clinical equivalence with error analysis on paired glucose values, applied to both Trio implementations using mechanistic in silico and data-driven replay simulation. Results: In mechanistic in silico simulation, the Swift and Javascript implementations produced nearly identical Time in Range (84.9% vs. 84.9%) and Glycemia Risk Index (23.5% vs. 23.9%), with more than 99% of paired glucose in Parkes Error Grid Zones A and B, meeting our clinical equivalence threshold. Shadow execution showed low mismatch rates in oref components (iob 0.43%, autosens 1.22%, determineBasal 0.07%, meal 0.01%), with clinically meaningful differences in 0.03% of iob invocations. Data-driven replay simulations of bugs revealed more than 99% of downstream paired glucose in Parkes Error Grid Zones A and B, also meeting our clinical equivalence threshold. Conclusions: Our framework integrates bug-fixing principles with multi-method clinical evaluation to assess AID algorithm update safety. It is system-agnostic and applicable to all widely used OS-AID systems, with case studies highlighting the need for systematic remediation of factual and computational bugs.
Abstract（参考訳）: 背景: AIDアルゴリズムは進行中のソフトウェア更新とバグ修正を必要とする。既存のアルゴリズムの動作に関する設定をユーザが調整する、共適応システムでは、バグ修正がグリセミック制御をパラドックス的に破壊する可能性がある。 AIDアルゴリズムのアップデートの安全性を評価するフレームワークは存在しない。方法: この2つのフレームワークはバグを分類し, AID システムソフトウェア更新の臨床的等価性を評価する。バグは現実的、ヒューリスティック的、あるいは計算的に分類され、それぞれ異なる管理戦略を持つ。分類は、TrioのオレフアルゴリズムをJavascriptからバグ修正されたSwift実装に移植することから検証された。 8人のTrioユーザの736,480件の呼び出しに対して,シャドウ実行を用いた実装を比較した。第2の構成要素は, 2組のグルコース値の誤差解析による臨床等価性を評価し, メカニスティック・イン・サイリコとデータドリブン・リプレイ・シミュレーションを用いて, 両方のトリオ実装に適用した。結果:シリコシミュレーションのメカニスティックでは、SwiftとJavascriptの実装はほぼ同じ時間帯(84.9%対84.9%)とグリセミアリスク指数(23.5%対23.9%)を生成し、Parkes Error Grid Zones AとBのペアブドウ糖の99%以上を臨床等価値を満たした。シャドー実行では,オリーフ成分のミスマッチ率 (ob 0.43%,Autosens 1.22%,DeferBasal 0.07%,食食0.01%) が低く,iob投与の0.03%に臨床的に有意な差が認められた。データ駆動によるバグリプレイシミュレーションの結果,Parkes Error Grid Zones A と B の下流のペアブドウ糖の99%以上が検出された。結論: このフレームワークは, AIDアルゴリズムの安全性を評価するために, バグフィックスの原則とマルチメソッド臨床評価を統合している。システムに依存しず、広く使われているすべてのOS-AIDシステムに適用できる。

論文の概要: A Principled Framework for Safe Algorithm Updates in Automated Insulin Delivery Systems

関連論文リスト