Fugu-MT 論文翻訳(概要): Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework

論文の概要: Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework

arxiv url: http://arxiv.org/abs/2605.02266v1
Date: Mon, 04 May 2026 06:20:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.161334
Title: Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework
Title（参考訳）: 信頼性を指向した多言語矯正診断:ドメイン適応モデリングと概念検証フレームワーク
Authors: Danish Ali, Li Xiaojian, Sundas Iqbal, Farrukh Zaidi,
Abstract要約: 英語,ヒンディー語,パンジャービ語におけるフリーテキストによる多言語整形外科診断のシステムレベルでの分析を行った。 i)タスク整列型多言語トランスフォーマーエンコーダ,(ii)タスク細調整ベースライン(DistilBERT),(iii)整形テキストに適したドメイン適応型アーキテクチャの3つのモデリング方式を評価する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) are increasingly proposed for clinical decision support including multilingual diagnosis in low-resource settings. However, their reliability, calibration and safety characteristics remain insufficiently understood for structured, high-risk tasks. We present a system-level analysis of multilingual orthopedic diagnosis from free-text clinical notes in English, Hindi and Punjabi. We evaluate three modeling regimes: (i) task-aligned multilingual transformer encoders, (ii) a task-fine-tuned baseline (DistilBERT), and (iii) a domain-adaptive architecture tailored to orthopedic text (IndicBERT-HPA). These models are compared with zero-shot, instruction-tuned LLMs to assess suitability for structured diagnostic classification. Results indicate that while LLMs exhibit strong linguistic fluency, they show unstable calibration and reduced reliability under structured multilingual conditions, particularly in low-resource languages. These findings are specific to zero-shot evaluation and do not imply limitations of fine-tuned models. Domain-adaptive specialization substantially improves cross-lingual discrimination and confidence behavior. IndicBERT-HPA, with language-specific orthopedic adapter heads achieves consistently strong performance across six diagnostic categories and more predictable deployment characteristics than task-only adaptation. Building on these observations, we outline a conceptual deterministic agent-based validation framework for future implementation, formalizing evidence checks, language-sensitive validation and conservative human-in-the-loop gating. Reliable multilingual clinical decision support requires specialized architecture, explicit reliability analysis, and structured validation for safety-critical systems.
Abstract（参考訳）: 低リソース環境における多言語診断を含む臨床診断支援のために,大規模言語モデル (LLMs) がますます提案されている。しかし、その信頼性、キャリブレーション、安全性は、構造化された高リスクタスクでは十分に理解されていない。英語,ヒンディー語,パンジャービ語におけるフリーテキストによる多言語整形外科診断のシステムレベルでの分析を行った。我々は3つのモデリング体制を評価する。 (i)タスク整合多言語変換器エンコーダ (ii)タスクファインチューニングベースライン(DistilBERT)及び (iii)整形文字(IndicBERT-HPA)に適合したドメイン適応型アーキテクチャ。これらのモデルは、構造化診断分類の適合性を評価するため、ゼロショットの命令調整LDMと比較される。その結果,LLMは言語流速が強いが,低リソース言語では不安定な校正と,構造化多言語条件下での信頼性の低下が示唆された。これらの結果はゼロショット評価に特有であり、微調整モデルの制限を含まない。ドメイン適応型特殊化は言語間差別と信頼行動を大幅に改善する。 IndicBERT-HPAは、言語固有の整形アダプターヘッドを持ち、6つの診断カテゴリで一貫して強力な性能を達成し、タスクのみの適応よりも予測可能なデプロイメント特性を実現している。これらの観測結果に基づいて,概念的決定論的エージェントベース検証フレームワークの概要,エビデンスチェックの形式化,言語に敏感な検証,保守的な人間-イン-ザ-ループゲーティングについて述べる。信頼性の高い多言語臨床決定支援には、特別なアーキテクチャ、明示的な信頼性分析、安全クリティカルシステムのための構造化された検証が必要である。

論文の概要: Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework

関連論文リスト