Fugu-MT Paper Translation (Abstract): Towards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorization

Paper overview: Towards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorization

arxiv url: http://arxiv.org/abs/2606.02487v1
Date: Mon, 01 Jun 2026 16:57:51 GMT
Status: Translation complete
Last updated in system: 2026-06-02 21:34:32.518353
Title: Towards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorization
Title (reference translation): Towards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorization
Authors: Baris Karacan, Vaibhav Bhargava, Barbara Di Eugenio, Natalie Parde, Mary Khetani, Yu-Shan Tseng, Vanessa Barbosa, Julie Vignato, Lindsey Knake, Rajashree Dahal, Emily Spellman, Danielle Hitzel, Janine Petitgout, Kristi Haughey, Amanda Karstens, Brianna Clarahan, Rachel Dawson, Lauren Boyd, Mackenzie Weis, Angie Tipton, Jaewon Bae, Catherine K. Craven, Karen Dunn Lopez, Andrew D. Boyd
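Abstract summary: This study proposes a clinical provenance categorization pipeline using supervised fine-tuning (SFT) of large language models (LLMs). Two Llama-3 models were adapted to MedSecId, a corpus of 2,002 MIMIC-III (Adult ICU) notes. Model capacity (8B vs. 70B) and quantization were evaluated on a gold-standard dataset of 227 sentence-level spans.
Reference score (independently computed attention score): 10.976481873409531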
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Effective "all-team" summarization in high-complexity settings like the Neonatal Intensive Care Unit (NICU) requires aggregating insights from diverse disciplines (physicians, nurses, therapists) spread across hundreds of clinical free-text notes. Simply pooling heterogeneous text often leads to incoherent outputs. Structured summarization therefore first requires accurate categorization of sentence-level provenance across multi-source notes. This pilot study introduces a clinical provenance categorization pipeline using supervised fine-tuning (SFT) of large language models (LLMs). We adapted two Llama-3 models (8B and 70B) to MedSecId, a corpus of 2,002 MIMIC-III (Adult ICU) notes annotated with clinical provenance headers, achieving in-domain Macro F1 scores above 92% for both models. To evaluate cross-domain generalization, we assessed model capacity (8B vs. 70B) and quantization on a gold-standard dataset of 227 sentence-level spans derived from three multi-disciplinary NICU summaries. Experimental results demonstrate a scale-dependent transfer effect: while SFT produced only marginal changes for the 8B model, it substantially improved the 70B model, increasing Macro F1 by 7%. Notably, the quantized fine-tuned 70B model outperformed its full-precision baseline while substantially reducing computational requirements. These findings suggest that sufficient model capacity is critical for preserving semantic flexibility during cross-domain clinical transfer and that efficient quantized adaptation can enable structured provenance modeling for downstream summarization.
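Abstract (reference translation): Effective "all-team" summarization in high-complexity settings such as the Neonatal Intensive Care Unit (NICU) requires aggregating insights from diverse disciplines (physicians, nurses, therapists) spread across hundreds of clinical free-text notes. Simply pooling heterogeneous text often leads to incoherent outputs. Structured summarization therefore first requires accurate categorization of sentence-level provenance across multi-source notes. This study proposes a clinical provenance categorization pipeline using supervised fine-tuning (SFT) of large language models (LLMs). Two Llama-3 models (8B and 70B) were adapted to MedSecId, a corpus of 2,002 MIMIC-III (Adult ICU) notes, and both models achieved Macro F1 scores above 92%. To evaluate cross-domain generalization, model capacity (8B vs. 70B) and quantization were assessed on a gold-standard dataset of 227 sentence-level spans derived from three multidisciplinary NICU summaries. SFT produced only marginal changes for the 8B model but substantially improved the 70B model, increasing Macro F1 by 7%. Notably, the quantized fine-tuned 70B model outperformed its full-precision baseline while substantially reducing computational requirements. These results suggest that sufficient model capacity is essential for preserving semantic flexibility during cross-domain transfer.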
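
The abstract reports results as Macro F1, the unweighted mean of per-class F1 scores, so rare provenance categories count as much as frequent ones. The following is a minimal Python sketch of that metric, not the authors' evaluation code. The function name `macro_f1` and the label strings in the example are hypothetical; the abstract does not list the actual category set.

```python
from collections import defaultdict


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    labels = sorted(set(gold) | set(pred))
    if not labels:
        return 0.0

    tp = defaultdict(int)  # true positives per label
    fp = defaultdict(int)  # false positives per label
    fn = defaultdict(int)  # false negatives per label
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p, but it was wrong
            fn[g] += 1  # gold label g was missed

    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical sentence-level provenance labels (assumed for illustration only)
gold = ["physician", "nursing", "therapy", "nursing"]
pred = ["physician", "nursing", "nursing", "nursing"]
score = macro_f1(gold, pred)
```

In this toy example the "therapy" class is never predicted, so its F1 of 0 pulls the average down sharply even though three of four sentences are labeled correctly. The same value should be obtainable from scikit-learn's `f1_score` with `average="macro"`.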

Related papers