Fugu-MT 論文翻訳(概要): MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs

論文の概要: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs

arxiv url: http://arxiv.org/abs/2509.07622v1
Date: Tue, 09 Sep 2025 11:52:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-10 14:38:27.289511
Title: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
Title（参考訳）: MultiClinSUMにおけるMaLei : LLMを用いたパースペクティブ・イテレーティブ・セルフプロンピングによる臨床文書の要約
Authors: Libo Ren, Yee Man Ng, Lifeng Han,
Abstract要約: 本稿では,MultiClinSUM共有タスクにおける臨床事例文書の要約手法について述べる。我々はLLMにタスク固有のプロンプトを生成するよう依頼することで,大規模言語モデル(LLM)に反復的自己プロンプト手法を適用した。語彙と埋め込み空間のメトリクスであるROUGEとBERTスコアを用いて、エポックによるモデル微調整をガイドした。
参考スコア（独自算出の注目度）: 5.40185721303932
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Efficient communication between patients and clinicians plays an important role in shared decision-making. However, clinical reports are often lengthy and filled with clinical jargon, making it difficult for domain experts to identify important aspects in the document efficiently. This paper presents the methodology we applied in the MultiClinSUM shared task for summarising clinical case documents. We used an Iterative Self-Prompting technique on large language models (LLMs) by asking LLMs to generate task-specific prompts and refine them via example-based few-shot learning. Furthermore, we used lexical and embedding space metrics, ROUGE and BERT-score, to guide the model fine-tuning with epochs. Our submission using perspective-aware ISP on GPT-4 and GPT-4o achieved ROUGE scores (46.53, 24.68, 30.77) and BERTscores (87.84, 83.25, 85.46) for (P, R, F1) from the official evaluation on 3,396 clinical case reports from various specialties extracted from open journals. The high BERTscore indicates that the model produced semantically equivalent output summaries compared to the references, even though the overlap at the exact lexicon level is lower, as reflected in the lower ROUGE scores. This work sheds some light on how perspective-aware ISP (PA-ISP) can be deployed for clinical report summarisation and support better communication between patients and clinicians.
Abstract（参考訳）: 患者と臨床医の効果的なコミュニケーションは意思決定の共有において重要な役割を担っている。しかし、臨床報告は長く、臨床用語で満たされることが多いため、ドメインの専門家が文書の重要な側面を効率的に特定することは困難である。本稿では,MultiClinSUM共有タスクにおける臨床事例文書の要約手法について述べる。大規模言語モデル (LLM) 上で反復的自己プロンピング手法を用いて, LLM にタスク固有のプロンプトを生成し,サンプルベースによる少数ショット学習によってそれらを洗練させる。さらに,語彙および埋め込み空間の指標であるROUGEとBERTスコアを用いて,エポックを用いたモデル微調整を指導した。 GPT-4 および GPT-4o 上での視点対応ISP を用いた提案は,オープンジャーナルから抽出した各種専門分野の3,396 件の臨床報告から,ROUGE スコア (46.53, 24.68, 30.77) とBERTスコア (87.84, 83.25, 85.46) を (P, R, F1) に対して公式評価した。高いBERTスコアは、下のROUGEスコアに反映されるように、正確なレキシコンレベルの重なり合いが低いにもかかわらず、モデルが参照よりも意味的に等価な出力サマリーを生成することを示している。この研究は、臨床報告の要約のために、パースペクティブ・アウェア・ISP(PA-ISP)をどのように展開し、患者と臨床医とのコミュニケーションを改善するかに光を当てている。

論文の概要: MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs

関連論文リスト