Fugu-MT Paper Translation (Abstract): Scalable LLM-based Coding of Dialogue in Healthcare Simulation: Balancing Coding Performance, Processing Time, and Environmental Impact

Paper overview: Scalable LLM-based Coding of Dialogue in Healthcare Simulation: Balancing Coding Performance, Processing Time, and Environmental Impact

arxiv url: http://arxiv.org/abs/2604.23255v1
Date: Sat, 25 Apr 2026 11:31:28 GMT
Status: Translation complete
System update date: 2026-04-28 17:12:07.228956
Title: Scalable LLM-based Coding of Dialogue in Healthcare Simulation: Balancing Coding Performance, Processing Time, and Environmental Impact
Authors: Kiyoshige Garces, Gloria Milena Fernandez-Nieto, Linxuan Zhao, Sachini Samaraweera, Dragan Gasevic, Roberto Martinez-Maldonado, Vanessa Echeverria
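Abstract summary: Analysing dialogue content is important for advancing team learning theory and informing the design of computer-supported collaborative learning environments. This paper investigates how prompt design and batching strategies can be optimised to balance coding accuracy, processing time, and environmental impact in team-based healthcare simulation.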
Reference score (independently computed attention score): 7.255541676420198
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Research shows that dialogue, the interactive process through which participants articulate their thinking, plays a central role in constructing shared understanding, coordinating action, and shaping learning outcomes in teams. Analysing dialogue content has been central to advancing team learning theory and informing the design of computer-supported collaborative learning environments, yet this progress has depended on labour-intensive qualitative coding. LLMs offer new possibilities for automating and enhancing the dialogue layer within emerging multimodal learning analytics approaches, with recent studies showing that they can approximate human coding through few-shot prompting. However, prior work has focused on replicating human coding accuracy for research purposes, rather than addressing a more educationally consequential question: how can we design prompts that allow an LLM to label team dialogue accurately and fast enough to be useful in real settings, such as in-person healthcare simulations, where results must be returned quickly and computational cost and sustainability also matter? This paper investigates how prompt design and batching strategies can be optimised to balance coding accuracy, processing time, and environmental impact in team-based healthcare simulation debriefing. Using a dataset of 11,647 utterances coded across 6 dialogue constructs, we compared 4 prompt designs across varying batch sizes, evaluating coding performance, processing time, and energy consumption, as well as the trade-offs between these metrics. Results indicate that increasing batch size improves speed and reduces energy use, but negatively impacts coding performance. Beyond demonstrating the feasibility of LLM-based qualitative analysis, this study offers practical guidance for scaling dialogue analytics in contexts where timeliness, privacy, and sustainability are critical.
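The batching idea described in the abstract can be illustrated with a minimal sketch: utterances are grouped into fixed-size batches so that each model call labels several utterances at once. This is not the authors' implementation. The function names `make_batches` and `build_prompt`, the prompt wording, the sample utterances, and the construct names are all assumptions made for illustration.

```python
from typing import Iterator, List, Sequence


def make_batches(utterances: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `batch_size` utterances."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(utterances), batch_size):
        yield list(utterances[start:start + batch_size])


def build_prompt(batch: Sequence[str], constructs: Sequence[str]) -> str:
    """Format one batch as a single labelling prompt (hypothetical wording)."""
    numbered = "\n".join(f"{i}. {text}" for i, text in enumerate(batch, start=1))
    return (
        "Label each utterance with one of: " + ", ".join(constructs) + "\n"
        + numbered
    )


# Hypothetical sample data; the construct names are placeholders, not the paper's.
utterances = ["Check the airway.", "I'll get the monitor.", "What is the heart rate?"]
constructs = ["task allocation", "information sharing"]

prompts = [build_prompt(b, constructs) for b in make_batches(utterances, batch_size=2)]
print(len(prompts))  # number of model calls needed at this batch size
```

A larger `batch_size` means fewer model calls for the same dataset, which is the lever the abstract associates with lower processing time and energy use, at the reported cost of coding performance.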

Related papers