Fugu-MT Paper Translation (Abstract): Seizure-Semiology-Suite (S3): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding

Paper summary: Seizure-Semiology-Suite (S3): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding

arxiv url: http://arxiv.org/abs/2605.21852v1
Date: Thu, 21 May 2026 00:57:39 GMT
Status: Translation complete
Last updated in system: 2026-05-22 16:35:42.041884
Title: Seizure-Semiology-Suite (S3): A Clinically Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding
Title (reference translation): Seizure-Semiology-Suite (S3): A Clinical Multimodal Dataset, Benchmark, and Models for Seizure Semiology Understanding
Authors: Lina Zhang, Tonmoy Monsoor, Peizheng Li, Jiarui Cui, Xinyi Peng, Chong Han, Prateik Sinha, Siyuan Dai, Jessica Nichole Pasqua, Colin M McCrimmon, Weiting Liu, Hailey Marie Miranda, Bing Hu, Xiangting Wu, Tengyou Xu, Chunhan Li, Jiaye Tian, Jiarui Tang, Detao Ma, Lingye Kong, Junnan Lyu, Jungang Li, Yan Zan, Junhua Huang, Rajarshi Mazumder, Vwani Roychowdhury
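Abstract summary: Seizure-Semiology-Suite is a clinically grounded dataset for fine-grained, structured seizure semiology understanding. The dataset contains 438 seizure videos annotated with over 35,000 dense labels covering 20 ILAE-defined semiological features. The paper proposes a seven-task hierarchical benchmark that systematically evaluates MLLMs from low-level visual perception to temporal sequencing, narrative report generation, and seizure diagnosis.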
Reference score (independently computed attention score): 6.3004976146416025
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While Multimodal Large Language Models (MLLMs) have demonstrated remarkable proficiency in general video understanding, their capacity to interpret involuntary, and spatio-temporally evolving pathologic motor behaviors such as seizure semiology remains largely untested. To address this gap, we introduce Seizure-Semiology-Suite, a clinically grounded dataset and benchmark for fine-grained, structured seizure semiology understanding. The dataset includes 438 seizure videos annotated with over 35,000 dense labels covering 20 ILAE-defined semiological features. Building on this dataset, we propose a seven-task hierarchical benchmark that systematically evaluates MLLMs from low-level visual perception to temporal sequencing, narrative report generation, and seizure diagnosis. To enable clinically meaningful evaluation of generated reports, we further introduce the Report Quality Index for Seizure Semiology (Seizure-RQI). Extensive baselines across 11 open-weight MLLMs reveal systematic weaknesses in laterality reasoning, temporal localization, symptom sequencing, and clinically faithful reporting. We show that seizure-specific fine-tuning substantially improves performance across tasks, and that a two-stage neuro-symbolic framework achieves an F1 score of 0.96 on epileptic versus non-epileptic seizure classification. Seizure-Semiology-Suite establishes a rigorous benchmark for evaluating multimodal models in safety-critical medical video understanding and guides the development of clinically reliable, domain-adaptive multimodal intelligence.
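Abstract (reference translation): Multimodal Large Language Models (MLLMs) have shown remarkable proficiency in general video understanding, but their ability to interpret involuntary, spatio-temporally evolving pathologic motor behaviors such as seizure semiology remains largely untested. To address this gap, we introduce Seizure-Semiology-Suite, a clinically grounded dataset and benchmark for fine-grained, structured seizure semiology understanding. The dataset contains 438 seizure videos annotated with over 35,000 dense labels covering 20 ILAE-defined semiological features. Building on this dataset, we propose a seven-task hierarchical benchmark that systematically evaluates MLLMs from low-level visual perception to temporal sequencing, narrative report generation, and seizure diagnosis. To evaluate the clinical meaningfulness of generated reports, we also introduce the Report Quality Index for Seizure Semiology (Seizure-RQI). Extensive baselines across 11 open-weight MLLMs reveal systematic weaknesses in laterality reasoning, temporal localization, symptom sequencing, and clinically faithful reporting. Seizure-specific fine-tuning substantially improves performance across tasks, and a two-stage neuro-symbolic framework achieves an F1 score of 0.96 on epileptic versus non-epileptic seizure classification. Seizure-Semiology-Suite establishes a rigorous benchmark for evaluating multimodal models in safety-critical medical video understanding and guides the development of clinically reliable, domain-adaptive multimodal intelligence.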

Related papers