Fugu-MT 論文翻訳(概要): OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

論文の概要: OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

arxiv url: http://arxiv.org/abs/2510.02410v1
Date: Thu, 02 Oct 2025 09:58:23 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-06 16:35:52.092301
Title: OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data
Title（参考訳）: OpenTSLM:多変量医療用テキストと時系列データを用いた時系列言語モデル
Authors: Patrick Langer, Thomas Kaar, Max Rosenblattl, Maxwell A. Xu, Winnie Chow, Martin Maritsch, Aradhana Verma, Brian Han, Daniel Seung Kim, Henry Chubb, Scott Ceresnak, Aydin Zahedivash, Alexander Tarlochan Singh Sandhu, Fatima Rodriguez, Daniel McDuff, Elgar Fleisch, Oliver Aalami, Filipe Barata, Paul Schmiedmayer,
Abstract要約: 時系列言語モデル(TSLM)のファミリであるOpenTSLMについて述べる。 OpenTSLMは学習可能な時系列トークンとテキストトークンをソフトプロンプトで結合することで、暗黙的に時系列をモデル化する。本稿では,HAR-CoT,Sleep-CoT,ECG-QA-CoTの3つのデータセットを紹介する。 OpenTSLMのベースラインは69.9 F1、HARは65.4に達し、微調整されたテキストのみのモデルでは9.05と52.2であった。
参考スコア（独自算出の注目度）: 35.931918163980605
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLMs have emerged as powerful tools for interpreting multimodal data. In medicine, they hold particular promise for synthesizing large volumes of clinical information into actionable insights and digital health applications. Yet, a major limitation remains their inability to handle time series. To overcome this gap, we present OpenTSLM, a family of Time Series Language Models (TSLMs) created by integrating time series as a native modality to pretrained LLMs, enabling reasoning over multiple time series of any length. We investigate two architectures for OpenTSLM. The first, OpenTSLM-SoftPrompt, models time series implicitly by concatenating learnable time series tokens with text tokens via soft prompting. Although parameter-efficient, we hypothesize that explicit time series modeling scales better and outperforms implicit approaches. We thus introduce OpenTSLM-Flamingo, which integrates time series with text via cross-attention. We benchmark both variants against baselines that treat time series as text tokens or plots, across a suite of text-time-series Chain-of-Thought (CoT) reasoning tasks. We introduce three datasets: HAR-CoT, Sleep-CoT, and ECG-QA-CoT. Across all, OpenTSLM models outperform baselines, reaching 69.9 F1 in sleep staging and 65.4 in HAR, compared to 9.05 and 52.2 for finetuned text-only models. Notably, even 1B-parameter OpenTSLM models surpass GPT-4o (15.47 and 2.95). OpenTSLM-Flamingo matches OpenTSLM-SoftPrompt in performance and outperforms on longer sequences, while maintaining stable memory requirements. By contrast, SoftPrompt grows exponentially in memory with sequence length, requiring around 110 GB compared to 40 GB VRAM when training on ECG-QA with LLaMA-3B. Expert reviews by clinicians find strong reasoning capabilities exhibited by OpenTSLMs on ECG-QA. To facilitate further research, we provide all code, datasets, and models open-source.
Abstract（参考訳）: LLMはマルチモーダルデータを解釈するための強力なツールとして登場した。医学では、大量の臨床情報を実用的な洞察とデジタルヘルスの応用に合成する、と特に約束している。しかし、大きな制限は、時系列を扱うことができないままである。このギャップを克服するために, 時系列言語モデル (TSLM) のファミリーである OpenTSLM を提案する。 OpenTSLMの2つのアーキテクチャについて検討する。最初のOpenTSLM-SoftPromptは、学習可能な時系列トークンとテキストトークンをソフトプロンプトで結合することで、暗黙的に時系列をモデル化する。パラメータ効率は高いが、明示的な時系列モデリングはより良くスケールし、暗黙的なアプローチより優れていると仮定する。そこで我々は,時系列とテキストをクロスアテンションで統合するOpenTSLM-Flamingoを紹介した。時系列をテキストトークンやプロットとして扱うベースラインに対して、テキスト時系列のChain-of-Thought(CoT)推論タスクをベンチマークします。本稿では,HAR-CoT,Sleep-CoT,ECG-QA-CoTの3つのデータセットを紹介する。全般的にOpenTSLMモデルはベースラインを上回り、スリープステージでは69.9 F1、HARでは65.4に達し、微調整されたテキストのみのモデルでは9.05と52.2だった。特に、1BパラメーターのOpenTSLMモデルでさえGPT-4o (15.47と2.95)を上回っている。 OpenTSLM-FlamingoはOpenTSLM-SoftPromptと性能が良く、より長いシーケンスで性能が良く、安定したメモリ要件を維持している。対照的にSoftPromptは、LLaMA-3BでECG-QAのトレーニングを行う際には、40GBのVRAMに比べて110GB程度のメモリで指数関数的に成長する。臨床医による専門家によるレビューでは、ECG-QA上でOpenTSLMsが示す強力な推論能力が確認されている。さらなる研究を容易にするため、オープンソースですべてのコード、データセット、モデルを提供しています。

論文の概要: OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

関連論文リスト