Fugu-MT 論文翻訳(概要): BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning

論文の概要: BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning

arxiv url: http://arxiv.org/abs/2510.01428v1
Date: Wed, 01 Oct 2025 20:07:36 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.853081
Title: BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning
Title（参考訳）: BioVERSE:マルチモーダル推論のためのLCMへのバイオメディカルモダリティの表現アライメント
Authors: Ching-Huei Tsou, Michal Ozery-Flato, Ella Barkan, Diwakar Mahajan, Ben Shapira,
Abstract要約: 本稿では,事前学習したBioFMをモダリティエンコーダとして適応する2段階のアプローチであるBIOVERSEを提案する。このアプローチはまず、各モダリティを共有 LLM 空間に整列する。次に、マルチモーダルデータによる標準的な命令チューニングを適用して、下流の推論のためにそれらをまとめる。
参考スコア（独自算出の注目度）: 0.36855563110245826
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Recent advances in large language models (LLMs) and biomedical foundation models (BioFMs) have achieved strong results in biological text reasoning, molecular modeling, and single-cell analysis, yet they remain siloed in disjoint embedding spaces, limiting cross-modal reasoning. We present BIOVERSE (Biomedical Vector Embedding Realignment for Semantic Engagement), a two-stage approach that adapts pretrained BioFMs as modality encoders and aligns them with LLMs through lightweight, modality-specific projection layers. The approach first aligns each modality to a shared LLM space through independently trained projections, allowing them to interoperate naturally, and then applies standard instruction tuning with multi-modal data to bring them together for downstream reasoning. By unifying raw biomedical data with knowledge embedded in LLMs, the approach enables zero-shot annotation, cross-modal question answering, and interactive, explainable dialogue. Across tasks spanning cell-type annotation, molecular description, and protein function reasoning, compact BIOVERSE configurations surpass larger LLM baselines while enabling richer, generative outputs than existing BioFMs, establishing a foundation for principled multi-modal biomedical reasoning.
Abstract（参考訳）: 大規模言語モデル (LLMs) やバイオメディカル基礎モデル (BioFMs) の最近の進歩は、生物学的テキスト推論、分子モデリング、単細胞解析において大きな成果を上げている。本稿では,BioVERSE (Biomedical Vector Embedding Realignment for Semantic Engagement) について述べる。このアプローチはまず、独立に訓練されたプロジェクションを通じて、各モードを共有LLM空間に整列させ、自然に相互運用できるようにし、その後、マルチモーダルデータによる標準命令チューニングを適用して、下流の推論のためにそれらをまとめる。 LLMに埋め込まれた知識で生のバイオメディカルデータを統一することにより、ゼロショットアノテーション、モーダルな質問応答、インタラクティブで説明可能な対話を可能にする。細胞型アノテーション、分子記述、タンパク質機能推論にまたがるタスク、コンパクトなBIOVERSE構成は、既存のBioFMよりもリッチで生成的なアウトプットを実現し、原則化されたマルチモーダルなバイオメディカル推論の基礎を確立した。

論文の概要: BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning

関連論文リスト