Fugu-MT 論文翻訳(概要): $\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models

論文の概要: $\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models

arxiv url: http://arxiv.org/abs/2508.08657v1
Date: Tue, 12 Aug 2025 05:46:47 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-13 21:07:34.318081
Title: $\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models
Title（参考訳）: $\text{M}^{2}$LLM: 大規模言語モデルを用いた多視点分子表現学習
Authors: Jiaxin Ju, Yizhen Zheng, Huan Yee Koh, Can Wang, Shirui Pan,
Abstract要約: 分子構造ビュー,分子タスクビュー,分子規則ビューの3つの視点を統合した多視点フレームワークを提案する。実験によると、$textM2$LLMは、分類タスクと回帰タスクをまたいだ複数のベンチマークで最先端のパフォーマンスを達成する。
参考スコア（独自算出の注目度）: 59.125833618091846
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate molecular property prediction is a critical challenge with wide-ranging applications in chemistry, materials science, and drug discovery. Molecular representation methods, including fingerprints and graph neural networks (GNNs), achieve state-of-the-art results by effectively deriving features from molecular structures. However, these methods often overlook decades of accumulated semantic and contextual knowledge. Recent advancements in large language models (LLMs) demonstrate remarkable reasoning abilities and prior knowledge across scientific domains, leading us to hypothesize that LLMs can generate rich molecular representations when guided to reason in multiple perspectives. To address these gaps, we propose $\text{M}^{2}$LLM, a multi-view framework that integrates three perspectives: the molecular structure view, the molecular task view, and the molecular rules view. These views are fused dynamically to adapt to task requirements, and experiments demonstrate that $\text{M}^{2}$LLM achieves state-of-the-art performance on multiple benchmarks across classification and regression tasks. Moreover, we demonstrate that representation derived from LLM achieves exceptional performance by leveraging two core functionalities: the generation of molecular embeddings through their encoding capabilities and the curation of molecular features through advanced reasoning processes.
Abstract（参考訳）: 正確な分子特性予測は、化学、材料科学、薬物発見における幅広い応用において重要な課題である。指紋やグラフニューラルネットワーク(GNN)を含む分子表現法は、分子構造から効果的に特徴を引き出すことにより、最先端の結果が得られる。しかし、これらの手法は、何十年にもわたって蓄積された意味と文脈の知識を見落としていることが多い。大規模言語モデル (LLMs) の最近の進歩は、科学領域全体にわたる顕著な推論能力と事前知識を示し、複数の視点で理性に導かれるとLLMがリッチな分子表現を生成できるという仮説を導いた。これらのギャップに対処するために、分子構造ビュー、分子タスクビュー、分子規則ビューの3つの視点を統合する多視点フレームワークである$\text{M}^{2}$LLMを提案する。これらのビューは、タスク要求に適応するために動的に融合され、$\text{M}^{2}$LLMが、分類タスクと回帰タスクをまたいだ複数のベンチマークで最先端のパフォーマンスを達成することを示す実験である。さらに, LLMから派生した表現は, 符号化機能による分子埋め込みの生成と, 高度な推論プロセスによる分子特性のキュレーションという, 2つのコア機能を活用することで, 例外的な性能を達成できることを実証した。

論文の概要: $\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models

関連論文リスト