Fugu-MT 論文翻訳(概要): Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction

論文の概要: Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction

arxiv url: http://arxiv.org/abs/2509.09619v1
Date: Thu, 11 Sep 2025 17:01:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-12 16:52:24.485777
Title: Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction
Title（参考訳）: 機能基は化学的に解釈可能な分子特性予測に必要なすべて
Authors: Roshan Balaji, Joe Bobby, Nirav Pravinbhai Bhatt,
Abstract要約: 本研究は、化学におけるFG(Functional Groups)の概念を用いて分子表現を開発することを提案する。分子の基本的な化学構造に基づいて分子をコードする新しい手法であるFGR(Functional Group Representation)フレームワークを紹介する。我々は、FGRフレームワークが33種類のベンチマークデータセットに対して最先端のパフォーマンスを達成することを実証した。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Molecular property prediction using deep learning (DL) models has accelerated drug and materials discovery, but the resulting DL models often lack interpretability, hindering their adoption by chemists. This work proposes developing molecule representations using the concept of Functional Groups (FG) in chemistry. We introduce the Functional Group Representation (FGR) framework, a novel approach to encoding molecules based on their fundamental chemical substructures. Our method integrates two types of functional groups: those curated from established chemical knowledge (FG), and those mined from a large molecular corpus using sequential pattern mining (MFG). The resulting FGR framework encodes molecules into a lower-dimensional latent space by leveraging pre-training on a large dataset of unlabeled molecules. Furthermore, the proposed framework allows the inclusion of 2D structure-based descriptors of molecules. We demonstrate that the FGR framework achieves state-of-the-art performance on a diverse range of 33 benchmark datasets spanning physical chemistry, biophysics, quantum mechanics, biological activity, and pharmacokinetics while enabling chemical interpretability. Crucially, the model's representations are intrinsically aligned with established chemical principles, allowing chemists to directly link predicted properties to specific functional groups and facilitating novel insights into structure-property relationships. Our work presents a significant step toward developing high-performing, chemically interpretable DL models for molecular discovery.
Abstract（参考訳）: 深層学習(DL)モデルを用いた分子特性予測は薬物や物質発見を加速するが、結果として生じるDLモデルは解釈可能性に欠けることが多く、化学者による導入を妨げている。本研究は、化学におけるFG(Functional Groups)の概念を用いて分子表現を開発することを提案する。分子の基本的な化学構造に基づいて分子をコードする新しい手法であるFGR(Functional Group Representation)フレームワークを紹介する。本手法は, 確立された化学知識(FG)から抽出した官能基と, シーケンシャルパターンマイニング(MFG)を用いた大規模分子コーパスから抽出した官能基の2種類を統合した。結果として生じるFGRフレームワークは、未標識分子の大きなデータセットの事前学習を活用することにより、分子を低次元の潜伏空間にエンコードする。さらに、提案フレームワークは分子の2次元構造に基づく記述子を組み込むことができる。本研究では, 物理化学, 生物物理学, 量子力学, 生物活性, 薬物動態学を対象とし, 化学解釈性を実現しつつ, 様々な33のベンチマークデータセット上で, 最先端の性能を達成できることを実証した。重要なことに、モデルの表現は確立された化学原理と本質的に一致しており、化学者は予測された性質を特定の官能基に直接リンクさせ、構造的優越関係に関する新しい洞察を促進することができる。本研究は,分子発見のための高性能かつ化学的に解釈可能なDLモデルの開発に向けた重要な一歩である。

論文の概要: Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction

関連論文リスト