Fugu-MT 論文翻訳(概要): Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

論文の概要: Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

arxiv url: http://arxiv.org/abs/2603.02406v1
Date: Mon, 02 Mar 2026 21:32:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-04 21:38:10.549318
Title: Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles
Title（参考訳）: タンパク質設計とコンフォーマルアンサンブルのための剛性を考慮した幾何学的事前学習
Authors: Zhanghan Ni, Yanjing Li, Zeju Qiu, Bernhard Schölkopf, Hongyu Guo, Weiyang Liu, Shengchao Liu,
Abstract要約: 幾何学的事前学習フレームワークであるtextbfRigidSSL$(textitRigidity-Aware Self-Supervised Learning$)を紹介した。フェーズI(RigidSSL-Perturb)は、AlphaFoldタンパク質構造データベースから432K構造から、シミュレートされた摂動を持つ幾何学的先行を学習する。フェーズII(RigidSSL-MD)は、1.3K分子動力学軌道上のこれらの表現を洗練し、物理的に現実的な遷移を捉える。
参考スコア（独自算出の注目度）: 74.32932832937618
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generative models have recently advanced $\textit{de novo}$ protein design by learning the statistical regularities of natural structures. However, current approaches face three key limitations: (1) Existing methods cannot jointly learn protein geometry and design tasks, where pretraining can be a solution; (2) Current pretraining methods mostly rely on local, non-rigid atomic representations for property prediction downstream tasks, limiting global geometric understanding for protein generation tasks; and (3) Existing approaches have yet to effectively model the rich dynamic and conformational information of protein structures. To overcome these issues, we introduce $\textbf{RigidSSL}$ ($\textit{Rigidity-Aware Self-Supervised Learning}$), a geometric pretraining framework that front-loads geometry learning prior to generative finetuning. Phase I (RigidSSL-Perturb) learns geometric priors from 432K structures from the AlphaFold Protein Structure Database with simulated perturbations. Phase II (RigidSSL-MD) refines these representations on 1.3K molecular dynamics trajectories to capture physically realistic transitions. Underpinning both phases is a bi-directional, rigidity-aware flow matching objective that jointly optimizes translational and rotational dynamics to maximize mutual information between conformations. Empirically, RigidSSL variants improve designability by up to 43\% while enhancing novelty and diversity in unconditional generation. Furthermore, RigidSSL-Perturb improves the success rate by 5.8\% in zero-shot motif scaffolding and RigidSSL-MD captures more biophysically realistic conformational ensembles in G protein-coupled receptor modeling. The code is available at: https://github.com/ZhanghanNi/RigidSSL.git.
Abstract（参考訳）: 生成モデルは、最近、自然構造の統計正則性を学ぶことによって、タンパク質設計を進歩させた。しかし、現状のアプローチでは、(1)事前学習が解となるタンパク質の幾何学と設計タスクを共同で学習することはできない、(2)現在の事前学習手法は、主に特性予測タスクの局所的、非厳密な原子表現に依存しており、タンパク質生成タスクのグローバルな幾何学的理解を制限している、(3)既存のアプローチは、タンパク質構造のリッチな動的およびコンフォメーション情報を効果的にモデル化していない、という3つの大きな制限に直面している。これらの問題を克服するために、生成微調整の前に幾何学学習を前もってロードする幾何学事前学習フレームワークである$\textit{Rigidity-Aware Self-Supervised Learning}$を紹介した。フェーズI(RigidSSL-Perturb)は、AlphaFoldタンパク質構造データベースから432K構造から、シミュレートされた摂動を持つ幾何学的先行を学習する。フェーズII(RigidSSL-MD)は、1.3K分子動力学軌道上のこれらの表現を洗練し、物理的に現実的な遷移を捉える。両相のアンサンピングは、コンフォーメーション間の相互情報を最大化するために、翻訳力学と回転力学を共同で最適化する、双方向で剛性を考慮したフローマッチングの目的である。経験的に、RigidSSLの変種は、新規性と非条件生成の多様性を高めながら、設計性を最大43%向上させる。さらに、RigidSSL-Perturbはゼロショットモチーフの足場における成功率を5.8倍に改善し、RigidSSL-MDはGタンパク質結合受容体モデリングにおけるより生物学的に現実的なコンフォメーションアンサンブルをキャプチャする。コードは、https://github.com/ZhanghanNi/RigidSSL.git.comで入手できる。

論文の概要: Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

関連論文リスト