Fugu-MT 論文翻訳(概要): PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design

論文の概要: PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design

arxiv url: http://arxiv.org/abs/2506.11420v1
Date: Fri, 13 Jun 2025 02:39:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-16 17:50:49.634282
Title: PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design
Title（参考訳）: PPDiff:タンパク質-タンパク質複合体設計のためのハイブリッド配列構造空間の拡散
Authors: Zhenqiao Song, Tiaoxiao Li, Lei Li, Martin Renqiang Min,
Abstract要約: PPDiffは、任意のタンパク質標的に対するバインダーの配列と構造を共同で設計する拡散モデルである。このモデルは、一般的なタンパク質-タンパク質複合体データセットであるPPBenchで訓練されている。
参考スコア（独自算出の注目度）: 15.80665825271378
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Designing protein-binding proteins with high affinity is critical in biomedical research and biotechnology. Despite recent advancements targeting specific proteins, the ability to create high-affinity binders for arbitrary protein targets on demand, without extensive rounds of wet-lab testing, remains a significant challenge. Here, we introduce PPDiff, a diffusion model to jointly design the sequence and structure of binders for arbitrary protein targets in a non-autoregressive manner. PPDiffbuilds upon our developed Sequence Structure Interleaving Network with Causal attention layers (SSINC), which integrates interleaved self-attention layers to capture global amino acid correlations, k-nearest neighbor (kNN) equivariant graph layers to model local interactions in three-dimensional (3D) space, and causal attention layers to simplify the intricate interdependencies within the protein sequence. To assess PPDiff, we curate PPBench, a general protein-protein complex dataset comprising 706,360 complexes from the Protein Data Bank (PDB). The model is pretrained on PPBenchand finetuned on two real-world applications: target-protein mini-binder complex design and antigen-antibody complex design. PPDiffconsistently surpasses baseline methods, achieving success rates of 50.00%, 23.16%, and 16.89% for the pretraining task and the two downstream applications, respectively.
Abstract（参考訳）: 高い親和性を持つタンパク質結合タンパク質の設計は、生物医学研究やバイオテクノロジーにおいて重要である。特定のタンパク質を標的とする最近の進歩にもかかわらず、需要に応じて任意のタンパク質標的に対して高い親和性バインダーを作成できる能力は、湿式試験の広範なラウンドなしでは、依然として大きな課題である。本稿では,非自己回帰的手法で任意のタンパク質標的に対するバインダーの配列と構造を共同設計する拡散モデルであるPDiffを紹介する。 PPDiffbuilds on our developed Sequence Structure Interleaving Network with Causal attention layer (SSINC) which integrates interleaved self-attention layers to capture global amino acid correlations, k-nearest neighbor (kNN) equivariant graph layer to model local interaction in three-dimensional (3D) space, and causal attention layer to simple the intricate interdeendency in the protein sequence。 PPDiffを評価するため,タンパク質データバンク (PDB) の706,360複合体からなる一般タンパク質-タンパク質複合体データセットPPBenchをキュレートした。このモデルは、ターゲットタンパク質のミニバインダー複合体設計と抗原抗体複合体設計という、2つの現実世界の応用に微調整されたPPBenchand上で事前訓練されている。 PPDiffはベースラインの手法をはるかに超え、それぞれ50.00%、23.16%、および16.89%の成功率を達成した。

関連論文リスト

PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs [80.08310253195144]
PRINGは、タンパク質とタンパク質の相互作用予測をグラフレベルで評価する最初のベンチマークである。 PRINGは、21,484タンパク質と186,818の相互作用からなる高品質な多種PPIネットワークデータセットをキュレートする。
論文参考訳（メタデータ） (2025-07-07T15:21:05Z)
Beyond Simple Concatenation: Fairly Assessing PLM Architectures for Multi-Chain Protein-Protein Interactions Prediction [0.2509487459755192]
タンパク質とタンパク質の相互作用 (PPIs) は、多くの細胞プロセスの基礎である。 PLMはタンパク質の構造と機能を予測するのに顕著な成功を収めた。シークエンスベースのPPI結合親和性予測への応用は、いまだに未検討である。
論文参考訳（メタデータ） (2025-05-26T14:23:08Z)
ProteinWeaver: A Divide-and-Assembly Approach for Protein Backbone Design [61.19456204667385]
本稿では,タンパク質のバックボーン設計のための2段階フレームワークであるProteinWeaverを紹介する。プロテインウィーバーは、多用途ドメインアセンブリを通じて高品質で新規なタンパク質のバックボーンを生成する。分割組立パラダイムを導入することにより、タンパク質工学を進歩させ、機能的タンパク質設計のための新たな道を開く。
論文参考訳（メタデータ） (2024-11-08T08:10:49Z)
SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
タンパク質基盤モデルのための新しい事前学習戦略を導入する。アミノ酸残基間の相互作用を強調し、短距離および長距離の共進化的特徴の抽出を強化する。大規模タンパク質配列データセットを用いて学習し,より優れた一般化能力を示す。
論文参考訳（メタデータ） (2024-10-31T15:22:03Z)
PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction [63.50967073653953]
化合物-タンパク質相互作用予測は、合理的な薬物発見のための化合物-タンパク質相互作用のパターンと強度を予測することを目的としている。既存のディープラーニングベースの手法では、タンパク質配列や構造が単一のモダリティしか利用していない。 CPI予測のためのマルチスケールタンパク質配列構造コントラストフレームワークを提案する。
論文参考訳（メタデータ） (2024-02-13T03:51:10Z)
Effective Protein-Protein Interaction Exploration with PPIretrieval [46.07027715907749]
PPIretrievalはタンパク質とタンパク質の相互作用を探索する最初の深層学習モデルである。 PPIretrievalは、埋め込み空間における潜在的なPPIを探し、タンパク質表面の豊富な幾何学的および化学的情報を収集する。
論文参考訳（メタデータ） (2024-02-06T03:57:06Z)
A Hierarchical Training Paradigm for Antibody Structure-sequence Co-design [54.30457372514873]
抗体配列構造共設計のための階層的訓練パラダイム(HTP)を提案する。 HTPは4段階の訓練段階から構成され、それぞれが特定のタンパク質のモダリティに対応する。実証実験により、HTPは共同設計問題において新しい最先端性能を設定できることが示されている。
論文参考訳（メタデータ） (2023-10-30T02:39:15Z)
Functional Geometry Guided Protein Sequence and Backbone Structure Co-Design [12.585697288315846]
本稿では,自動検出機能部位に基づくタンパク質配列と構造を共同設計するモデルを提案する。 NAEProは、全シーケンスでグローバルな相関を捉えることができる、注目層と同変層のインターリービングネットワークによって駆動される。実験結果から,本モデルは全競技種の中で,最高アミノ酸回収率,TMスコア,最低RMSDを実現していることがわかった。
論文参考訳（メタデータ） (2023-10-06T16:08:41Z)
Joint Design of Protein Sequence and Structure based on Motifs [11.731131799546489]
タンパク質のバックボーン構造と配列を共同で設計するGeoProを提案する。 GeoProは3次元(3D)バックボーン構造のための同変エンコーダと3次元幾何学でガイドされるタンパク質配列デコーダによって駆動される。本手法はタンパク質データバンク(PDB)やUniProtに存在しない新規な$beta$-lactamasesおよびミオグロビンを発見する。
論文参考訳（メタデータ） (2023-10-04T03:07:03Z)
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model [68.28309982199902]
タンパク質-リガンド複合体構造を直接予測できる計算手法であるNeuralPLexerを提案する。我々の研究は、データ駆動型アプローチがタンパク質と小分子の構造的協調性を捉え、酵素や薬物分子などの設計を加速させる可能性を示唆している。
論文参考訳（メタデータ） (2022-09-30T01:46:38Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。