Fugu-MT 論文翻訳(概要): SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

論文の概要: SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

arxiv url: http://arxiv.org/abs/2509.03487v1
Date: Wed, 03 Sep 2025 17:13:56 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 21:40:46.605802
Title: SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models
Title（参考訳）: SafeProtein: タンパク質基盤モデルのレッドチームフレームワークとベンチマーク
Authors: Jigang Fan, Zhenghong Zhou, Ruofan Jin, Le Cong, Mengdi Wang, Zaixi Zhang,
Abstract要約: 本稿では,タンパク質基盤モデル用に設計された最初のレッドチームフレームワークであるSafeProteinを紹介する。 SafeProteinはマルチモーダルプロンプトエンジニアリングを組み合わせ、ビームサーチを生成して、レッドチーム方式を体系的に設計する。また、手動で構築したレッドチームベンチマークデータセットと包括的な評価プロトコルを含むSafeProtein-Benchをキュレートした。
参考スコア（独自算出の注目度）: 48.33774972147132
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Proteins play crucial roles in almost all biological processes. The advancement of deep learning has greatly accelerated the development of protein foundation models, leading to significant successes in protein understanding and design. However, the lack of systematic red-teaming for these models has raised serious concerns about their potential misuse, such as generating proteins with biological safety risks. This paper introduces SafeProtein, the first red-teaming framework designed for protein foundation models to the best of our knowledge. SafeProtein combines multimodal prompt engineering and heuristic beam search to systematically design red-teaming methods and conduct tests on protein foundation models. We also curated SafeProtein-Bench, which includes a manually constructed red-teaming benchmark dataset and a comprehensive evaluation protocol. SafeProtein achieved continuous jailbreaks on state-of-the-art protein foundation models (up to 70% attack success rate for ESM3), revealing potential biological safety risks in current protein foundation models and providing insights for the development of robust security protection technologies for frontier models. The codes will be made publicly available at https://github.com/jigang-fan/SafeProtein.
Abstract（参考訳）: タンパク質は、ほとんどすべての生物学的過程において重要な役割を担っている。深層学習の進歩はタンパク質基盤モデルの開発を大いに加速させ、タンパク質の理解と設計に大きな成功を収めた。しかしながら、これらのモデルに系統的なリピートが欠如していることは、生物学的安全性のリスクのあるタンパク質の生成など、それらの潜在的な誤用に対する深刻な懸念を引き起こしている。本稿では,タンパク質基盤モデルのための最初のレッドチームフレームワークであるSafeProteinを紹介する。 SafeProteinはマルチモーダル・プロンプト・エンジニアリングとヒューリスティック・ビーム・サーチを組み合わせて、リピート法を体系的に設計し、タンパク質基盤モデルの試験を行う。また、手動で構築したレッドチームベンチマークデータセットと包括的な評価プロトコルを含むSafeProtein-Benchをキュレートした。 SafeProteinは、最先端のタンパク質基盤モデル(ESM3の攻撃成功率最大70%)に対する継続的なジェイルブレイクを達成し、現在のタンパク質基盤モデルにおける潜在的な生物学的安全性リスクを明らかにし、フロンティアモデルのための堅牢なセキュリティ保護技術開発のための洞察を提供する。コードはhttps://github.com/jigang-fan/SafeProtein.comで公開される。

論文の概要: SafeProtein: Red-Teaming Framework and Benchmark for Protein Foundation Models

関連論文リスト