Fugu-MT 論文翻訳(概要): epiGPTope: A machine learning-based epitope generator and classifier

論文の概要: epiGPTope: A machine learning-based epitope generator and classifier

arxiv url: http://arxiv.org/abs/2509.03351v1
Date: Wed, 03 Sep 2025 14:36:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 21:40:46.549612
Title: epiGPTope: A machine learning-based epitope generator and classifier
Title（参考訳）: epiGPTope: 機械学習ベースのエピトープジェネレータと分類器
Authors: Natalia Flechas Manrique, Alberto Martínez, Elena López-Martínez, Luc Andrea, Román Orus, Aitor Manteca, Aitziber L. Cortajarena, Llorenç Espinosa-Portalés,
Abstract要約: エピトープは、抗体または免疫細胞受容体によって認識される短い抗原ペプチド配列である。合成ライブラリの設計は、大規模な配列空間、n個のアミノ酸の線形に対する20n$の組み合わせにより困難であり、スクリーニングとテストは不可能である。線形を微調整し,新しい有理的配列を生成する,大規模言語モデル epiGPTope を提案する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Epitopes are short antigenic peptide sequences which are recognized by antibodies or immune cell receptors. These are central to the development of immunotherapies, vaccines, and diagnostics. However, the rational design of synthetic epitope libraries is challenging due to the large combinatorial sequence space, $20^n$ combinations for linear epitopes of n amino acids, making screening and testing unfeasible, even with high throughput experimental techniques. In this study, we present a large language model, epiGPTope, pre-trained on protein data and specifically fine-tuned on linear epitopes, which for the first time can directly generate novel epitope-like sequences, which are found to possess statistical properties analogous to the ones of known epitopes. This generative approach can be used to prepare libraries of epitope candidate sequences. We further train statistical classifiers to predict whether an epitope sequence is of bacterial or viral origin, thus narrowing the candidate library and increasing the likelihood of identifying specific epitopes. We propose that such combination of generative and predictive models can be of assistance in epitope discovery. The approach uses only primary amino acid sequences of linear epitopes, bypassing the need for a geometric framework or hand-crafted features of the sequences. By developing a method to create biologically feasible sequences, we anticipate faster and more cost-effective generation and screening of synthetic epitopes, with relevant applications in the development of new biotechnologies.
Abstract（参考訳）: エピトープは、抗体または免疫細胞受容体によって認識される短い抗原ペプチド配列である。これらは、免疫療法、ワクチン、診断の開発の中心である。しかし、合成エピトープライブラリーの合理的設計は、大きな組合せ配列空間、nアミノ酸の線形エピトープに対する20^n$の組み合わせにより、高いスループットの実験技術であってもスクリーニングと試験が不可能となるため、困難である。本研究では,タンパク質データに基づいて事前学習され,特に線形エピトープで微調整された,新しいエピトープ様の配列を生成できる大規模言語モデルである epiGPTope を提案する。この生成的アプローチはエピトープ候補配列のライブラリを作成するために使用できる。さらに統計分類器を訓練して、エピトープ配列が細菌またはウイルス起源であるかどうかを予測し、候補ライブラリを狭め、特定のエピトープを同定する可能性を高める。このような生成モデルと予測モデルの組み合わせがエピトープ発見の助けになる可能性が示唆された。このアプローチでは、線形エピトープの一次アミノ酸配列のみを使用し、幾何学的枠組みや手作りの配列の特徴を必要としない。生物学的に実現可能な配列を生成する手法を開発することにより,合成エピトープの生成とスクリーニングがより迅速で費用効率の良いものになることを期待できる。

論文の概要: epiGPTope: A machine learning-based epitope generator and classifier

関連論文リスト