Fugu-MT 論文翻訳(概要): Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models

論文の概要: Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models

arxiv url: http://arxiv.org/abs/2510.00046v1
Date: Sat, 27 Sep 2025 12:29:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.133929
Title: Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models
Title（参考訳）: テキスト・ツー・イメージモデルのための強化学習型プロンプトテンプレートステアリング
Authors: Xiaotian Zou,
Abstract要約: 少数のサンプル画像のみからテンプレートを復元する強化学習フレームワークであるLStealerを提案する。 RLStealerは最先端のパフォーマンスを得ると同時に、攻撃コストを既存のベースラインに必要な13%以下に削減する。我々の研究は、即時取引に固有の緊急のセキュリティ脅威を強調し、保護基準の策定の基礎となる。
参考スコア（独自算出の注目度）: 0.913755431537592
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multimodal Large Language Models (MLLMs) have transformed text-to-image workflows, allowing designers to create novel visual concepts with unprecedented speed. This progress has given rise to a thriving prompt trading market, where curated prompts that induce trademark styles are bought and sold. Although commercially attractive, prompt trading also introduces a largely unexamined security risk: the prompts themselves can be stolen. In this paper, we expose this vulnerability and present RLStealer, a reinforcement learning based prompt inversion framework that recovers its template from only a small set of example images. RLStealer treats template stealing as a sequential decision making problem and employs multiple similarity based feedback signals as reward functions to effectively explore the prompt space. Comprehensive experiments on publicly available benchmarks demonstrate that RLStealer gets state-of-the-art performance while reducing the total attack cost to under 13% of that required by existing baselines. Our further analysis confirms that RLStealer can effectively generalize across different image styles to efficiently steal unseen prompt templates. Our study highlights an urgent security threat inherent in prompt trading and lays the groundwork for developing protective standards in the emerging MLLMs marketplace.
Abstract（参考訳）: MLLM(Multimodal Large Language Models)は、テキストから画像へのワークフローを変換し、デザイナが前例のないスピードで新しい視覚概念を作成できるようにする。この進展は、商標スタイルを誘導するキュレートされたプロンプトが売買され、繁栄する商取引市場を生み出した。商業的に魅力的だが、プロンプト取引は、ほとんど検討されていないセキュリティリスクも生み出す:プロンプト自体を盗むことができる。本稿では、この脆弱性を公開し、少数のサンプル画像からのみテンプレートを復元する強化学習ベースのプロンプトインバージョンフレームワークであるLStealerを提示する。 RLStealerはテンプレート盗難をシーケンシャルな意思決定問題として扱い、複数の類似性に基づくフィードバック信号を報酬関数として使い、プロンプト空間を効果的に探索する。公開されているベンチマークに関する総合的な実験は、RSStealerが最先端のパフォーマンスを得る一方で、既存のベースラインに必要な攻撃コストの13%以下に削減されていることを示している。さらなる解析により,RTStealerは画像スタイルの異なる領域を効果的に一般化し,目に見えないプロンプトテンプレートを効率的に盗むことができることを確認した。本研究は、即時取引に固有の緊急セキュリティ脅威を強調し、新興MLLMマーケットプレースにおける保護基準策定の基礎となる。

論文の概要: Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models

関連論文リスト