Fugu-MT 論文翻訳(概要): Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization

論文の概要: Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization

arxiv url: http://arxiv.org/abs/2603.26328v1
Date: Fri, 27 Mar 2026 11:46:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-30 21:49:48.476948
Title: Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization
Title（参考訳）: 境界認識型プロンプト最適化によるテキスト・画像モデルの検証
Authors: Zidong Zhao, Yihao Huang, Qing Guo, Tianlin Li, Anran Li, Kailong Wang, Jin Song Dong, Geguang Pu,
Abstract要約: 公式モデルを使用したという偽の主張は、ユーザーを誤解させ、モデル所有者の評判を傷つける可能性がある。既存のメソッドは、オフィシャルモデルオーナによって生成された検証プロンプトを使用して、この問題に対処する。本稿では,境界認識型Prompt Optimizationと呼ばれる参照不要なT2Iモデル検証手法を提案する。
参考スコア（独自算出の注目度）: 29.365081306586237
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As Text-to-Image (T2I) generation becomes widespread, third-party platforms increasingly integrate multiple model APIs for convenient image creation. However, false claims of using official models can mislead users and harm model owners' reputations, making model verification essential to confirm whether an API's underlying model matches its claim. Existing methods address this by using verification prompts generated by official model owners, but the generation relies on multiple reference models for optimization, leading to high computational cost and sensitivity to model selection. To address this problem, we propose a reference-free T2I model verification method called Boundary-aware Prompt Optimization (BPO). It directly explores the intrinsic characteristics of the target model. The key insight is that although different T2I models produce similar outputs for normal prompts, their semantic boundaries in the embedding space (transition zones between two concepts such as "corgi" and "bagel") are distinct. Prompts near these boundaries generate unstable outputs (e.g., sometimes a corgi and sometimes a bagel) on the target model but remain stable on other models. By identifying such boundary-adjacent prompts, BPO captures model-specific behaviors that serve as reliable verification cues for distinguishing T2I models. Experiments on five T2I models and four baselines demonstrate that BPO achieves superior verification accuracy.
Abstract（参考訳）: テキスト・ツー・イメージ(T2I)生成が普及するにつれて、サードパーティプラットフォームはより便利な画像作成のために複数のモデルAPIを統合するようになっている。しかしながら、公式モデルを使用するという誤った主張は、ユーザを誤解させ、モデル所有者の評判を害する可能性がある。既存の手法では、公式モデル所有者が生成した検証プロンプトを使用してこの問題に対処するが、生成は最適化のために複数の参照モデルに依存しており、高い計算コストとモデル選択に対する感度をもたらす。この問題に対処するため,BPO(Boundary-aware Prompt Optimization)と呼ばれる参照不要なT2Iモデル検証手法を提案する。対象モデルの本質的な特性を直接探索する。重要な洞察は、異なるT2Iモデルは通常のプロンプトに対して同様の出力を生成するが、埋め込み空間におけるそれらの意味的境界("corgi" や "bagel" のような2つの概念間の遷移ゾーン)は異なることである。これらの境界付近の確率は、ターゲットモデル上で不安定な出力(例えば、コーギーやベーグル)を生成するが、他のモデルでは安定である。このような境界に隣接したプロンプトを識別することで、BPOはT2Iモデルを識別するための信頼性の高い検証手段として機能するモデル固有の振る舞いをキャプチャする。 5つのT2Iモデルと4つのベースラインの実験は、BPOがより優れた検証精度を達成することを示す。

論文の概要: Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization

関連論文リスト