Fugu-MT 論文翻訳(概要): CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models

論文の概要: CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models

arxiv url: http://arxiv.org/abs/2508.20640v1
Date: Thu, 28 Aug 2025 10:38:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-29 18:12:02.338735
Title: CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models
Title（参考訳）: CraftGraffiti: 顔保存拡散モデルによるカスタムグラフィティアートによるヒューマンアイデンティティの探索
Authors: Ayan Banerjee, Fernando Vilariño, Josep Lladós,
Abstract要約: 極端なスタイリスティック・トランスフォーメーションの下で顔のアイデンティティを保持することは、生成芸術において大きな課題である。本稿では,顔の特徴保存を主目的としたエンドツーエンドのテキストガイドグラフィティ生成フレームワークCraftGraffitiを紹介する。 CraftGraffitiはまず、LoRAで調整された事前訓練された拡散トランスフォーマーを介してグラフィティスタイルのトランスファーを適用し、その後、顔に一貫性のある自己認識機構を通じてアイデンティティの忠実さを強制する。
参考スコア（独自算出の注目度）: 43.96017763034248
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Preserving facial identity under extreme stylistic transformation remains a major challenge in generative art. In graffiti, a high-contrast, abstract medium, subtle distortions to the eyes, nose, or mouth can erase the subject's recognizability, undermining both personal and cultural authenticity. We present CraftGraffiti, an end-to-end text-guided graffiti generation framework designed with facial feature preservation as a primary objective. Given an input image and a style and pose descriptive prompt, CraftGraffiti first applies graffiti style transfer via LoRA-fine-tuned pretrained diffusion transformer, then enforces identity fidelity through a face-consistent self-attention mechanism that augments attention layers with explicit identity embeddings. Pose customization is achieved without keypoints, using CLIP-guided prompt extension to enable dynamic re-posing while retaining facial coherence. We formally justify and empirically validate the "style-first, identity-after" paradigm, showing it reduces attribute drift compared to the reverse order. Quantitative results demonstrate competitive facial feature consistency and state-of-the-art aesthetic and human preference scores, while qualitative analyses and a live deployment at the Cruilla Festival highlight the system's real-world creative impact. CraftGraffiti advances the goal of identity-respectful AI-assisted artistry, offering a principled approach for blending stylistic freedom with recognizability in creative AI applications.
Abstract（参考訳）: 極端なスタイリスティック・トランスフォーメーションの下で顔のアイデンティティを保持することは、生成芸術において大きな課題である。落書きでは、高コントラストで抽象的な媒体であり、目、鼻、口への微妙な歪みは、被験者の認識性を消し、個人的および文化的信頼を損なう。本稿では,顔の特徴保存を主目的としたエンドツーエンドのテキストガイドグラフィティ生成フレームワークCraftGraffitiを紹介する。入力画像とスタイルと説明的なプロンプトを与えられたCraftGraffitiは、まずLoRAで調整された事前訓練された拡散トランスフォーマーを介してグラフィティスタイルのトランスファーを適用し、次に、明示的なアイデンティティ埋め込みで注意層を拡大する顔一貫性の自己保持機構を通じてアイデンティティの忠実度を強制する。ポースカスタマイズはキーポイントなしで実現され、CLIP誘導プロンプト拡張を使用して、顔のコヒーレンスを維持しながら動的再考を可能にする。我々は「スタイルファースト、アイデンティティ・アフター」パラダイムを正式に正当化し、実証的に検証し、逆順に比べて属性のドリフトを減少させることを示す。定量的な結果は、競合する顔の特徴の一貫性と最先端の美観と人間の嗜好のスコアを示し、質的な分析とCruilla Festivalでのライブデプロイメントは、システムの現実的な創造的影響を強調している。 CraftGraffitiは、創造的なAIアプリケーションにおいて、スタイリスティックな自由と認識可能性とをブレンドするための原則化されたアプローチを提供する、アイデンティティを尊重するAI支援アーティストの目標を推進している。

論文の概要: CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models

関連論文リスト