Fugu-MT 論文翻訳(概要): FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation

論文の概要: FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation

arxiv url: http://arxiv.org/abs/2603.22054v2
Date: Sat, 28 Mar 2026 19:37:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 13:48:18.783956
Title: FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
Title（参考訳）: FontCrafter: ビジュアルインコンテキスト生成による高忠実な要素駆動型アートフォント作成
Authors: Wuyang Luo, Chengkai Tan, Chang Ge, Binye Hong, Su Yang, Yongjiu Ma,
Abstract要約: アーティスティックフォント生成は、参照スタイルに基づいてスタイリングされたグリフを合成することを目的としている。既存のアプローチは、限られたスタイルの多様性と粗いコントロールに悩まされている。フォント作成のための要素駆動フレームワークであるFontCrafterを紹介する。
参考スコア（独自算出の注目度）: 3.113651550552676
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Artistic font generation aims to synthesize stylized glyphs based on a reference style. However, existing approaches suffer from limited style diversity and coarse control. In this work, we explore the potential of element-driven artistic font generation. Elements are the fundamental visual units of a font, serving as reference images for the desired style. Conceptually, we categorize elements into object elements (e.g., flowers or stones) with distinct structures and amorphous elements (e.g., flames or clouds) with unstructured textures. We introduce FontCrafter, an element-driven framework for font creation, and construct a large-scale dataset, ElementFont, which contains diverse element types and high-quality glyph images. However, achieving high-fidelity reconstruction of both texture and structure of reference elements remains challenging. To address this, we propose an in-context generation strategy that treats element images as visual context and uses an inpainting model to transfer element styles into glyph regions at the pixel level. To further control glyph shapes, we design a lightweight Context-aware Mask Adapter (CMA) that injects shape information. Moreover, a training-free attention redirection mechanism enables region-aware style control and suppresses stroke hallucination. In addition, edge repainting is applied to make boundaries more natural. Extensive experiments demonstrate that FontCrafter achieves strong zero-shot generation performance, particularly in preserving structural and textural fidelity, while also supporting flexible controls such as style mixture.
Abstract（参考訳）: アーティスティックフォント生成は、参照スタイルに基づいてスタイリングされたグリフを合成することを目的としている。しかし、既存のアプローチは、限られたスタイルの多様性と粗い制御に悩まされている。本研究では,要素駆動の芸術的フォント生成の可能性を探る。要素はフォントの基本視覚単位であり、所望のスタイルの参照画像として機能する。概念的には、要素を異なる構造と非晶質要素(例えば、炎や雲)と非構造的なテクスチャを持つ対象要素(例えば、花や石)に分類する。フォント作成のための要素駆動フレームワークであるFontCrafterを導入し,多様な要素タイプと高品質なグリフ画像を含む大規模データセットElementFontを構築した。しかし, 基準要素の構造とテクスチャの高忠実化を実現することは依然として困難である。そこで本研究では,要素画像を視覚的コンテキストとして扱うインコンテキスト生成手法を提案する。さらにグリフ形状を制御するために、形状情報を注入する軽量なコンテキスト対応マスクアダプタ(CMA)を設計する。さらに、訓練不要の注意リダイレクト機構により、領域認識スタイルの制御が可能となり、脳卒中幻覚を抑える。さらに、境界をより自然にするためにエッジ再塗装が適用される。大規模な実験により、FontCrafterは、特に構造的およびテクスチュラルな忠実さを保ちながら、スタイルミックスのような柔軟な制御をサポートしながら、強力なゼロショット生成性能を達成することが示された。

論文の概要: FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation

関連論文リスト