Fugu-MT 論文翻訳(概要): Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript

論文の概要: Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript

arxiv url: http://arxiv.org/abs/2502.00250v1
Date: Sat, 01 Feb 2025 01:16:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-02-05 15:21:53.549062
Title: Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript
Title（参考訳）: 異なるフォントフォーマットを用いたトランスフォーマーベースベクトルフォント分類:TrueType対PostScript
Authors: Takumu Fujioka, Gouhei Tanaka,
Abstract要約: 変換器を用いたベクトルグラフィックスの深層学習において,アウトラインに基づくフォント表現がTrueTypeのアウトラインよりも優れていることを示す。その結果,情報集約はトランスフォーマーによるベクトルグラフィックスの深層学習において重要であり,言語モデルのトークン化やビットマップによる画像認識モデルのパッチ分割においても重要であることが示唆された。
参考スコア（独自算出の注目度）: 1.1838866556981258
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Modern fonts adopt vector-based formats, which ensure scalability without loss of quality. While many deep learning studies on fonts focus on bitmap formats, deep learning for vector fonts remains underexplored. In studies involving deep learning for vector fonts, the choice of font representation has often been made conventionally. However, the font representation format is one of the factors that can influence the computational performance of machine learning models in font-related tasks. Here we show that font representations based on PostScript outlines outperform those based on TrueType outlines in Transformer-based vector font classification. TrueType outlines represent character shapes as sequences of points and their associated flags, whereas PostScript outlines represent them as sequences of commands. In previous research, PostScript outlines have been predominantly used when fonts are treated as part of vector graphics, while TrueType outlines are mainly employed when focusing on fonts alone. Whether to use PostScript or TrueType outlines has been mainly determined by file format specifications and precedent settings in previous studies, rather than performance considerations. To date, few studies have compared which outline format provides better embedding representations. Our findings suggest that information aggregation is crucial in Transformer-based deep learning for vector graphics, as in tokenization in language models and patch division in bitmap-based image recognition models. This insight provides valuable guidance for selecting outline formats in future research on vector graphics.
Abstract（参考訳）: 現代のフォントはベクターベースのフォーマットを採用しており、品質を損なうことなく拡張性を確保する。フォントに関する多くのディープラーニング研究はビットマップ形式に重点を置いているが、ベクトルフォントのディープラーニングはいまだ探索されていない。ベクトルフォントの深層学習に関する研究において、フォント表現の選択は従来から行われてきた。しかし、フォント表現形式は、フォント関連タスクにおける機械学習モデルの計算性能に影響を与える要因の1つである。ここでは、PostScriptに基づくフォント表現が、Transformerベースのベクトルフォント分類においてTrueTypeアウトラインよりも優れていることを示す。 TrueTypeのアウトラインはポイントとその関連するフラグのシーケンスとして、PostScriptのアウトラインはそれらをコマンドのシーケンスとして表現する。以前の研究では、PostScriptのアウトラインは主にフォントをベクトルグラフィックスの一部として扱うときに使われており、TrueTypeのアウトラインは主にフォントのみに焦点を当てるときに使われている。 PostScriptかTrueTypeのアウトラインを使用するかは、主にパフォーマンスよりも、以前の研究におけるファイルフォーマットの仕様と前例の設定によって決定されている。これまで、どのアウトラインフォーマットがより良い埋め込み表現を提供するかについて比較した研究はほとんどない。その結果,情報集約はトランスフォーマーによるベクトルグラフィックスの深層学習において重要であり,言語モデルのトークン化やビットマップによる画像認識モデルのパッチ分割においても重要であることが示唆された。この洞察は、将来のベクトルグラフィックスの研究において、アウトラインフォーマットを選択するための貴重なガイダンスを提供する。

論文の概要: Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript

関連論文リスト