Fugu-MT 論文翻訳(概要): Learning Diachronic Representations of Ancient Greek Letterforms

論文の概要: Learning Diachronic Representations of Ancient Greek Letterforms

arxiv url: http://arxiv.org/abs/2606.24984v1
Date: Tue, 23 Jun 2026 14:13:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 17:05:30.081377
Title: Learning Diachronic Representations of Ancient Greek Letterforms
Title（参考訳）: 古代ギリシア文字のダイアクロニック表現の学習
Authors: John Pavlopoulos, Spyros Barbakos, Lavinia Ferretti, Dionysis Voulgarakis, Asimina Paparrigopoulou, Maria Konstantinidou, Giuseppe De Gregorio, Isabelle Marthot-Santaniello, Paraskevi Platanou, Holger Essler,
Abstract要約: 何世紀にもわたって手書きの変動に頑健な表現を学習することは、ダイアクロニック表現学習における重要な課題である。ダイアクロニック表現学習のための3つのデータセット: Hell-Char、紀元前3世紀から1世紀にかけての訓練セット、PaLit-CharとMed-Charの2つの評価セットを紹介する。そこで本研究では,クラス間類似度を動的に推定して埋め込みをバイアスする類似性重み付き教師付きコントラスト損失と,リアルな写本の破損をシミュレートするラグナ駆動拡張スキームを提案する。
参考スコア（独自算出の注目度）: 5.399565315727663
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Learning representations that remain robust across centuries of variation in handwriting is a key challenge in diachronic representation learning. Taking one of the longest continuously used writing systems, ancient Greek, as a case study, we introduce three datasets for diachronic representation learning: Hell-Char, a curated training set spanning the 3rd-1st centuries BCE, and two evaluation sets, PaLit-Char (2nd-5th c. CE) and Med-Char (9th-14th c. CE). To address the challenges of symbolic variation, scarce data, and systematic degradation, we propose: a similarity-weighted supervised contrastive loss that biases embeddings using dynamically estimated inter-class similarities, and a lacuna-driven augmentation scheme that simulates realistic manuscript corruptions. Trained with these strategies, both a lightweight CNN and a pretrained ResNet achieve strong recognition performance and produce embeddings that more coherently separate character classes than PCA or generic pretrained models. These embeddings enable clustering, identification of stylistic subgroups, and construction of prototype images that visualize diachronic evolution and transitional letterforms. Our results demonstrate that respecting intrinsic inter-letter relationships and augmenting with domain-informed corruptions yield robust, interpretable representations, offering a transferable paradigm for representation learning under scarce, temporally evolving, and noisy conditions. Code and data available at: https://github.com/ipavlopoulos/diachronic-greek-letterforms.
Abstract（参考訳）: 何世紀にもわたって手書きの変動に頑健な表現を学習することは、ダイアクロニック表現学習における重要な課題である。これは紀元前3世紀から1世紀にかけての訓練セットであり、PaLit-Char (2nd-5th CE) とMed-Char (9th-14th CE) の2つの評価セットである。シンボリック変動, 不足データ, 系統的劣化の課題に対処するため, 動的に推定されるクラス間の類似性を用いて埋め込みをバイアスする類似性重み付き教師付きコントラスト損失と, 現実的な写本の破損をシミュレートするラグナ駆動の増補スキームを提案する。これらの戦略で訓練された軽量CNNと事前訓練されたResNetは、強力な認識性能を達成し、PCAや一般的な事前訓練モデルよりも整合的に文字クラスを分離した埋め込みを生成する。これらの埋め込みにより、クラスタリング、スタイリスティックな部分群の同定、およびダイアクロニック進化と遷移レターフォームを視覚化するプロトタイプ画像の構築が可能になる。本研究の結果は,本文間の関係を尊重し,ドメイン情報による汚職を増大させることによって,表現学習のための伝達可能なパラダイムを,希少かつ時間的に進化し,騒々しい条件下で提供し,堅牢で解釈可能な表現が得られることを示す。コードとデータは、https://github.com/ipavlopoulos/diachronic-greek-letterforms.comで公開されている。

論文の概要: Learning Diachronic Representations of Ancient Greek Letterforms

関連論文リスト