Fugu-MT 論文翻訳(概要): Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval

論文の概要: Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval

arxiv url: http://arxiv.org/abs/2604.09668v1
Date: Wed, 01 Apr 2026 09:28:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-19 19:09:11.609368
Title: Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval
Title（参考訳）: 生成辞書検索による古代Oracle Bone Scriptのデコード
Authors: Yin Wu, Gangjian Zhang, Jiayu Chen, Chang Xu, Yuyu Luo, Nan Tang, Hui Xiong,
Abstract要約: 中国の上海王朝の Oracle Bone Script (OBS) はこの課題を実証している。現代漢字に対する可塑性OBS変種の合成辞書を生成する。 54.3%のTop-10と86.6%のTop-50の精度を達成した。
参考スコア（独自算出の注目度）: 39.098205969594424
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Understanding humanity's earliest writing systems is crucial for reconstructing civilization's origins, yet many ancient scripts remain undeciphered. Oracle Bone Script (OBS) from China's Shang dynasty exemplifies this challenge: only approximately 1,500 of roughly 4,600 characters have been decoded, and a substantial portion of these 3,000-year-old inscriptions remains only partially understood. Limited by extreme data scarcity, existing computational methods achieve under 3% accuracy on unseen characters -- the core palaeographic challenge. We overcome this by reframing decipherment from classification to dictionary-based retrieval. Using deep learning guided by character evolution principles, we generate a comprehensive synthetic dictionary of plausible OBS variants for modern Chinese characters. Scholars query unknown inscriptions to retrieve visually similar candidates with transparent evidence, replacing algorithmic black boxes with interpretable hypotheses. Our approach achieves 54.3% Top-10 and 86.6% Top-50 accuracy for unseen characters. This scalable, transparent framework accelerates decipherment of a pivotal undeciphered script and establishes a generalizable methodology for AI-assisted archaeological discovery.
Abstract（参考訳）: 人類の最も初期の文字体系を理解することは文明の起源の再構築に不可欠であるが、多くの古代の文字は解読されていない。約4,600文字の約1,500文字が解読され、3,000年前の碑文のかなりの部分が部分的には理解されていない。極端なデータ不足によって制限された既存の計算手法は、未確認文字の精度を3%以下に抑える。我々は、解読を分類から辞書ベースの検索に再定義することでこれを克服する。文字進化原理で導かれた深層学習を用いて,現代漢字の可塑性OBS変種を包括的に合成した辞書を生成する。学者は未知の碑文をクエリして、視覚的に類似した候補を透明な証拠で検索し、アルゴリズム的なブラックボックスを解釈可能な仮説で置き換える。提案手法は54.3%のTop-10と86.6%のTop-50の精度を実現する。このスケーラブルで透明なフレームワークは、重要な未解読スクリプトの解読を加速し、AI支援考古学的発見のための一般化可能な方法論を確立する。

論文の概要: Decoding Ancient Oracle Bone Script via Generative Dictionary Retrieval

関連論文リスト