Fugu-MT Paper Translation (Abstract): DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization

Paper abstract: DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization

arxiv url: http://arxiv.org/abs/2511.09117v1
Date: Thu, 13 Nov 2025 01:33:12 GMT
Status: Translation complete
System update date: 2025-11-13 22:34:54.416242
Title: DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization
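Title (reference translation): DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization
Authors: Rui-Yang Ju, Kohei Yamashita, Hirotaka Kameko, Shinsuke Mori
Abstract summary: Kuzushiji, a pre-modern Japanese cursive script, can currently be read by only a few thousand trained experts in Japan. Existing optical character recognition techniques often fail to account for various types of noise, such as document degradation and seals. The Degraded Kuzushiji Documents with Seals (DKDS) dataset is introduced as a new benchmark for related tasks.
Reference score (independently computed attention score): 4.045683514325492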
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Kuzushiji, a pre-modern Japanese cursive script, can currently be read and understood by only a few thousand trained experts in Japan. With the rapid development of deep learning, researchers have begun applying Optical Character Recognition (OCR) techniques to transcribe Kuzushiji into modern Japanese. Although existing OCR methods perform well on clean pre-modern Japanese documents written in Kuzushiji, they often fail to consider various types of noise, such as document degradation and seals, which significantly affect recognition accuracy. To the best of our knowledge, no existing dataset specifically addresses these challenges. To address this gap, we introduce the Degraded Kuzushiji Documents with Seals (DKDS) dataset as a new benchmark for related tasks. We describe the dataset construction process, which required the assistance of a trained Kuzushiji expert, and define two benchmark tracks: (1) text and seal detection and (2) document binarization. For the text and seal detection track, we provide baseline results using multiple versions of the You Only Look Once (YOLO) models for detecting Kuzushiji characters and seals. For the document binarization track, we present baseline results from traditional binarization algorithms, traditional algorithms combined with K-means clustering, and Generative Adversarial Network (GAN)-based methods. The DKDS dataset and the implementation code for baseline methods are available at https://ruiyangju.github.io/DKDS.
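Abstract (reference translation): Kuzushiji, a pre-modern Japanese cursive script, can currently be read by only a few thousand trained experts in Japan. With the rapid development of deep learning, researchers have begun applying Optical Character Recognition (OCR) techniques to transcribe Kuzushiji into modern Japanese. Existing OCR methods perform well on clean pre-modern Japanese documents written in Kuzushiji, but they often fail to account for various types of noise, such as document degradation and seals, which significantly affect recognition accuracy. To the best of our knowledge, no existing dataset specifically addresses these challenges. To address this gap, we introduce the Degraded Kuzushiji Documents with Seals (DKDS) dataset as a new benchmark for related tasks. We describe the dataset construction process, which required the assistance of a trained Kuzushiji expert, and define two benchmark tracks: (1) text and seal detection and (2) document binarization. For the text and seal detection track, we provide baseline results using multiple versions of the You Only Look Once (YOLO) models for detecting Kuzushiji characters and seals. For the document binarization track, we present baseline results for traditional binarization algorithms, traditional algorithms combined with K-means clustering, and Generative Adversarial Network (GAN)-based methods. The DKDS dataset and the implementation code for the baseline methods are available at https://ruiyangju.github.io/DKDS.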
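
Illustrative note: the abstract lists "traditional binarization algorithms" as one baseline family for the document binarization track but does not name them. As a minimal sketch of what such a baseline can look like, the Python below implements Otsu's global thresholding, chosen here only as a well-known representative and not confirmed as one of the paper's baselines. The function names `otsu_threshold` and `binarize` are hypothetical, and the input is assumed to be an 8-bit grayscale image given as nested lists.

```python
from typing import List


def otsu_threshold(pixels: List[int]) -> int:
    """Return the threshold t (0-255) that maximizes between-class variance.

    Pixels with value <= t are treated as the dark class (ink),
    the rest as the bright class (paper). Assumes 8-bit grayscale input.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(value * count for value, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0    # number of pixels with value <= t
    sum_dark = 0  # sum of those pixel values
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue  # no dark pixels yet
        w_bright = total - w_dark
        if w_bright == 0:
            break  # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_bright = (sum_all - sum_dark) / w_bright
        var_between = w_dark * w_bright * (mean_dark - mean_bright) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image: List[List[int]]) -> List[List[int]]:
    """Map each pixel to 0 (ink) or 255 (paper) using the Otsu threshold."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[0 if p <= t else 255 for p in row] for row in image]


if __name__ == "__main__":
    # Tiny synthetic "page": three dark strokes on a bright background.
    page = [[10, 12, 200], [11, 210, 205]]
    print(binarize(page))
```

A single global threshold of this kind tends to struggle with uneven stains and colored seals, which is the sort of degradation the DKDS benchmark targets.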

Related papers