Fugu-MT 論文翻訳(概要): ChartZero: Synthetic Priors Enable Zero Shot Chart Data Extraction

論文の概要: ChartZero: Synthetic Priors Enable Zero Shot Chart Data Extraction

arxiv url: http://arxiv.org/abs/2605.05820v1
Date: Thu, 07 May 2026 07:57:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.610733
Title: ChartZero: Synthetic Priors Enable Zero Shot Chart Data Extraction
Title（参考訳）: ChartZero: ゼロショットチャートデータの抽出を可能にするシンセティックプライオリティ
Authors: Md Touhidul Islam, Yasir Mahmud, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi,
Abstract要約: 合成先行情報を活用する解析フレームワークであるChartZeroを導入し,ロバストなゼロショットチャートデータ抽出を実現する。我々は,Global Orthogonal Instance (GOI) の損失によって曲線の断片化を克服し,不安定な空間ルールをオープン語彙であるVLM(Vision-Language Model)に置き換える。
参考スコア（独自算出の注目度）: 2.825872203744638
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Automated data extraction from line charts remains fundamentally bottlenecked by extreme stylistic diversity and a severe scarcity of comprehensively annotated, real-world datasets. Current end-to-end pipelines depend heavily on costly manual annotations, crippling their ability to generalize across arbitrary aesthetics and grid layouts. Furthermore, existing models suffer from two critical failure modes during reconstruction. First, extracting thin, intersecting curves frequently causes structural fragmentation and the erasure of fine visual details, as standard architectures struggle against complex backgrounds. Second, semantic association is notoriously error-prone; current pipelines rely on rigid spatial heuristics that easily break down against the unpredictable legend placements of in-the-wild charts. Finally, measuring true progress is hindered by evaluation protocols that assess isolated sub-tasks rather than holistic, end-to-end data reconstruction. To address these foundational issues, we introduce ChartZero, a parsing framework that leverages synthetic priors to enable robust zero-shot chart data extraction. By training exclusively on a purely synthetic dataset of simple mathematical functions, our model completely bypasses the real-world annotation bottleneck. We overcome curve fragmentation via a novel Global Orthogonal Instance (GOI) loss, and replace brittle spatial rules with an open-vocabulary, Vision-Language Model (VLM)-guided legend matching strategy. Accompanied by a new metric and benchmark specifically designed for full end-to-end reconstruction, our evaluations demonstrate that ChartZero significantly advances generalized plot digitization without requiring real-world supervision. Code and dataset will be released upon acceptance.
Abstract（参考訳）: ラインチャートからの自動データ抽出は、極端なスタイリスティックな多様性と、包括的な注釈付き実世界のデータセットの不足によって、基本的にボトルネックになっている。現在のエンドツーエンドパイプラインは、コストのかかる手作業によるアノテーションに大きく依存しており、任意の美学とグリッドレイアウトをまたいで一般化する能力を制限している。さらに、既存モデルは再建時に2つの重要な障害モードに悩まされる。まず、細く交差する曲線を抽出すると、しばしば構造的な断片化や、標準的なアーキテクチャが複雑な背景と戦っているため、細かな視覚的詳細が消去される。第二に、セマンティック・アソシエーションはエラーを起こしやすいことで知られており、現在のパイプラインは、予測不能な伝説的なチャートの配置に容易に逆らう、厳密な空間的ヒューリスティックに依存している。最後に、完全なエンドツーエンドのデータ再構成ではなく、孤立したサブタスクを評価する評価プロトコルによって、真の進捗を測定することが妨げられる。このような基礎的な問題に対処するために、我々は、ゼロショットチャートデータの堅牢な抽出を可能にするために、合成先行値を活用する解析フレームワークであるChartZeroを紹介した。単純な数学的関数の純粋に合成されたデータセットのみをトレーニングすることにより、われわれのモデルは実世界のアノテーションボトルネックを完全に回避する。我々は,Global Orthogonal Instance (GOI) の損失によって曲線の断片化を克服し,不安定な空間ルールをオープン語彙であるVLM(Vision-Language Model)に置き換える。実世界の監視を必要とせずに,ChartZeroが一般化されたプロットのデジタル化を著しく向上させることを示す。コードとデータセットは受理時にリリースされる。

論文の概要: ChartZero: Synthetic Priors Enable Zero Shot Chart Data Extraction

関連論文リスト