Fugu-MT 論文翻訳(概要): Aligned Multi-View Scripts for Universal Chart-to-Code Generation

論文の概要: Aligned Multi-View Scripts for Universal Chart-to-Code Generation

arxiv url: http://arxiv.org/abs/2604.24559v1
Date: Mon, 27 Apr 2026 14:47:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:08.100338
Title: Aligned Multi-View Scripts for Universal Chart-to-Code Generation
Title（参考訳）: ユニバーサルチャート・ツー・コード生成のための多視点スクリプトのアライメント
Authors: Zhihan Zhang, Lizi Liao,
Abstract要約: 既存のメソッドは大部分がPython中心であり、実用的な使用を制限し、重要な監視源を見落としている。 Chart2NCodeは176Kチャートのデータセットで、Python、R、視覚的に等価な出力をレンダリングする視覚化と整列したスクリプトを組み合わせます。 LLaVAスタイルのアーキテクチャ上に構築されたCharLuMAは,低ランク部分空間の言語条件の混合でマルチモーダルプロジェクタを拡張可能なパラメータ効率適応モジュールである。
参考スコア（独自算出の注目度）: 25.240854955272912
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Chart-to-code generation converts a chart image into an executable plotting script, enabling faithful reproduction and editable visualizations. Existing methods are largely Python-centric, limiting practical use and overlooking a critical source of supervision: the same chart can be expressed by semantically equivalent scripts in different plotting languages. To fill this gap, we introduce Chart2NCode, a dataset of 176K charts paired with aligned scripts in Python, R, and LaTeX that render visually equivalent outputs, constructed via a metadata-to-template pipeline with rendering verification and human quality checks. Building on a LLaVA-style architecture, we further propose CharLuMA, a parameter-efficient adaptation module that augments the multimodal projector with a language-conditioned mixture of low-rank subspaces, allowing the model to share core chart understanding while specializing code generation to the target language through lightweight routing. Extensive experiments show consistent gains in executability and visual fidelity across all languages, outperforming strong open-source baselines and remaining competitive with proprietary systems. Further analyses reveal that balanced multi-language supervision benefits all languages and that the adapter allocates a compact shared core plus language-specific capacity. Codes and data are available at https://github.com/Zhihan72/CharLuMA.
Abstract（参考訳）: Chart-to-code生成は、チャートイメージを実行可能なプロットスクリプトに変換し、忠実な再現と編集可能な視覚化を可能にする。既存のメソッドはPython中心であり、実用的な使用を制限し、監督の重要なソースを見渡す:同じチャートは異なるプロット言語で意味的に等価なスクリプトで表現できる。このギャップを埋めるために、Python、R、LaTeXの整列スクリプトと組み合わせた176KチャートのデータセットであるChart2NCodeを紹介します。 LLaVAスタイルのアーキテクチャ上に構築されたCharLuMAは,マルチモーダルプロジェクタを低ランク部分空間の言語条件の混合で拡張するパラメータ効率適応モジュールである。大規模な実験は、すべての言語で実行可能性と視覚的忠実度が一貫して向上し、強力なオープンソースベースラインを上回り、プロプライエタリなシステムとの競争力を維持していることを示している。さらに分析したところ、バランスの取れた多言語指導は全ての言語に利益をもたらし、アダプタはコンパクトな共有コアと言語固有の容量を割り当てていることがわかった。コードとデータはhttps://github.com/Zhihan72/CharLuMA.comで公開されている。

論文の概要: Aligned Multi-View Scripts for Universal Chart-to-Code Generation

関連論文リスト