Fugu-MT 論文翻訳(概要): Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards

論文の概要: Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards

arxiv url: http://arxiv.org/abs/2603.06958v1
Date: Sat, 07 Mar 2026 00:26:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:13.498582
Title: Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards
Title（参考訳）: Chart-RL: 検証可能なリワードによる強化学習による一般化チャート理解
Authors: Xin Zhang, Xingyu Li, Rongguang Wang, Ruizhong Miao, Zheng Wang, Dan Roth, Chenyang Li,
Abstract要約: 本稿では,有効強化学習(RL)手法であるChart-RLを紹介する。実験の結果,Chart-RLは異なるチャート理解ベンチマークで教師付き微調整(SFT)を一貫して上回っていることがわかった。
参考スコア（独自算出の注目度）: 64.70559607526398
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate chart comprehension represents a critical challenge in advancing multimodal learning systems, as extensive information is compressed into structured visual representations. However, existing vision-language models (VLMs) frequently struggle to generalize on unseen charts because it requires abstract, symbolic, and quantitative reasoning over structured visual representations. In this work, we introduce Chart-RL, an effective reinforcement learning (RL) method that employs mathematically verifiable rewards to enhance chart question answering in VLMs. Our experiments demonstrate that Chart-RL consistently outperforms supervised fine-tuning (SFT) across different chart understanding benchmarks, achieving relative improvements of 16.7% on MutlChartQA, and 11.5% on ChartInsights. We conduct robustness analysis, where Chart-RL achieves enhanced performance in 18 of 25 perturbed chart categories, demonstrating strong consistency and reasoning capability across visual variations. Furthermore, we demonstrate that task difficulty and inherent complexity are more critical than data quantity in RL training. For instance, Chart-RL trained on merely 10 complex chart-query examples significantly outperforms models trained on over 6,000 simple examples. Additionally, training on challenging reasoning tasks not only improves in-domain generalization relative to simpler tasks, but also facilitate strong transfer to out-of-domain visual mathematical problems.
Abstract（参考訳）: 正確なチャートの理解は、広範囲な情報が構造化された視覚表現に圧縮されるため、マルチモーダル学習システムを進める上で重要な課題である。しかしながら、既存の視覚言語モデル(VLM)は、構造化された視覚表現よりも抽象的で象徴的で定量的な推論を必要とするため、目に見えないチャートを一般化するのにしばしば苦労する。本稿では,実効強化学習(RL)手法であるChart-RLを紹介する。我々の実験では、Chart-RLは異なるチャート理解ベンチマークで教師付き微調整(SFT)を一貫して上回り、MutlChartQAでは16.7%、ChartInsightsでは11.5%の相対的な改善を実現している。頑健性解析を行い、25の摂動チャートカテゴリのうち18のチャートにおいて、Chart-RLが性能を向上し、視覚的変動にまたがる強い一貫性と推論能力を示す。さらに、RLトレーニングにおいて、タスクの難易度と固有の複雑さがデータ量よりも重要であることを示す。例えば、Chart-RLは、わずか10の複雑なチャートクエリの例でトレーニングされただけで、6000以上の単純な例でトレーニングされたモデルよりも大幅に優れています。さらに、課題推論タスクのトレーニングは、単純なタスクに対してドメイン内の一般化を改善するだけでなく、領域外視覚数学的問題への強い移行を促進する。

論文の概要: Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards

関連論文リスト