Fugu-MT paper translation (abstract): DIAGRAMS: A Review Framework for Reasoning-Level Attribution in Diagram QA

Paper summary: DIAGRAMS: A Review Framework for Reasoning-Level Attribution in Diagram QA

arxiv url: http://arxiv.org/abs/2605.00905v1
Date: Wed, 29 Apr 2026 02:34:51 GMT
Status: translation complete
Last updated in system: 2026-05-05 20:33:49.470098
Title: DIAGRAMS: A Review Framework for Reasoning-Level Attribution in Diagram QA
Authors: Anirudh Iyengar Kaniyar Narayana Iyengar, Tampu Ravi Kumar, Manan Suri, Raviteja Bommireddy, Dinesh Manocha, Puneet Mathur, Vivek Gupta
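Abstract summary: Diagram question answering (Diagram QA) requires reasoning-level attribution that links each question-answer pair to all visual regions needed to derive the answer. We present DIAGRAMS, a lightweight, schema-driven review framework that decouples interface logic from dataset-specific structures.
Reference score (site-computed attention score): 56.73431446011309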
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diagram question answering (Diagram QA) requires reasoning-level attribution that links each question-answer pair to all visual regions needed to derive the answer, rather than only the region containing the final response. Creating such structured evidence across diagrams, charts, maps, circuits, and infographics is time-consuming, and existing annotation tools tightly couple their interfaces to dataset-specific formats. We present DIAGRAMS, a lightweight, schema-driven review framework that decouples interface logic from dataset-specific JSON structures through an internal meta-schema and dataset adapters. Given an image and QA pair with optional candidate regions, the system performs QA-conditioned evidence selection and proposes the regions required for reasoning. When QA pairs or candidate regions are missing, it generates them and supports human verification and refinement. Across six Diagram QA datasets, model-suggested evidence achieves 85.39% precision and 75.30% recall against reviewer-final selections (micro-averaged). These results indicate that the review-first framework reduces manual region creation while maintaining high agreement with final reasoning-level attributions. We release a public demo and installable package to support dataset auditing, grounded supervision creation, and grounded evaluation.
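The abstract reports precision and recall of model-suggested evidence against reviewer-final selections, micro-averaged. The following is a minimal sketch of how such micro-averaged scores could be computed, assuming each QA pair contributes a set of suggested region IDs and a set of reviewer-final region IDs. The function name `micro_precision_recall`, the exact-ID matching rule, and the toy data are illustrative assumptions, not taken from the paper or its released package.

```python
from typing import Hashable, Iterable, Set, Tuple

# One item per QA pair: (model-suggested region IDs, reviewer-final region IDs).
RegionPair = Tuple[Set[Hashable], Set[Hashable]]


def micro_precision_recall(pairs: Iterable[RegionPair]) -> Tuple[float, float]:
    """Pool true/false positives and false negatives over all QA pairs,
    then compute precision and recall once (micro-averaging)."""
    tp = fp = fn = 0
    for suggested, final in pairs:
        suggested, final = set(suggested), set(final)
        tp += len(suggested & final)   # suggested regions the reviewer kept
        fp += len(suggested - final)   # suggested regions the reviewer dropped
        fn += len(final - suggested)   # regions the reviewer had to add
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy data for illustration only; these are not results from the paper.
    toy_pairs = [
        ({"r1", "r2", "r3"}, {"r1", "r2", "r4"}),
        ({"r5"}, {"r5", "r6"}),
    ]
    precision, recall = micro_precision_recall(toy_pairs)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

Because counts are pooled before dividing, QA pairs with many regions weigh more than pairs with few, which is the usual distinction between micro- and macro-averaging. How the paper matches a suggested region to a final one (exact ID, overlap threshold, or otherwise) is not stated in the abstract, so the set-membership test here is only a stand-in.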
