Fugu-MT 論文翻訳(概要): SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

論文の概要: SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

arxiv url: http://arxiv.org/abs/2604.26645v1
Date: Wed, 29 Apr 2026 13:11:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.411579
Title: SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data
Title（参考訳）: SciHorizon-DataEVA:異種科学データのAI可読性評価のためのエージェントシステム
Authors: Dianyu Liu, Chuan Qin, Xi Chen, Xiaohan Li, Wenxi Xu, Yuyang Wang, Xin Chen, Yuanchun Zhou, Hengshu Zhu,
Abstract要約: SciHorizon-DataEVAは、科学データのスケーラブルなAI可読性評価のための新しいエージェントシステムである。評価基準レベルでは、Sci-TQA2原則を導入し、信頼の信頼性、データ品質、AI適合性、科学的適応性の4つの相補的な側面にAI対応を整理する。我々のSci-TQA2-Evalは、軽量なデータセットプロファイリング、応用性を考慮したメトリックアクティベーション、知識に富んだ計画を組み合わせることで、データセット認識評価仕様を動的に構築する。
参考スコア（独自算出の注目度）: 31.749503531377812
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: AI-for-Science (AI4Science) is increasingly transforming scientific discovery by embedding machine learning models into prediction, simulation, and hypothesis generation workflows across domains. However, the effectiveness of these models is fundamentally constrained by the AI-readiness of scientific data, for which no scalable and systematic evaluation mechanism currently exists. In this work, we propose SciHorizon-DataEVA, a novel agentic system to scalable AI-readiness evaluation of heterogeneous scientific data. At the evaluation-criteria level, we introduce the Sci-TQA2 principles, which organize AI-readiness into four complementary dimensions: Governance Trustworthiness, Data Quality, AI Compatibility, and Scientific Adaptability. Each dimension is decomposed into measurable atomic elements that enable fine-grained and executable assessment. To operationalize these principles at scale, we develop Sci-TQA2-Eval, a hierarchical multi-agent evaluation approach orchestrated through a directed, cyclic workflow. Our Sci-TQA2-Eval dynamically constructs dataset-aware evaluation specifications by combining lightweight dataset profiling, applicability-aware metric activation, and knowledge-augmented planning grounded in domain constraints and dataset-paper signals. These specifications are executed through an adaptive, tool-centric evaluation mechanism with built-in verification and self-correction, enabling scalable and reliable assessment across heterogeneous scientific data. Extensive experiments on scientific datasets spanning multiple domains demonstrate the effectiveness and generality of SciHorizon-DataEVA for principled AI-readiness evaluation.
Abstract（参考訳）: AI-for-Science(AI4Science)は、機械学習モデルを予測、シミュレーション、仮説生成ワークフローに組み込むことによって、科学的な発見をドメイン全体に拡大している。しかし、これらのモデルの有効性は、現在、スケーラブルで体系的な評価機構が存在しない科学データのAI可読性によって根本的に制約されている。本研究では,異種科学データのスケーラブルなAI可読性評価のためのエージェントシステムであるSciHorizon-DataEVAを提案する。評価基準レベルでは、Sci-TQA2原則を導入し、信頼の信頼性、データ品質、AI適合性、科学的適応性の4つの相補的な側面にAI対応を整理する。各次元は、微細で実行可能な評価を可能にする測定可能な原子要素に分解される。 Sci-TQA2-Evalという階層型マルチエージェント評価手法を開発し,これらの原理を大規模に運用する。我々のSci-TQA2-Evalは、軽量なデータセットプロファイリング、適用性を考慮したメトリックアクティベーション、ドメイン制約とデータセットペーパー信号に基づく知識強化計画を組み合わせることで、データセット認識評価仕様を動的に構築する。これらの仕様は、組み込まれた検証と自己補正を備えた適応的なツール中心の評価メカニズムを通じて実行され、異種科学データ間のスケーラブルで信頼性の高い評価を可能にする。複数の領域にまたがる科学データセットに関する大規模な実験は、原則化されたAIの可読性評価のためのSciHorizon-DataEVAの有効性と汎用性を実証している。

論文の概要: SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

関連論文リスト