Fugu-MT 論文翻訳(概要): SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction

論文の概要: SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction

arxiv url: http://arxiv.org/abs/2604.17141v2
Date: Tue, 21 Apr 2026 07:55:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 14:04:47.914685
Title: SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction
Title（参考訳）: SciImpact: 科学的影響予測のための多次元マルチフィールドベンチマーク
Authors: Hangxiao Zhu, Yuyu Zhang, Ping Nie, Yu Zhang,
Abstract要約: SciImpactは19分野にわたる科学的影響予測のための大規模多次元ベンチマークである。 215,928枚のコントラスト紙で構成され、短期と長期の両方で有意義な影響差を反映している。我々は、SciImpact上で広く使われている11の大規模言語モデル(LLM)を評価した。
参考スコア（独自算出の注目度）: 10.526822187607658
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The rapid growth of scientific literature calls for automated methods to assess and predict research impact. Prior work has largely focused on citation-based metrics, leaving limited evaluation of models' capability to reason about other impact dimensions. To this end, we introduce SciImpact, a large-scale, multi-dimensional benchmark for scientific impact prediction spanning 19 fields. SciImpact captures various forms of scientific influence, ranging from citation counts to award recognition, media attention, patent reference, and artifact adoption, by integrating heterogeneous data sources and targeted web crawling. It comprises 215,928 contrastive paper pairs reflecting meaningful impact differences in both short-term (e.g., Best Paper Award) and long-term settings (e.g., Nobel Prize). We evaluate 11 widely used large language models (LLMs) on SciImpact. Results show that off-the-shelf models exhibit substantial variability across dimensions and fields, while multi-task supervised fine-tuning consistently enables smaller LLMs (e.g., 4B) to markedly outperform much larger models (e.g., 30B) and surpass powerful closed-source LLMs (e.g., o4-mini). These results establish SciImpact as a challenging benchmark and demonstrate its value for multi-dimensional, multi-field scientific impact prediction. Our project homepage is https://flypig23.github.io/sciimpact-homepage/
Abstract（参考訳）: 科学文献の急速な成長は、研究影響を評価し予測するための自動的な方法を要求する。先行研究は引用に基づくメトリクスに主に焦点を合わせており、他の影響次元を推論するためにモデルの能力について限られた評価を残している。この目的のために、19分野にわたる科学的影響予測のための大規模多次元ベンチマークであるSciImpactを紹介した。 SciImpactは、引用数からメディアの注目、特許参照、アーティファクトの採用に至るまで、異種データソースとターゲットWebクローリングを統合することで、様々な科学的影響を捉えている。 215,928枚のコントラスト紙で、短期(例えば、ベストペーパー賞)と長期(例えば、ノーベル賞)の両方に有意義な影響差を反映している。我々は、SciImpact上で広く使われている11の大規模言語モデル(LLM)を評価した。一方,マルチタスク制御による微調整により,より小さなLCM(e g , 4B)がより大きなモデル(e g , 30B)を著しく上回り,強力なクローズドソースLCM(e g , o4-mini)を上回った。これらの結果は、SciImpactを挑戦的なベンチマークとして確立し、多次元の多分野科学的影響予測にその価値を実証する。私たちのプロジェクトのホームページはhttps://flypig23.github.io/sciimpact-homepage/です。

論文の概要: SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction

関連論文リスト