Fugu-MT 論文翻訳(概要): What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework

論文の概要: What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework

arxiv url: http://arxiv.org/abs/2605.20537v1
Date: Tue, 19 May 2026 22:19:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.398448
Title: What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework
Title（参考訳）: バイオメディカルNERとエンティティリンクベンチマークは何か? コーパス中心診断フレームワーク
Authors: Robert Leaman, Rezarta Islamaj, Zhiyong Lu,
Abstract要約: 注釈付きコーパスからベンチマーク関連プロパティを診断するためのコーパス中心のフレームワークを提案する。この枠組みを病気、化学物質、細胞タイプにまたがる9つのコーパスに適用すると、コーパスの性質は著しく異なることが判明した。
参考スコア（独自算出の注目度）: 6.2370872977761715
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Biomedical named entity recognition (NER) and entity linking (EL) strongly depend on annotated corpora, but the utility of these resources for benchmarking is often assumed rather than characterized. We present a corpus-centric framework for diagnosing benchmark-relevant properties directly from corpus annotations, concept links, train-test splits, document metadata, and terminology mappings. The framework organizes standardized statistics into five families: (1) scale, density and label distribution, (2) lexical and conceptual structure, (3) train-test overlap, (4) metadata composition, and (5) terminology coverage where applicable. Applying the framework to nine corpora spanning diseases, chemicals, and cell types, we find that corpus properties can differ substantially, even when they address the same apparent task. We find differences in the evaluation signal they provide, the generalization demands they impose, the degree of train-test reuse they permit, and the regions of biomedical literature and concept space they represent. These differences suggest that commonly reported corpus statistics can be insufficient to characterize what biomedical NER and EL benchmarks evaluate. We argue that corpus-centric diagnostics provide a practical framework for analyzing corpora beyond surface descriptors such as corpus size and entity type, for identifying potential transfer risks, and for interpreting the scope of benchmarking conclusions. We release the framework as open-source code with an interactive dashboard to support reproducing our analyses and characterizing additional corpora.
Abstract（参考訳）: バイオメディカルな名前付きエンティティ認識(NER)とエンティティリンク(EL)は注釈付きコーパスに強く依存するが、これらのリソースをベンチマークに利用することは、特徴よりもむしろ想定されることが多い。コーパスアノテーション,概念リンク,トレイン-テスト分割,文書メタデータ,用語マッピングから直接,ベンチマーク関連プロパティを診断するためのコーパス中心のフレームワークを提案する。本フレームワークは,(1)スケール,密度,ラベル分布,(2)語彙構造,概念構造,(3)テストオーバーラップ,(4)メタデータ構成,(5)用語カバレッジの5つのカテゴリに分類する。この枠組みを病気、化学物質、細胞タイプにまたがる9つのコーパスに適用すると、コーパス特性は同一の課題に対処しても大きく異なることが分かる。提案する評価信号の差異や,それらが課す一般化要求,許容する列車試験の程度,およびそれらが表す生物医学文献や概念空間の領域に違いを見出した。これらの違いは、一般的に報告されているコーパス統計が、バイオメディカルNERとELベンチマークの評価を特徴づけるには不十分であることを示している。コーパス中心の診断は、コーパスサイズやエンティティタイプといった表面記述子以外のコーパスを解析し、潜在的な転送リスクを特定し、ベンチマーク結果の範囲を解釈するための実践的な枠組みを提供する。我々はこのフレームワークをインタラクティブなダッシュボードを備えたオープンソースコードとしてリリースし、分析を再現し、追加コーパスを特徴付けるのをサポートする。

論文の概要: What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework

関連論文リスト