Fugu-MT 論文翻訳(概要): DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition

論文の概要: DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition

arxiv url: http://arxiv.org/abs/2604.15866v1
Date: Fri, 17 Apr 2026 09:16:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-20 22:00:19.851801
Title: DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition
Title（参考訳）: DiZiNER:Zero-shot Named Entity Recognition のためのパイロットアノテーションシミュレーションによる診断誘導型インストラクションリファインメント
Authors: Siun Kim, Hyung-Jin Yoon,
Abstract要約: 大規模言語モデル(LLM)は、ゼロショットおよび少数ショット名付きエンティティ認識(NER)を有効にすることで、高度な情報抽出(IE)を行う。命令の微調整の進歩にもかかわらず、ゼロショットのNERは教師付きシステムよりずっと遅れている。我々は、パイロットアノテーションプロセスをシミュレートするフレームワークであるDiZiNERを紹介し、LLMをアノテータとスーパーバイザの両方として利用する。 18のベンチマークで、DiZiNERは14のデータセットでゼロショットSOTA結果を達成し、事前のベストを+8.0 F1に改善し、ゼロショットから教師付きギャップを+11ポイント以上削減した。
参考スコア（独自算出の注目度）: 0.061386715480643554
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have advanced information extraction (IE) by enabling zero-shot and few-shot named entity recognition (NER), yet their generative outputs still show persistent and systematic errors. Despite progress through instruction fine-tuning, zero-shot NER still lags far behind supervised systems. These recurring errors mirror inconsistencies observed in early-stage human annotation processes that resolve disagreements through pilot annotation. Motivated by this analogy, we introduce DiZiNER (Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition), a framework that simulates the pilot annotation process, employing LLMs to act as both annotators and supervisors. Multiple heterogeneous LLMs annotate shared texts, and a supervisor model analyzes inter-model disagreements to refine task instructions. Across 18 benchmarks, DiZiNER achieves zero-shot SOTA results on 14 datasets, improving prior bests by +8.0 F1 and reducing the zero-shot to supervised gap by over +11 points. It also consistently outperforms its supervisor, GPT-5 mini, indicating that improvements stem from disagreement-guided instruction refinement rather than model capacity. Pairwise agreement between models shows a strong correlation with NER performance, further supporting this finding.
Abstract（参考訳）: 大規模言語モデル(LLM)は、ゼロショットおよび少数ショット名付きエンティティ認識(NER)を有効にすることで、高度な情報抽出(IE)を行うが、生成出力は持続的かつ体系的なエラーを示す。命令の微調整の進歩にもかかわらず、ゼロショットのNERは教師付きシステムよりずっと遅れている。これらの繰り返しエラーは、パイロットアノテーションを通して不一致を解消する初期の人間のアノテーションプロセスで観察された矛盾を反映している。このアナロジーに触発されたDiZiNER (Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition) は、パイロットアノテーションプロセスをシミュレートし、LLMを用いてアノテーションとスーパーバイザの両方として機能するフレームワークである。複数のヘテロジニアスLSMは共有テキストに注釈を付け、スーパーバイザーモデルはモデル間不一致を分析してタスク命令を洗練させる。 18のベンチマークで、DiZiNERは14のデータセットでゼロショットSOTA結果を達成し、事前のベストを+8.0 F1に改善し、ゼロショットから教師付きギャップを+11ポイント以上削減した。また、そのスーパーバイザーであるGPT-5 miniを一貫して上回り、改良はモデルキャパシティではなく、不一致誘導命令の改良によるものであることを示している。モデル間のペアワイドな一致は、NERのパフォーマンスと強い相関を示し、この発見をさらに支持している。

論文の概要: DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition

関連論文リスト