Fugu-MT 論文翻訳(概要): Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study

論文の概要: Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study

arxiv url: http://arxiv.org/abs/2603.10784v1
Date: Wed, 11 Mar 2026 13:55:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-12 16:22:32.979594
Title: Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study
Title（参考訳）: LLMを用いたMIPVUルールスクリプト生成による中国語メタファー識別:比較プロトコルによる検討
Authors: Weihang Huang, Mengna Liu,
Abstract要約: 4つの比喩識別プロトコルを運用するLLM支援パイプラインを提案する。各プロトコルは、制御されたLLM呼び出しにインターリーブされた決定論的ステップのモジュラーチェーンである。トークン、文、およびスパンレベルのアノテーションにまたがる7つの中国語メタファーデータセットを評価する。
参考スコア（独自算出の注目度）: 1.482489178857542
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Metaphor identification is a foundational task in figurative language processing, yet most computational approaches operate as opaque classifiers offering no insight into why an expression is judged metaphorical. This interpretability gap is especially acute for Chinese, where rich figurative traditions, absent morphological cues, and limited annotated resources compound the challenge. We present an LLM-assisted pipeline that operationalises four metaphor identification protocols--MIP/MIPVU lexical analysis, CMDAG conceptual-mapping annotation, emotion-based detection, and simile-oriented identification--as executable, human-auditable rule scripts. Each protocol is a modular chain of deterministic steps interleaved with controlled LLM calls, producing structured rationales alongside every classification decision. We evaluate on seven Chinese metaphor datasets spanning token-, sentence-, and span-level annotation, establishing the first cross-protocol comparison for Chinese metaphor identification. Within-protocol evaluation shows Protocol A (MIP) achieves an F1 of 0.472 on token-level identification, while cross-protocol analysis reveals striking divergence: pairwise Cohen's kappa between Protocols A and D is merely 0.001, whereas Protocols B and C exhibit near-perfect agreement (kappa = 0.986). An interpretability audit shows all protocols achieve 100% deterministic reproducibility, with rationale correctness from 0.40 to 0.87 and editability from 0.80 to 1.00. Error analysis identifies conceptual-domain mismatch and register sensitivity as dominant failure modes. Our results demonstrate that protocol choice is the single largest source of variation in metaphor identification, exceeding model-level variation, and that rule-script architectures achieve competitive performance while maintaining full transparency.
Abstract（参考訳）: メタファー識別は比喩的言語処理の基本的なタスクであるが、ほとんどの計算手法は不透明な分類器として機能し、なぜ表現が比喩的に判断されるのかについての洞察を与えない。この解釈可能性のギャップは、中国では特に深刻であり、豊かな比喩的伝統、形態的手がかりの欠如、限られたアノテート資源が課題を複雑にしている。本稿では,MIP/MIPVU語彙解析,CMDAG概念マッピングアノテーション,感情に基づく検出,シミュラ指向の識別を実行可能な人間監査可能なルールスクリプトとして,4つのメタファ識別プロトコルを運用するLLM支援パイプラインを提案する。それぞれのプロトコルは、制御されたLLM呼び出しにインターリーブされた決定論的ステップのモジュラー連鎖であり、すべての分類決定と共に構造的論理を生成する。トークン,文,およびスパンレベルのアノテーションにまたがる7つの中国語メタファーデータセットを評価し,中国のメタファー識別に対する最初のクロスプロトコール比較を確立した。プロトコルA(MIP)はトークン単位の識別において0.472のF1を達成する一方、クロスプロトコール分析では顕著なばらつきが明らかである: プロトコルAとDの間のCohenのカッパは0.001に過ぎず、ProtocolBとCはほぼ完全に一致している(kappa = 0.986)。解釈可能性監査は、すべてのプロトコルが100%決定論的再現性、合理性は0.40から0.87、編集性は0.80から1.00であることを示す。エラー解析は、概念ドメインミスマッチとレジスタ感度を支配的な障害モードとして識別する。この結果から,プロトコル選択はメタファー識別における唯一の最大の変化源であり,モデルレベルの変動を超越したものであり,ルール記述アーキテクチャは完全な透明性を維持しながら競合性能を実現することが示唆された。

論文の概要: Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study

関連論文リスト