Fugu-MT 論文翻訳(概要): Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels

論文の概要: Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels

arxiv url: http://arxiv.org/abs/2511.21038v1
Date: Wed, 26 Nov 2025 04:14:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-27 18:37:58.959887
Title: Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
Title（参考訳）: 文脈内学習におけるセマンティックアンカー : 小さなLLMがラベルをフリップできない理由
Authors: Anantha Padmanaban Krishna Kumar,
Abstract要約: コンテキスト内学習は、事前訓練されたラベルセマンティクスをオーバーライドすることができるが、単に既存のセマンティクスバックボーンを洗練するだけなのか? ICLの動作を3つのアライメント指標(真実、事前、即時アライメント)に分解し、フリップしたセマンティックスの下で正当性として定義されたセマンティックオーバーライド率を導入する。分類タスクやオープンソース LLM のセマンティックアンカービューに対する一貫した証拠を見出す。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Can in-context learning (ICL) override pre-trained label semantics, or does it merely refine an existing semantic backbone? We address this question by treating LLMs as prompt-induced classifiers and contrasting their behavior under \emph{natural} demonstrations (with correct labels) and \emph{inverted} demonstrations (systematically flipping label meanings). We decompose ICL behavior into three alignment metrics (truth, prior, and prompt alignment) and introduce a semantic override rate, defined as correctness under flipped semantics. Across eight classification tasks and eight open-source LLMs (1--12B parameters), we find consistent evidence for a semantic anchor view. With natural demonstrations, ICL improves accuracy while maintaining strong prior alignment; most correct predictions coincide with zero-shot behavior, even when the prior is weak. With inverted demonstrations, models cannot learn coherent anti-semantic classifiers: prompt alignment increases only by sacrificing accuracy, and semantic override rates remain exactly zero in our few-shot 1--12B setting. Rather than flexibly remapping label meanings, ICL primarily adjusts how inputs project onto stable semantic directions learned during pre-training, clarifying fundamental limits of few-shot prompting and suggesting that overriding label semantics at these scales requires interventions beyond ICL. All code is available at: https://github.com/AnanthaPadmanaban-KrishnaKumar/semantic-anchors-icl.
Abstract（参考訳）: インコンテキスト学習(ICL)は、事前訓練されたラベルセマンティクスをオーバーライドできるのか、それとも単に既存のセマンティクスバックボーンを洗練させるだけなのか? 本稿では, LLM をプロンプト誘導型分類器として扱い, それらの動作を (正しいラベルを持つ) \emph{natural} 実演と \emph{inverted} 実演(システム的にラベルの意味を反転させる)で対比することで, この問題に対処する。 ICLの動作を3つのアライメント指標(真実、事前、即時アライメント)に分解し、フリップしたセマンティックスの下で正当性として定義されたセマンティックオーバーライド率を導入する。 8つの分類タスクと8つのオープンソース LLM (1--12B パラメータ) にまたがって、セマンティックアンカービューの一貫性のある証拠を見出す。自然なデモンストレーションでは、ICLは強い事前アライメントを維持しながら精度を向上させる。 Inverted demonstrationsでは、モデルはコヒーレントなアンチセマンティックな分類法を学べず、即列アライメントは精度を犠牲にしてのみ増加し、セマンティックオーバライドレートは、我々の数ショット1--12B設定で完全にゼロのままである。ラベルの意味を柔軟に書き換えるのではなく、ICLはプロジェクトの入力を事前学習中に学んだ安定した意味的な方向にどのように調整するかを考慮し、数発のプロンプトの基本的な限界を明確にし、これらのスケールでのラベルの意味をオーバーライドするにはICL以上の介入が必要であることを示唆している。すべてのコードは、https://github.com/AnanthaPadmanaban-KrishnaKumar/semantic-anchors-iclで入手できる。

論文の概要: Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels

関連論文リスト