Fugu-MT Paper Translation (Abstract): Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

Paper overview: Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

arxiv url: http://arxiv.org/abs/2605.02236v1
Date: Mon, 04 May 2026 05:16:43 GMT
Status: Translation complete
Last updated in system: 2026-05-05 20:33:50.146275
Title: Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates
Authors: Pawel Kaplanski
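Abstract summary: Recursive language-model loops often settle into recognizable attractor-like patterns. We investigate how much injected text is needed to move a settled loop somewhere else, and whether that move lasts. A homogeneous-perturbation control reproduced the high-dose non-monotonic dip in destination-coherent persistence.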
Reference score (independently computed attention score): 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recursive language-model loops often settle into recognizable attractor-like patterns. The practical question is how much injected text is needed to move a settled loop somewhere else, and whether that move lasts. We study this in 30-step recursive loops by separating the model from the context-update rule: append, replace, and dialog updates expose different histories to the same generator. The main result is that persistent redirection in append-mode recursive loops is memory-policy-conditioned. Under a 12,000-character tail clip, destination-coherent persistence plateaus near 16 percent and retained source-basin escape near 36 percent at dose 400; neither crosses 50 percent. Under a full-history protocol, retained source-basin escape crosses 50 percent near 400 tokens and saturates at 75-80 percent by 1,500 tokens, while destination-coherent persistence first reaches 0.50 near 1,500 tokens with a Wilson 95 percent CI of [0.41, 0.61]. For raw switching, adversarial continuations yield an ED50 near 40 tokens, with paired-control floors near 35 percent and net switching never reaching +50 percentage points within 5-400 tokens. Replace-mode raw switching is near-saturated but largely reflects state-reset overwrite: insert-mode probes drop it to 12-32 percent. A homogeneous-perturbation control reproduced the high-dose non-monotonic dip in destination-coherent persistence, refuting perturbation heterogeneity as the cause; the dip appears structural, with mechanism unresolved. We report 37 experiments on gpt-4o-mini with within-vendor replication on gpt-4.1-nano. Recursive-loop evaluations should distinguish transient movement from durable escape, subtract stochastic floors, and treat context-update rules as first-class safety-relevant design choices.
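The abstract reports a Wilson 95 percent confidence interval and recommends subtracting stochastic (paired-control) floors from raw switching rates. The following is a minimal sketch of both calculations, not the author's code. The function names `wilson_interval` and `net_switching` are hypothetical, and the counts in the usage lines are illustrative placeholders rather than data from the paper.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion.

    z = 1.96 corresponds to an approximately 95 percent interval.
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    p_hat = successes / trials
    z2 = z * z
    denom = 1.0 + z2 / trials
    center = (p_hat + z2 / (2 * trials)) / denom
    half_width = (
        z * math.sqrt(p_hat * (1.0 - p_hat) / trials + z2 / (4 * trials * trials)) / denom
    )
    return center - half_width, center + half_width


def net_switching(perturbed_rate: float, control_rate: float) -> float:
    """Raw switching rate minus the paired-control floor, in percentage points."""
    return 100.0 * (perturbed_rate - control_rate)


# Illustrative numbers only (assumed, not taken from the paper).
low, high = wilson_interval(successes=50, trials=100)
print(f"Wilson 95% CI: [{low:.2f}, {high:.2f}]")
print(f"Net switching: {net_switching(0.60, 0.35):+.1f} percentage points")
```

Reporting the net figure alongside the raw rate makes it visible when a seemingly large switching rate is mostly explained by run-to-run variation in the unperturbed loop.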
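The abstract separates the generator from the context-update rule and contrasts a 12,000-character tail clip with a full-history protocol. The sketch below illustrates that separation for the append and replace rules only; the dialog rule is omitted because the abstract does not specify its format. Everything here is an assumption made for illustration: the names `append_update`, `replace_update`, `run_loop`, and `toy_generate`, the newline separator, and the way injected text is added to the context are hypothetical and are not the paper's implementation.

```python
from functools import partial
from typing import Callable, Optional

Generator = Callable[[str], str]        # stand-in for an LLM call (assumed)
UpdateRule = Callable[[str, str], str]  # (context, output) -> new context


def append_update(context: str, output: str, tail_clip: Optional[int] = None) -> str:
    """Append the output to the history; optionally keep only the last tail_clip characters."""
    new_context = context + "\n" + output
    if tail_clip is not None:
        new_context = new_context[-tail_clip:] if tail_clip > 0 else ""
    return new_context


def replace_update(context: str, output: str) -> str:
    """Discard the history; the latest output becomes the entire context."""
    return output


def run_loop(
    generate: Generator,
    seed: str,
    update: UpdateRule,
    steps: int = 30,
    inject_at: Optional[int] = None,
    inject_text: str = "",
) -> tuple[str, list[str]]:
    """Run a recursive loop, optionally injecting text before one step."""
    context = seed
    outputs: list[str] = []
    for step in range(steps):
        if inject_at is not None and step == inject_at:
            # Assumption: the perturbation is appended to the current context.
            context = context + "\n" + inject_text
        output = generate(context)
        outputs.append(output)
        context = update(context, output)
    return context, outputs


def toy_generate(context: str) -> str:
    """Deterministic toy generator used in place of a real model."""
    return f"seen {len(context)} chars"


clipped_ctx, _ = run_loop(toy_generate, "seed", partial(append_update, tail_clip=12_000))
full_ctx, _ = run_loop(toy_generate, "seed", append_update)
replaced_ctx, replaced_outputs = run_loop(toy_generate, "seed", replace_update)
```

Passing the update rule as a parameter keeps the generator fixed while the exposed history changes, which is the comparison the abstract describes.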


Related papers