Fugu-MT 論文翻訳(概要): Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

論文の概要: Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

arxiv url: http://arxiv.org/abs/2605.02236v2
Date: Tue, 05 May 2026 08:03:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-06 14:45:21.330759
Title: Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates
Title（参考訳）: 再帰的LLMループにおける摂動線量応答:順応・置換・ダイアログ更新時の生スイッチング, 確率床, 永続エスケープ
Authors: Pawel Kaplanski,
Abstract要約: 我々は、他のどこかで落ち着いたループを動かすのに、注入されたテキストがどれだけ必要か、そしてそれが継続するかどうかを調査する。 12,000文字の尾クリップの下では、目的地とコヒーレントな持続性台地が16%近く、ソースとバスンの脱出は36%近くで400。 4段階のファルシフィケーション・バッテリは、高線量のデスティネーションコヒーレントディップを有限水平、エンドポイント定義感度の特徴として再キャストする。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recursive language-model loops often settle into recognizable attractor-like patterns. The practical question is how much injected text is needed to move a settled loop somewhere else, and whether that move lasts. We study this in 30-step recursive loops by separating the model from the context-update rule: append, replace, and dialog updates expose different histories to the same generator. The main result is that persistent redirection in append-mode recursive loops is memory-policy-conditioned. Under a 12,000-character tail clip, destination-coherent persistence plateaus near 16 percent and retained source-basin escape near 36 percent at dose 400; neither crosses 50 percent. Under a full-history protocol, retained source-basin escape crosses 50 percent near 400 tokens and saturates at 75-80 percent by 1,500 tokens; destination-coherent persistence first reaches 0.50 near 1,500 tokens (Wilson 95 percent CI [0.41, 0.61]). A four-step falsification battery (heterogeneity control, granularity sweep with hierarchical macro-merge, transition-entropy diagnostic, and long-horizon trajectory continuation) recasts the high-dose destination-coherent dip as a finite-horizon, endpoint-definition-sensitive feature rather than a stable structural asymmetry. Half the canonical magnitude is endpoint timing; the residual drops 73 percent from -0.143 at step 29 to -0.039 at step 79 under the frozen canonical cluster basis, bootstrap interval straddling zero. Replace-mode raw switching is near-saturated under the default protocol but largely reflects state-reset overwrite: insert-mode probes drop it to 12-32 percent. We report 37 experiments on gpt-4o-mini with within-vendor replication on gpt-4.1-nano. Recursive-loop evaluations should distinguish transient movement from durable escape, subtract stochastic floors, and treat context-update rules as safety-relevant design choices.
Abstract（参考訳）: 再帰的な言語モデルループは、しばしば認識可能なアトラクションのようなパターンに落ち着く。現実的な疑問は、落ち着いたループを他のどこかに移すのに、どの程度のインジェクトテキストが必要か、そしてそれが継続するかどうかである。これを30ステップの再帰ループで研究し、モデルとコンテキスト更新ルールを区別する:追加、置換、ダイアログ更新は、異なる履歴を同じジェネレータに公開する。主な結果は、追加モードの再帰ループにおける永続的なリダイレクトは、メモリポリシー条件である。 12,000文字の尾クリップの下では、目的地のコヒーレントな持続性台地は16%近くあり、ソース・バスンの逃避は400回で36%近くあり、どちらも50%を超えない。フルヒストリープロトコルの下では、ソースベースエスケープは400トークン近くで50%を超え、1500トークンで75～80%飽和し、先導コヒーレント永続性は1500トークン付近で最初に0.50に達する(Wilson 95% CI [0.41, 0.61])。 4段階のファルシフィケーション電池(階層的なマクロマージ、トランジッションエントロピー診断、ロングホライゾン軌道継続による粒状化)は、安定な構造非対称性ではなく、有限ホライゾン、終端ディップとして高密度のデスティネーションコヒーレントディップをリキャストする。残りはステップ29で-0.143からステップ79で-0.039に減少し、ブートストラップ間隔はゼロになる。置換モードの生のスイッチングはデフォルトのプロトコルではほぼ飽和しているが、状態リセットのオーバーライトを反映している。我々は gpt-4.1-nano 上での gpt-4o-mini の再現実験を37回報告した。 Recursive-loop評価は、永続的エスケープからの過渡移動を区別し、確率的フロアを抽出し、コンテキスト更新ルールを安全関連設計選択として扱うべきである。

論文の概要: Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

関連論文リスト