Fugu-MT 論文翻訳(概要): Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping

論文の概要: Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping

arxiv url: http://arxiv.org/abs/2603.06923v1
Date: Fri, 06 Mar 2026 22:51:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:13.407151
Title: Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping
Title（参考訳）: メカニズムの改革:回路整形によるLCMの共振パターンの編集
Authors: Zhenyu Lei, Qiong Wu, Jianxiong Dong, Yinhan He, Emily Dodwell, Yushun Dong, Jundong Li,
Abstract要約: 大規模言語モデル(LLM)は、信頼性を損なう欠点のある推論能力を示すことが多い。推論を改善する既存のアプローチは一般的に、一般的なモノリシックなスキルとして扱い、非効率で特定の推論エラーをターゲットできない広範囲なトレーニングを適用する。本稿では,LLMにおける特定の推論パターンを選択的に修正し,他の推論経路を保存するためのパラダイムであるReasoning Editingを紹介する。
参考スコア（独自算出の注目度）: 51.33479921152361
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) often exhibit flawed reasoning ability that undermines reliability. Existing approaches to improving reasoning typically treat it as a general and monolithic skill, applying broad training which is inefficient and unable to target specific reasoning errors. We introduce Reasoning Editing, a paradigm for selectively modifying specific reasoning patterns in LLMs while preserving other reasoning pathways. This task presents a fundamental trade-off between Generality, the ability of an edit to generalize across different tasks sharing the same reasoning pattern, and Locality, the ability to preserve other reasoning capabilities. Through systematic investigation, we uncover the Circuit-Interference Law: Edit interference between reasoning patterns is proportional to the overlap of their neural circuits. Guided by this principle, we propose REdit, the first framework to actively reshape neural circuits before editing, thereby modulating interference between reasoning patterns and mitigating the trade-off. REdit integrates three components: (i) Contrastive Circuit Reshaping, which directly addresses the generality-locality trade-off by disentangling overlapping circuits; (ii) Meta-Contrastive Learning, which extends transferability to novel reasoning patterns; and (iii) Dual-Level Protection, which preserves preexisting abilities by constraining reshaping update directions and regularizing task-level predictions. Extensive experiments with Qwen-2.5-3B on propositional logic reasoning tasks across three difficulty levels demonstrate that REdit consistently achieves superior generality and locality compared to baselines, with additional validation in mathematics showing broader potential. Our code is available at https://github.com/LzyFischer/REdit.
Abstract（参考訳）: 大規模言語モデル(LLM)は、信頼性を損なう欠点のある推論能力を示すことが多い。推論を改善する既存のアプローチは一般的に、一般的なモノリシックなスキルとして扱い、非効率で特定の推論エラーをターゲットできない広範囲なトレーニングを適用する。本稿では,LLMにおける特定の推論パターンを選択的に修正し,他の推論経路を保存するためのパラダイムであるReasoning Editingを紹介する。このタスクは、一般性、同一の推論パターンを共有する異なるタスクをまたがる編集能力、および他の推論能力を保持する機能である局所性との根本的なトレードオフを示す。推論パターン間の干渉の編集は、ニューラルネットワークの重複に比例する。この原理で導かれたREditは、編集前に神経回路を積極的に再構築し、推論パターン間の干渉を調整し、トレードオフを緩和する最初のフレームワークである。 REditは3つのコンポーネントを統合する。一重複回路を混在させることにより、一般性-局所性トレードオフに直接対処するコントラスト回路の整形 (ii)メタコントラスト学習(メタコントラスト学習) 三二つのレベル保護更新方向の調整及びタスクレベルの予測の調整により、既存の能力を維持すること。 Qwen-2.5-3B による3つの難解レベルにわたる命題論理推論のタスクに関する広範な実験は、REdit がベースラインよりも優れた一般化と局所性を一貫して達成し、より広い可能性を示す数学のさらなる検証が示されることを示した。私たちのコードはhttps://github.com/LzyFischer/REditで公開されています。

論文の概要: Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping

関連論文リスト