Fugu-MT 論文翻訳(概要): Orthogonal Concept Erasure for Diffusion Models

論文の概要: Orthogonal Concept Erasure for Diffusion Models

arxiv url: http://arxiv.org/abs/2605.28902v1
Date: Wed, 27 May 2026 15:58:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-30 02:45:55.039224
Title: Orthogonal Concept Erasure for Diffusion Models
Title（参考訳）: 拡散モデルにおける直交的概念消去
Authors: Yuhao Sun, Lingyun Yu, Haoxiang Xu, Fengyuan Miao, Zhuoer Xu, Hongtao Xie,
Abstract要約: 概念消去は、拡散モデルにおける望ましくない、あるいは安全でないコンテンツを緩和するための有望なアプローチとして現れている。我々は,編集に基づく消去を乗算パラメータ更新として再構成する直交概念消去(OCE)を提案する。 OCEは、パラメータに閉形式解から導出される層準変換を適用し、正確な概念消去を可能にする。
参考スコア（独自算出の注目度）: 46.450614977061484
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations. While training-based methods are effective, their high computational cost limits scalability. Editing-based methods are more efficient and deployment-friendly, yet they struggle to simultaneously achieve precise concept erasure and preserve overall generative capacity. We identify this core limitation of the editing-based methods as reliance on additive parameter updates. Our empirical analysis reveals that concept semantics primarily depend on neuron direction rather than neuron magnitude, while overall generative capacity relies on the angular geometry of neurons. As additive updates inherently entangle direction, magnitude, and angular geometry, they inevitably introduce unintended interference between concept erasure and overall generation performance. To address this, we propose Orthogonal Concept Erasure (OCE), which reformulates editing-based erasure as multiplicative parameter updates from a geometric perspective. Specifically, OCE applies layer-wise orthogonal transformations derived from a closed-form solution to the parameters, enabling precise concept erasure while preserving the neuron magnitude and angular geometry. Furthermore, to address conflicting constraints in multi-concept erasure, OCE introduces a subspace-level objective with structured subspace manipulation, yielding a more effective and scalable erasure. Extensive experiments on single- and multi-concept erasure demonstrate that OCE outperforms existing methods in concept erasure and non-target preservation, erasing up to 100 concepts in 4.3 s. Code: https://github.com/HansSunY/OCE.
Abstract（参考訳）: 概念消去は、拡散モデルにおける望ましくない、あるいは安全でないコンテンツを緩和するための有望なアプローチとして現れてきたが、既存の手法は依然として重大な制限に直面している。トレーニングベースの手法は有効であるが、その高い計算コストはスケーラビリティを制限している。編集ベースの手法はより効率的でデプロイしやすいが、正確な概念の消去を同時に達成し、全体の生成能力を維持するのに苦労している。我々は、この編集ベースのメソッドのコア制限を、追加パラメータの更新に依存するものとして識別する。我々の経験的分析によると、概念意味論は主にニューロンの大きさよりもニューロンの方向に依存しているのに対し、全体的な生成能力はニューロンの角形状に依存している。加法的更新は本質的に方向、大きさ、角形状が絡み合うため、必然的に概念の消去と全体的な生成性能の間に意図しない干渉をもたらす。そこで我々は,幾何的な視点から,編集に基づく消去を乗算的パラメータ更新として再構成する直交概念消去(OCE)を提案する。具体的には、OCEは閉形式解から導かれる層状直交変換をパラメータに適用し、ニューロンの大きさと角形状を保ちながら正確な概念消去を可能にする。さらに、マルチコンセプト消去における競合する制約に対処するため、OCEは構造化されたサブスペース操作によるサブスペースレベルの目的を導入し、より効率的でスケーラブルな消去を実現する。シングルコンセプトとマルチコンセプトの消去に関する大規模な実験により、OCEは概念消去と非ターゲット保存において既存の手法よりも優れており、4.3秒で最大100のコンセプトが消去されることを示した。コード:https://github.com/HansSunY/OCE

論文の概要: Orthogonal Concept Erasure for Diffusion Models

関連論文リスト