Fugu-MT 論文翻訳(概要): CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

論文の概要: CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

arxiv url: http://arxiv.org/abs/2605.25378v2
Date: Wed, 27 May 2026 03:52:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:54.984328
Title: CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation
Title（参考訳）: CollectionLoRA: マルチTeacher on-Policy蒸留による1 LoRAの50効果収集
Authors: Fangtai Wu, Hailong Guo, Shijie Huang, Jiayi Song, Yubo Huang, Mushui Liu, Zhao Wang, Yunlong Yu, Jiaming Liu, Ruihua Huang,
Abstract要約: CollectionLoRAは、エフェクトと数ステップ生成を単一のLoRAに蒸留するフレームワークである。独立して訓練された教師モデルに匹敵する、あるいは優れた概念の忠実性を達成しながら、デプロイメントのオーバーヘッドを低減する。
参考スコア（独自算出の注目度）: 24.705092676391477
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Customized image editing aims to equip pre-trained diffusion models with specific visual effects using limited paired data, typically via Low-Rank Adaptation (LoRA). As the number of desired effects grows, storing and dynamically loading numerous these effect LoRAs significantly increases deployment overhead. Furthermore, current pipelines typically cascade these effect LoRAs with acceleration modules for fast generation, which triggers severe parameter interference and results in concept bleeding and style degradation. We propose CollectionLoRA, a multi-teacher on-policy distillation framework capable of distilling the concepts of up to 50 different effect LoRAs along with few-step generation capabilities into a single LoRA. This fundamentally resolves the feature interference issue and significantly reduces deployment costs. Specifically, the method introduces (i) a Probabilistic Dual-Stream Routing mechanism that enables the model to randomly switch between data sources during training, effectively enhancing its generalization in unseen scenarios; (ii) an Asymmetric Orthogonal Prompting strategy to achieve concept isolation within the prompt space; (iii) a Coarse-to-Fine Distillation Objective to mitigate the distribution gap between the teacher and student models. Extensive evaluations show that CollectionLoRA distills all customized effects and few-step generation into a single LoRA, reducing deployment overhead while achieving concept fidelity comparable to or better than independently trained teacher models. Code: https://github.com/Qwen-Applications/CollectionLoRA
Abstract（参考訳）: カスタマイズされた画像編集は、通常LoRA(Lo-Rank Adaptation)を介して、限られたペアデータを使用して、特定の視覚効果を持つ事前訓練された拡散モデルを装備することを目的としている。望ましいエフェクトの数が増えるにつれて、これらのエフェクトを保存および動的にロードすることで、デプロイメントのオーバーヘッドが大幅に増加する。さらに、現在のパイプラインは一般的にこれらの効果をLoRAに分解し、加速モジュールを高速に生成し、厳しいパラメータ干渉を引き起こし、概念的出血とスタイル劣化を引き起こす。最大50の異なる効果を持つロラの概念を蒸留し,数段階の発電能力を1つのロラに組み込むことが可能な多教師オンライン蒸留フレームワークであるコレクションロラを提案する。これにより、機能の干渉が根本的に解決され、デプロイメントコストが大幅に削減される。具体的には、その方法を紹介します。 (i)訓練中のデータソースをランダムに切り替える確率的デュアルストリームルーティング機構で、目に見えないシナリオにおけるその一般化を効果的に強化する。 (ii) 即時空間内で概念分離を達成するための非対称直交プロンプト戦略 (三)教師モデルと学生モデルとの分配ギャップを軽減するための粗口蒸留対象物。 CollectionLoRAは、すべてのカスタマイズされたエフェクトと数ステップ生成を単一のLoRAに蒸留し、デプロイメントのオーバーヘッドを低減し、独立したトレーニングされた教師モデルに匹敵するコンセプト忠実性を達成している。コード:https://github.com/Qwen-Applications/CollectionLoRA

論文の概要: CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

関連論文リスト