Fugu-MT Paper Translation (Abstract): InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching

Paper overview: InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching

arxiv url: http://arxiv.org/abs/2603.20303v1
Date: Thu, 19 Mar 2026 11:07:14 GMT
Status: Translation complete
Last updated in system: 2026-03-24 19:11:38.822261
Title: InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching
Title (reference translation): InjectionFlow: A Weakness Guide via Orthogonal Injection for Flow Matching
Authors: Dayu Wang, Jiaye Yang, Weikang Li, Jiahui Liang, Yang Li,
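Abstract summary: Flow Matching (FM) has emerged as a leading approach for high-fidelity visual generation. FM models are sensitive to dataset biases, which cause severe semantic degradation when generating out-of-distribution or minority-class samples. InjectFlow is a novel, training-free method that injects semantics during the initial velocity field computation.
Reference score (independently computed attention measure): 1.974921946982281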
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Flow Matching (FM) has recently emerged as a leading approach for high-fidelity visual generation, offering a robust continuous-time alternative to ordinary differential equation (ODE) based models. However, despite their success, FM models are highly sensitive to dataset biases, which cause severe semantic degradation when generating out-of-distribution or minority-class samples. In this paper, we provide a rigorous mathematical formalization of the "Bias Manifold" within the FM framework. We identify that this performance drop is driven by conditional expectation smoothing, a mechanism that inevitably leads to trajectory lock-in during inference. To resolve this, we introduce InjectFlow, a novel, training-free method that injects orthogonal semantics during the initial velocity field computation, without requiring any changes to the random seeds. This design effectively prevents the latent drift toward majority modes while maintaining high generative quality. Extensive experiments demonstrate the effectiveness of our approach. Notably, on the GenEval dataset, InjectFlow successfully fixes 75% of the prompts that standard flow matching models fail to generate correctly. Ultimately, our theoretical analysis and algorithm provide a ready-to-use solution for building fairer and more robust visual foundation models.
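Abstract (reference translation): Flow Matching (FM) has recently emerged as a leading approach for high-fidelity visual generation, offering a robust continuous-time alternative to ordinary differential equation (ODE) models. However, despite this success, FM models are highly sensitive to dataset biases, which cause severe semantic degradation when generating out-of-distribution or minority-class samples. This paper gives a rigorous mathematical formalization of the "Bias Manifold" within the FM framework. The performance drop is driven by conditional expectation smoothing, a mechanism that inevitably leads to trajectory lock-in during inference. To address this, we propose InjectFlow, a novel, training-free method that injects orthogonal semantics during the initial velocity field computation without requiring any change to the random seeds. This design effectively prevents latent drift toward majority modes while maintaining high generative quality. Extensive experiments demonstrate the effectiveness of our approach. Notably, on the GenEval dataset, InjectFlow successfully fixes 75% of the prompts that standard flow matching models fail to generate correctly. Ultimately, our theoretical analysis and algorithm provide a ready-to-use solution for building fairer and more robust visual foundation models.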

Related papers