Fugu-MT 論文翻訳(概要): Flow Matching with Semidiscrete Couplings

論文の概要: Flow Matching with Semidiscrete Couplings

arxiv url: http://arxiv.org/abs/2509.25519v1
Date: Mon, 29 Sep 2025 21:22:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 17:09:04.329588
Title: Flow Matching with Semidiscrete Couplings
Title（参考訳）: 半離散結合による流れのマッチング
Authors: Alireza Mousavi-Hosseini, Stephen Y. Zhang, Michal Klein, Marco Cuturi,
Abstract要約: 時間依存速度場としてパラメータ化された流れモデルは、ODEを統合することでノイズからデータを生成することができる。これらのモデルは、しばしばフローマッチング、すなわちランダムなノイズ対とターゲットポイントをサンプリングすることによって訓練される。理論上は有望ではあるが、OT-FM(OT flow matching)アプローチは実際には広く使われていない。
参考スコア（独自算出の注目度）: 29.980130720112726
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Flow models parameterized as time-dependent velocity fields can generate data from noise by integrating an ODE. These models are often trained using flow matching, i.e. by sampling random pairs of noise and target points $(\mathbf{x}_0,\mathbf{x}_1)$ and ensuring that the velocity field is aligned, on average, with $\mathbf{x}_1-\mathbf{x}_0$ when evaluated along a segment linking $\mathbf{x}_0$ to $\mathbf{x}_1$. While these pairs are sampled independently by default, they can also be selected more carefully by matching batches of $n$ noise to $n$ target points using an optimal transport (OT) solver. Although promising in theory, the OT flow matching (OT-FM) approach is not widely used in practice. Zhang et al. (2025) pointed out recently that OT-FM truly starts paying off when the batch size $n$ grows significantly, which only a multi-GPU implementation of the Sinkhorn algorithm can handle. Unfortunately, the costs of running Sinkhorn can quickly balloon, requiring $O(n^2/\varepsilon^2)$ operations for every $n$ pairs used to fit the velocity field, where $\varepsilon$ is a regularization parameter that should be typically small to yield better results. To fulfill the theoretical promises of OT-FM, we propose to move away from batch-OT and rely instead on a semidiscrete formulation that leverages the fact that the target dataset distribution is usually of finite size $N$. The SD-OT problem is solved by estimating a dual potential vector using SGD; using that vector, freshly sampled noise vectors at train time can then be matched with data points at the cost of a maximum inner product search (MIPS). Semidiscrete FM (SD-FM) removes the quadratic dependency on $n/\varepsilon$ that bottlenecks OT-FM. SD-FM beats both FM and OT-FM on all training metrics and inference budget constraints, across multiple datasets, on unconditional/conditional generation, or when using mean-flow models.
Abstract（参考訳）: 時間依存速度場としてパラメータ化された流れモデルは、ODEを統合することでノイズからデータを生成することができる。これらのモデルは、例えば、ランダムなノイズ対とターゲットポイントの$(\mathbf{x}_0,\mathbf{x}_1)$をサンプリングし、速度場が平均的に、$\mathbf{x}_1-\mathbf{x}_0$と、$\mathbf{x}_0$を$\mathbf{x}_1$にリンクすることで評価される。これらのペアはデフォルトでは独立してサンプリングされるが、最適なトランスポート(OT)ソルバを使用して、$n$ノイズと$n$ターゲットポイントのバッチをマッチングすることで、より慎重に選択することもできる。理論上は有望ではあるが、OT-FM(OT flow matching)アプローチは実際には広く使われていない。 Zhang et al (2025) は先日,バッチサイズが$n$が大幅に大きくなると OT-FM が真に支払を開始し,Sinkhorn アルゴリズムのマルチGPU実装のみが処理可能であることを指摘した。残念ながら、Sinkhornの実行コストは急速に変動し、速度場に合うために使われる$n$ペアごとに$O(n^2/\varepsilon^2)$演算が必要になる。 OT-FMの理論的約束を満たすために、我々はバッチOTから離れ、ターゲットデータセットの分布が通常有限サイズ$N$であるという事実を活用する半離散的な定式化に頼ることを提案する。 SD-OT問題は、SGDを用いて双対ポテンシャルベクトルを推定することにより解決され、そのベクトルを用いて、列車時に新しくサンプリングされたノイズベクトルを最大内部積探索(MIPS)のコストでデータポイントとマッチングすることができる。半離散FM(SD-FM)は、OT-FMのボトルネックとなる$n/\varepsilon$の二次的依存を取り除く。 SD-FMは、トレーニングメトリクスと推論予算の制約、複数のデータセット、無条件/条件生成、あるいは平均フローモデルを使用する場合の両方でFMとOT-FMを上回ります。

論文の概要: Flow Matching with Semidiscrete Couplings

関連論文リスト