Fugu-MT 論文翻訳(概要): Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling

論文の概要: Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling

arxiv url: http://arxiv.org/abs/2510.02499v1
Date: Thu, 02 Oct 2025 19:06:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-06 16:35:52.142146
Title: Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling
Title（参考訳）: 線形拡散を超えて:希少条件生成モデリングのための表現の改善
Authors: Kulunu Dharmakeerthi, Yousef El-Laham, Henry H. Wong, Vamsi K. Potluru, Changhong He, Taosong He,
Abstract要約: 条件空間の低確率領域において、スコアベース生成モデル学習のサンプル複雑さが小さくなるように、データ表現とフォワードスキームを適応させることが可能であることを示す。非線形ドリフト項のデータ駆動的選択による拡散が,データの適切な表現の下での尾イベントのモデル化に最も適していることを示す。
参考スコア（独自算出の注目度）: 4.527435625329663
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion models have emerged as powerful generative frameworks with widespread applications across machine learning and artificial intelligence systems. While current research has predominantly focused on linear diffusions, these approaches can face significant challenges when modeling a conditional distribution, $P(Y|X=x)$, when $P(X=x)$ is small. In these regions, few samples, if any, are available for training, thus modeling the corresponding conditional density may be difficult. Recognizing this, we show it is possible to adapt the data representation and forward scheme so that the sample complexity of learning a score-based generative model is small in low probability regions of the conditioning space. Drawing inspiration from conditional extreme value theory we characterize this method precisely in the special case in the tail regions of the conditioning variable, $X$. We show how diffusion with a data-driven choice of nonlinear drift term is best suited to model tail events under an appropriate representation of the data. Through empirical validation on two synthetic datasets and a real-world financial dataset, we demonstrate that our tail-adaptive approach significantly outperforms standard diffusion models in accurately capturing response distributions at the extreme tail conditions.
Abstract（参考訳）: 拡散モデルは、機械学習や人工知能システムにまたがって広く応用される強力な生成フレームワークとして登場した。現在の研究は主に線形拡散に焦点が当てられているが、これらの手法は条件分布をモデル化する際には、$P(Y|X=x)$、$P(X=x)$が小さい場合には、大きな課題に直面することがある。これらの領域では、トレーニングに利用できるサンプルが少ないため、対応する条件密度をモデル化することは困難である。これを認識し、条件空間の低確率領域において、スコアベースの生成モデルを学ぶ際のサンプルの複雑さが小さくなるように、データ表現とフォワードスキームを適応させることが可能であることを示す。条件付き極値理論からインスピレーションを得て、条件付き変数の尾領域の特別な場合、$X$を正確に特徴付ける。非線形ドリフト項のデータ駆動的選択による拡散が,データの適切な表現の下での尾イベントのモデル化に最も適していることを示す。 2つの合成データセットと実世界の財務データセットに対する実証的検証を通じて、我々のテール適応アプローチは、極端尾条件での応答分布を正確に把握する上で、標準拡散モデルを大幅に上回っていることを示す。

論文の概要: Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling

関連論文リスト