Fugu-MT 論文翻訳(概要): Contrastive Entropy Bounds for Density and Conditional Density Decomposition

論文の概要: Contrastive Entropy Bounds for Density and Conditional Density Decomposition

arxiv url: http://arxiv.org/abs/2511.12903v1
Date: Mon, 17 Nov 2025 02:49:08 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-18 14:36:24.621612
Title: Contrastive Entropy Bounds for Density and Conditional Density Decomposition
Title（参考訳）: 密度・条件密度分解のためのコントラストエントロピー境界
Authors: Bo Hu, Jose C. Principe,
Abstract要約: ヒルベルト空間と分解を用いて、多重出力ネットワークがガウス混合を定義する複数の中心を生成する場合に対処する。本稿では,デコーダを複数出力するエンコーダ/ミキサー/デコーダアーキテクチャを提案する。
参考スコア（独自算出の注目度）: 6.697324698653934
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper studies the interpretability of neural network features from a Bayesian Gaussian view, where optimizing a cost is reaching a probabilistic bound; learning a model approximates a density that makes the bound tight and the cost optimal, often with a Gaussian mixture density. The two examples are Mixture Density Networks (MDNs) using the bound for the marginal and autoencoders using the conditional bound. It is a known result, not only for autoencoders, that minimizing the error between inputs and outputs maximizes the dependence between inputs and the middle. We use Hilbert space and decomposition to address cases where a multiple-output network produces multiple centers defining a Gaussian mixture. Our first finding is that an autoencoder's objective is equivalent to maximizing the trace of a Gaussian operator, the sum of eigenvalues under bases orthonormal w.r.t. the data and model distributions. This suggests that, when a one-to-one correspondence as needed in autoencoders is unnecessary, we can instead maximize the nuclear norm of this operator, the sum of singular values, to maximize overall rank rather than trace. Thus the trace of a Gaussian operator can be used to train autoencoders, and its nuclear norm can be used as divergence to train MDNs. Our second test uses inner products and norms in a Hilbert space to define bounds and costs. Such bounds often have an extra norm compared to KL-based bounds, which increases sample diversity and prevents the trivial solution where a multiple-output network produces the same constant, at the cost of requiring a sample batch to estimate and optimize. We propose an encoder-mixture-decoder architecture whose decoder is multiple-output, producing multiple centers per sample, potentially tightening the bound. Assuming the data are small-variance Gaussian mixtures, this upper bound can be tracked and analyzed quantitatively.
Abstract（参考訳）: 本稿では,ニューラルネットワークの特徴の解釈可能性についてベイジアン・ガウスの視点から検討する。そこでは,コストの最適化が確率的境界に達している。モデル学習は,境界をきつくし,コストを最適にする密度を,しばしばガウス混合密度で近似する。 2つの例はMixture Density Networks (MDN) で、境界境界と条件境界を用いたオートエンコーダのバウンダを使用する。これはオートエンコーダだけでなく、入力と出力の間の誤差を最小限に抑えることで、入力と中央間の依存性を最大化する、既知の結果である。ヒルベルト空間と分解を用いて、多重出力ネットワークがガウス混合を定義する複数の中心を生成する場合に対処する。我々の最初の発見は、オートエンコーダの目的がガウス作用素のトレースの最大化、ベースの下での固有値の和、データとモデル分布の正規化である。これは、オートエンコーダで必要となる1対1対応が不要である場合、代わりにこの作用素の核ノルム(特異値の和)を最大化し、トレースよりも全体的なランクを最大化することができることを示唆している。したがって、ガウス作用素のトレースはオートエンコーダの訓練に利用することができ、その核ノルムはMDNの訓練の分岐として利用することができる。 2つ目のテストでは、ヒルベルト空間の内積とノルムを使って境界とコストを定義する。このような境界は、サンプルの多様性を高め、複数の出力ネットワークが同じ定数を生成するような自明な解を、推定と最適化のためにサンプルバッチを必要とするコストで防止するKLベースの境界よりも、余分なノルムを持つことが多い。本稿では,デコーダを複数出力するエンコーダ/ミキサー/デコーダアーキテクチャを提案する。データが小分散ガウス混合であると仮定すると、この上限は追跡され、定量的に分析される。

論文の概要: Contrastive Entropy Bounds for Density and Conditional Density Decomposition

関連論文リスト