Fugu-MT Paper Translation (Abstract): A Data-Informed Variational Clustering Framework for Noisy High-Dimensional Data

Paper abstract: A Data-Informed Variational Clustering Framework for Noisy High-Dimensional Data

arxiv url: http://arxiv.org/abs/2604.06864v1
Date: Wed, 08 Apr 2026 09:25:44 GMT
Status: Translation complete
System update date: 2026-04-09 17:30:51.450974
Title: A Data-Informed Variational Clustering Framework for Noisy High-Dimensional Data
Title (reference translation): A Data-Informed Variational Clustering Framework for Noisy High-Dimensional Data
Authors: Wan Ping Chen
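Abstract summary: DIVI is a data-informed variational clustering framework that combines global feature gating with split-based adaptive structure growth. Empirically, DIVI performs competitively under severe feature noise, remains computationally feasible, and yields interpretable feature-gating behavior. Overall, DIVI is best viewed as a practical variational clustering framework for noisy high-dimensional data rather than as a fully Bayesian generative solution.
Reference score (independently computed attention score): 0.0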
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Clustering in high-dimensional settings with severe feature noise remains challenging, especially when only a small subset of dimensions is informative and the final number of clusters is not specified in advance. In such regimes, partition recovery, feature relevance learning, and structural adaptation are tightly coupled, and standard likelihood-based methods can become unstable or overly sensitive to noisy dimensions. We propose DIVI, a data-informed variational clustering framework that combines global feature gating with split-based adaptive structure growth. DIVI uses informative prior initialization to stabilize optimization, learns feature relevance in a differentiable manner, and expands model complexity only when local diagnostics indicate underfit. Beyond clustering performance, we also examine runtime scalability and parameter sensitivity in order to clarify the computational and practical behavior of the framework. Empirically, we find that DIVI performs competitively under severe feature noise, remains computationally feasible, and yields interpretable feature-gating behavior, while also exhibiting conservative growth and identifiable failure regimes in challenging settings. Overall, DIVI is best viewed as a practical variational clustering framework for noisy high-dimensional data rather than as a fully Bayesian generative solution.
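Abstract (reference translation): Clustering in high-dimensional settings with severe feature noise remains difficult, particularly when only a small subset of dimensions is informative and the final number of clusters is not specified in advance. Under these conditions, partition recovery, feature relevance learning, and structural adaptation are tightly coupled, and standard likelihood-based methods can become unstable or overly sensitive to noisy dimensions. We propose DIVI, a data-informed variational clustering framework that combines global feature gating with split-based adaptive structure growth. DIVI uses informative prior initialization to stabilize optimization, learns feature relevance in a differentiable manner, and expands model complexity only when local diagnostics indicate underfitting. In addition to clustering performance, we examine runtime scalability and parameter sensitivity to clarify the computational and practical behavior of the framework. Empirically, DIVI performs competitively under severe feature noise, remains computationally feasible, and yields interpretable feature-gating behavior, while also showing conservative growth and identifiable failure regimes in challenging settings. Overall, DIVI is best viewed as a practical variational clustering framework for noisy high-dimensional data rather than a fully Bayesian generative solution.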

Related papers