Fugu-MT 論文翻訳(概要): VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

論文の概要: VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

arxiv url: http://arxiv.org/abs/2604.25334v1
Date: Tue, 28 Apr 2026 07:50:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-29 16:49:17.766262
Title: VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification
Title（参考訳）: VAE-Inf:不均衡分類のための統計的解釈可能な生成パラダイム
Authors: Hongfei Wu, Ruijian Han, Yancheng Yuan,
Abstract要約: 生成的モデリングと識別的分類のギャップを埋める2段階の枠組みを提案する。推論のために、自然な仮説テストの解釈を受け入れるプロジェクションベースのスコアを導入する。様々な実世界のベンチマークの実験は、我々のフレームワークが他のアプローチと競合する性能を達成していることを示している。
参考スコア（独自算出の注目度）: 8.677199689027772
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Imbalanced classification remains a pervasive challenge in machine learning, particularly when minority samples are too scarce to provide a robust discriminative boundary. In such extreme scenarios, conventional models often suffer from unstable decision boundaries and a lack of reliable error control. To bridge the gap between generative modeling and discriminative classification, we propose a two-stage framework \textbf{VAE-Inf} that integrates deep representation learning with statistically interpretable hypothesis testing. In the first stage, we adopt a one-class modeling perspective by training a variational autoencoder (VAE) exclusively on majority-class data to capture the underlying reference distribution. The resulting latent posteriors are aggregated via a Wasserstein barycenter to construct a global Gaussian reference model, providing a geometrically principled baseline for the majority class. In the second stage, we transform this generative foundation into a discriminative classifier by fine-tuning the encoder with limited minority samples. This is achieved through a novel distribution-aware loss that enforces probabilistic separation between classes based on variance-normalized projection statistics. For inference, we introduce a projection-based score that admits a natural hypothesis testing interpretation, allowing for a distribution-free calibration procedure. This approach yields exact finite-sample control of the Type-I error (false positive rate) without relying on restrictive parametric assumptions. Extensive experiments on diverse real-world benchmarks demonstrate that our framework achieves competitive performance against other approaches. The codes are available upon request.
Abstract（参考訳）: 不均衡な分類は、マシンラーニングにおいて、特に少数派のサンプルが不足しているため、堅牢な差別的境界を提供する場合において、広く普及する課題である。このような極端なシナリオでは、従来のモデルは不安定な決定境界と信頼性のあるエラー制御の欠如に悩まされることが多い。生成的モデリングと識別的分類のギャップを埋めるために、深層表現学習と統計的に解釈可能な仮説テストを統合する2段階のフレームワーク「textbf{VAE-Inf}」を提案する。第一段階では、基礎となる参照分布を捉えるために、多数クラスデータにのみ依存する変分オートエンコーダ(VAE)を訓練することにより、一クラスモデリングの視点を採用する。得られた潜在後部はワッサーシュタイン・バリセンタを介して集約され、大域ガウス参照モデルを構築し、多元類に対する幾何学的原理化されたベースラインを提供する。第2段階では、この生成基盤を、限られた少数サンプルを用いてエンコーダを微調整することにより、識別的分類器に変換する。これは分散正規化射影統計に基づくクラス間の確率的分離を強制する分布認識損失によって達成される。推論のために、自然な仮説テスト解釈を許容するプロジェクションベースのスコアを導入し、分布のない校正手順を可能にする。このアプローチは、制限的なパラメトリック仮定に頼ることなく、Type-I誤差(偽陽性率)を正確に有限サンプル制御する。多様な実世界のベンチマークに関する大規模な実験は、我々のフレームワークが他のアプローチと競合する性能を達成していることを示している。コードは要求に応じて利用可能だ。

論文の概要: VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

関連論文リスト